Search Sciweavers | Sciweavers

3718 search results - page 132 / 744

» On learning with dissimilarity functions

click to vote

ML
2002
ACM

154views Machine Learning» more ML 2002»

Technical Update: Least-Squares Temporal Difference Learning

14 years 9 months ago

Download www.research.rutgers.edu

TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

claim paper

Read More »

100

click to vote

ICDM
2005
IEEE

185views Data Mining» more ICDM 2005»

Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping

15 years 3 months ago

Download userweb.cs.utexas.edu

The problem of record linkage focuses on determining whether two object descriptions refer to the same underlying entity. Addressing this problem effectively has many practical ap...

Mikhail Bilenko, Sugato Basu, Mehran Sahami

claim paper

Read More »

click to vote

NIPS
2007

143views Information Technology» more NIPS 2007»

A Game-Theoretic Approach to Apprenticeship Learning

14 years 11 months ago

Download books.nips.cc

We study the problem of an apprentice learning to behave in an environment with an unknown reward function by observing the behavior of an expert. We follow on the work of Abbeel ...

Umar Syed, Robert E. Schapire

claim paper

Read More »

click to vote

ICRA
2009
IEEE

227views Robotics» more ICRA 2009»

Adaptive autonomous control using online value iteration with gaussian processes

15 years 4 months ago

Download www-personal.acfr.usyd.edu.au

— In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

claim paper

Read More »

click to vote

SIGIR
2010
ACM

174views Information Technology» more SIGIR 2010»

Learning more powerful test statistics for click-based retrieval evaluation

15 years 1 months ago

Download www.yisongyue.com

Interleaving experiments are an attractive methodology for evaluating retrieval functions through implicit feedback. Designed as a blind and unbiased test for eliciting a preferen...

Yisong Yue, Yue Gao, Olivier Chapelle, Ya Zhang, T...

claim paper

Read More »

« Prev « First page 132 / 744 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers