Search Sciweavers | Sciweavers

495 search results - page 48 / 99

» Approximation algorithms for budgeted learning problems

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 1 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

117

click to vote

NIPS
1997

107views Information Technology» more NIPS 1997»

Learning to Order Things

15 years 1 months ago

Download people.csail.mit.edu

There are many applications in which it is desirable to order rather than classify instances. Here we consider the problem of learning how to order, given feedback in the form of ...

William W. Cohen, Robert E. Schapire, Yoram Singer

claim paper

Read More »

click to vote

COMPGEOM
2005
ACM

107views Discrete Geometry» more COMPGEOM 2005»

Learning smooth objects by probing

15 years 1 months ago

Download graphics.stanford.edu

We consider the problem of discovering a smooth unknown surface S bounding an object O in R3 . The discovery process consists of moving a point probing device in the free space ar...

Jean-Daniel Boissonnat, Leonidas J. Guibas, Steve ...

claim paper

Read More »

click to vote

ATAL
2008
Springer

127views Intelligent Agents» more ATAL 2008»

Autonomous transfer for reinforcement learning

15 years 1 months ago

Download www.cs.utexas.edu

Recent work in transfer learning has succeeded in making reinforcement learning algorithms more efficient by incorporating knowledge from previous tasks. However, such methods typ...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone

claim paper

Read More »

102

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 14 days ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 48 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers