Search Sciweavers | Sciweavers

52 search results - page 10 / 11

» A Reinforcement Learning approach to evaluating state repres...

click to vote

CI
2005

106views more CI 2005»

Incremental Learning of Procedural Planning Knowledge in Challenging Environments

13 years 5 months ago

Download www.sunnyhome.org

Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning ...

Douglas J. Pearson, John E. Laird

claim paper

Read More »

click to vote

ML
2000
ACM

150views Machine Learning» more ML 2000»

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

13 years 5 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

claim paper

Read More »

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

13 years 5 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 6 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

13 years 6 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

« Prev « First page 10 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers