Search Sciweavers | Sciweavers

34 search results - page 3 / 7

» Using Temporal Neighborhoods to Adapt Function Approximators...

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

13 years 7 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

click to vote

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

13 years 6 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 6 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

13 years 3 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

13 years 9 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

« Prev « First page 3 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers