Search Sciweavers | Sciweavers

757 search results - page 2 / 152

» Generalized model learning for reinforcement learning in fac...

click to vote

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

13 years 9 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

13 years 9 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

click to vote

ICMLA
2009

185views Machine Learning» more ICMLA 2009»

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

13 years 3 months ago

Download staff.science.uva.nl

Abstract--Feature selection is an important challenge in machine learning. Unfortunately, most methods for automating feature selection are designed for supervised learning tasks a...

Mark Kroon, Shimon Whiteson

claim paper

Read More »

click to vote

SCAI
2008

246views Artificial Intelligence» more SCAI 2008»

Fast Learning in an Actor-Critic Architecture with Reward and Punishment

13 years 7 months ago

Download www.lucs.lu.se

Abstract. A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action as...

Christian Balkenius, Stefan Winberg

claim paper

Read More »

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

14 years 6 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

« Prev « First page 2 / 152 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers