Search Sciweavers | Sciweavers

81 search results - page 2 / 17

» Neuroevolutionary reinforcement learning for generalized hel...

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

12 years 11 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

click to vote

ECAI
2008
Springer

158views Artificial Intelligence» more ECAI 2008»

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

13 years 6 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of dec...

Emmanuel Rachelson, Gauthier Quesnel, Fréd&...

claim paper

Read More »

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

14 years 5 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

click to vote

ATAL
2009
Springer

137views Intelligent Agents» more ATAL 2009»

Generalized model learning for reinforcement learning in factored domains

13 years 11 months ago

Download userweb.cs.utexas.edu

Improving the sample eﬃciency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

claim paper

Read More »

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 6 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

« Prev « First page 2 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers