Search Sciweavers | Sciweavers

1233 search results - page 90 / 247

» Reinforcement Learning in MirrorBot

144

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 5 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

143

click to vote

NIPS
2003

148views Information Technology» more NIPS 2003»

Approximate Planning in POMDPs with Macro-Actions

15 years 5 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

claim paper

Read More »

178

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

13 years 11 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

148

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

16 years 4 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

142

click to vote

IJCAI
2007

275views Artificial Intelligence» more IJCAI 2007»

Effective Control Knowledge Transfer through Learning Skill and Representation Hierarchies

15 years 5 months ago

Download www.ijcai.org

Learning capabilities of computer systems still lag far behind biological systems. One of the reasons can be seen in the inefﬁcient re-use of control knowledge acquired over the...

Mehran Asadi, Manfred Huber

claim paper

Read More »

« Prev « First page 90 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers