Search Sciweavers | Sciweavers

1799 search results - page 153 / 360

» Filtered Reinforcement Learning

150

click to vote

NIPS
2003

148views Information Technology» more NIPS 2003»

Approximate Planning in POMDPs with Macro-Actions

15 years 6 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

claim paper

Read More »

201

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 14 days ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

174

Voted

ROBOCUP
2004
Springer

114views Robotics» more ROBOCUP 2004»

Modular Learning System and Scheduling for Behavior Acquisition in Multi-agent Environment

15 years 10 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the policy alternation of others in multiagent dynamic environments such as RoboCup competitions since othe...

Yasutake Takahashi, Kazuhiro Edazawa, Minoru Asada

claim paper

Read More »

151

click to vote

IJCAI
2007

275views Artificial Intelligence» more IJCAI 2007»

Effective Control Knowledge Transfer through Learning Skill and Representation Hierarchies

15 years 6 months ago

Download www.ijcai.org

Learning capabilities of computer systems still lag far behind biological systems. One of the reasons can be seen in the inefﬁcient re-use of control knowledge acquired over the...

Mehran Asadi, Manfred Huber

claim paper

Read More »

144

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 8 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

« Prev « First page 153 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers