Search Sciweavers | Sciweavers

» ABC Reinforcement Learning

Publication

233views

Sparse reward processes

14 years 1 month ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 4 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

NIPS
2003

148views Information Technology» more NIPS 2003»

Approximate Planning in POMDPs with Macro-Actions

15 years 4 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

13 years 11 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

ROBOCUP
2004
Springer

114views Robotics» more ROBOCUP 2004»

Modular Learning System and Scheduling for Behavior Acquisition in Multi-agent Environment

15 years 8 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments such as RoboCup competitions since othe...

Yasutake Takahashi, Kazuhiro Edazawa, Minoru Asada
