Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

39

ECML
2007
Springer

favoriteEmaildiscussreport

167views Machine Learning» more ECML 2007»

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

14 years 1 months ago

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of finding solution trajectories for such problems can be reduced by incorporating limited prior knowledge of the approximative local system dynamics. The presented algorithm builds an adaptive state graph of sample points within the continuous state space. The nodes of the graph are generated by an efficient principled exploration scheme that directs the agent towards promising regions, while maintaining good online performance. Global solution trajectories are formed as combinations of local controllers that connect nodes of the graph, thereby naturally allowing continuous actions and continuous time steps. We demonstrate our approach on various movement planning tasks in continuous domains.

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

Real-time Traffic

Continuous State Space | Deterministic Continuous Control | ECML 2007 | Machine Learning | Solution Trajectories |

claim paper

Related Content

» Adaptive Aggregation for Reinforcement Learning with Efficient Exploration Deterministic D...

» Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Rei...

» Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

» The Dynamics of MultiAgent Reinforcement Learning

» Utile Distinctions for Relational Reinforcement Learning

» Binary action search for learning continuousaction control policies

» Efficient GraphBased SemiSupervised Learning of Structured Tagging Models

» Towards DomainIndependent Machine Intelligence

» Adaptive Retrieval Agents Internalizing Local Context and Scaling up to the Web

Post Info
More Details (n/a)

Added	14 Aug 2010
Updated	14 Aug 2010
Type	Conference
Year	2007
Where	ECML
Authors	Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

Comments (0)