Complexity Analysis of Real-Time Reinforcement Learning

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous real-time versions of Q-learning and value iteration, applied to the problem of reaching a goal state in deterministic domains. Previous work had concluded that, in many cases, tabula rasa reinforcement learning was exponential for such problems, or was tractable only if the learning algorithm was augmented. We show that, to the contrary, the algorithms are tractable with only a simple change in the task representation or initialization. We provide tight bounds on the worst-case complexity, and show how the complexity is even smaller if the reinforcement learning algorithms have initial knowledge of the topology of the state space or if the domain has certain special properties. We also present a novel bidirectional Q-learning algorithm to find optimal paths from all states to a goal state and show that it is no more complex than the other algorithms.
Sven Koenig, Reid G. Simmons
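
A minimal sketch of the setting the abstract describes: one trial of real-time Q-learning in a deterministic goal-reaching domain, undiscounted and with learning rate 1, using zero initialization and the action-penalty representation (every step costs -1, the goal is absorbing with value 0), which is the kind of initialization/representation change the paper argues makes the algorithms tractable. The graph encoding, function names, and step limit below are illustrative assumptions, not the authors' code.

```python
def real_time_q_trial(q, successors, start, goal, max_steps=10_000):
    """Run one greedy trial from start, updating q in place.

    successors maps each state to a dict of action -> next state;
    q maps (state, action) pairs to value estimates.
    """
    state = start
    for _ in range(max_steps):
        if state == goal:
            return True  # goal reached
        actions = successors[state]
        # Greedy action selection; unseen Q-values default to 0, which is
        # optimistic here because all true values are negative.
        action = max(actions, key=lambda a: q.get((state, a), 0.0))
        next_state = actions[action]
        # Action-penalty representation: every step costs -1 and the goal
        # has value 0, so -Q estimates the remaining distance to the goal.
        if next_state == goal:
            best_next = 0.0
        else:
            best_next = max(q.get((next_state, a), 0.0)
                            for a in successors[next_state])
        q[(state, action)] = -1.0 + best_next
        state = next_state
    return False  # step limit hit


# Example: a three-state chain with the goal on the right. Repeated trials
# drive the Q-values toward -(steps to goal) along the optimal path.
graph = {
    "s0": {"right": "s1"},
    "s1": {"left": "s0", "right": "s2"},
    "s2": {"left": "s1", "right": "goal"},
    "goal": {},
}
q = {}
for _ in range(10):
    real_time_q_trial(q, graph, "s0", "goal")
print(q[("s0", "right")])  # converges to -3, the negative distance to goal
```

Because the domain is deterministic and the zero initialization is admissible, each update is an exact Bellman backup, which is why such a trivial representational choice suffices for tractability in this setting.
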
Type: Conference
Year: 1993
Where: AAAI