Using relative novelty to identify useful temporal abstractions in reinforcement learning

We present a new method for automatically creating useful temporal abstractions in reinforcement learning. We argue that states that allow the agent to transition to a different region of the state space are useful subgoals, and propose a method for identifying them using the concept of relative novelty. When such a state is identified, a temporally extended activity (e.g., an option) is generated that takes the agent efficiently to this state. We illustrate the utility of the method in a number of tasks.
Özgür Şimşek, Andrew G. Barto
Type: Conference
Year: 2004
Where: ICML
Authors: Özgür Şimşek, Andrew G. Barto
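
The abstract gives only a high-level description of the method. Below is a minimal Python sketch of the relative-novelty idea, under stated assumptions: the novelty of a state is taken to be n^(-1/2), where n is its visit count so far, and the relative novelty at a point in a trajectory is the ratio of the total novelty of the next few states to that of the previous few. The function name relative_novelty_scores, the default lag of 7, and the bookkeeping details are illustrative assumptions, not the paper's exact algorithm.

    from collections import defaultdict

    def relative_novelty_scores(trajectory, lag=7):
        # Novelty of a state: visits ** -0.5, updated as the
        # trajectory is traversed (fewer prior visits means
        # higher novelty). These choices are assumptions made
        # for illustration.
        visits = defaultdict(int)
        novelty = []
        for s in trajectory:
            visits[s] += 1
            novelty.append(visits[s] ** -0.5)

        # Relative novelty at time t: total novelty of the `lag`
        # states after t divided by total novelty of the `lag`
        # states before t. High scores suggest the agent has just
        # crossed into a less-familiar region of the state space.
        scores = defaultdict(list)
        for t in range(lag, len(trajectory) - lag):
            before = sum(novelty[t - lag:t])
            after = sum(novelty[t + 1:t + 1 + lag])
            scores[trajectory[t]].append(after / before)
        return scores

In a two-room gridworld, for instance, the doorway state would tend to score highly, since the states visited just after passing through it are typically less familiar, and hence more novel, than those visited just before it. States whose scores repeatedly exceed a threshold are treated as candidate subgoals, and a temporally extended activity (an option) is then generated to take the agent to each of them efficiently.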