Guiding exploration by pre-existing knowledge without modifying reward

13 years 4 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many applications receiving even the first reward may require long exploration, during which the agent has no information about its progress. This paper presents an approach that makes it possible to use pre-existing knowledge about the task for guiding exploration through the state space. Concepts of short- and long-term memory combine guidance by pre-existing knowledge with reinforcement learning methods for value function estimation in order to make learning faster while allowing the agent to converge towards a good policy.

Kary Främling

Real-time Traffic

Exploration | Long Exploration | Neural Networks | NN 2007 | Reinforcement |

claim paper

Post Info
More Details (n/a)

Added	27 Dec 2010
Updated	27 Dec 2010
Type	Journal
Year	2007
Where	NN
Authors	Kary Främling

Comments (0)

Sciweavers

Guiding exploration by pre-existing knowledge without modifying reward

Exploration | Long Exploration | Neural Networks | NN 2007 | Reinforcement |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers