Search Sciweavers | Sciweavers

60 search results - page 2 / 12

» Iteratively Extending Time Horizon Reinforcement Learning

ATAL
2004
Springer

116 views, Intelligent Agents

Time-Extended Policies in Multi-Agent Reinforcement Learning

13 years 10 months ago

Download web.engr.oregonstate.edu

Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement l...

Kagan Tumer, Adrian K. Agogino

ECML
2003
Springer

118 views, Machine Learning

A New Way to Introduce Knowledge into Reinforcement Learning

13 years 10 months ago

Download www.irisa.fr

We present in this paper a method to introduce a priori knowledge into reinforcement learning using temporally extended actions. The aim of our work is to reduce the learning time ...

Pascal Garcia

ML
2002
ACM

121 views, Machine Learning

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 4 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

ECAI
2008
Springer

158 views, Artificial Intelligence

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

13 years 7 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of dec...

Emmanuel Rachelson, Gauthier Quesnel, Fréd...

ICML
2000
IEEE

155 views, Machine Learning

Combining Reinforcement Learning with a Local Control Algorithm

14 years 6 months ago

Download www-anw.cs.umass.edu

We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...

Andrew G. Barto, Jette Randløv, Michael T. ...
