Search Sciweavers | Sciweavers

575 search results - page 47 / 115

» Reinforcement Learning State Estimator

145

click to vote

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 5 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

107

click to vote

UAI
2001

98views Artificial Intelligence» more UAI 2001»

Policy Improvement for POMDPs Using Normalized Importance Sampling

15 years 5 months ago

Download www.cs.ucr.edu

We present a new method for estimating the expected return of a POMDP from experience. The estimator does not assume any knowledge of the POMDP, can estimate the returns for finit...

Christian R. Shelton

claim paper

Read More »

137

click to vote

ECAI
2008
Springer

165views Artificial Intelligence» more ECAI 2008»

Belief revision with reinforcement learning for interactive object recognition

15 years 6 months ago

Download www.inf.fh-dortmund.de

From a conceptual point of view, belief revision and learning are quite similar. Both methods change the belief state of an intelligent agent by processing incoming information. Ho...

Thomas Leopold, Gabriele Kern-Isberner, Gabriele P...

claim paper

Read More »

131

click to vote

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 9 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

134

click to vote

CORR
2011
Springer

194views Education» more CORR 2011»

Accelerating Reinforcement Learning through Implicit Imitation

14 years 8 months ago

Download www.aaai.org

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the kn...

Craig Boutilier, Bob Price

claim paper

Read More »

« Prev « First page 47 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers