Search Sciweavers | Sciweavers

58 search results - page 4 / 12

» Using Learned Policies in Heuristic-Search Planning

IROS
2008
IEEE

144 views · Robotics

Learning nonparametric policies by imitation

15 years 6 months ago

Download www.cs.washington.edu

A long-cherished goal in artificial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ...

David B. Grimes, Rajesh P. N. Rao

EWRL
2008

144 views · Machine Learning

Regularized Fitted Q-Iteration: Application to Planning

15 years 1 month ago

Download eprints.pascal-network.org

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

NIPS
1993

86 views · Information Technology

Robust Reinforcement Learning in Motion Planning

15 years 1 month ago

Download www.cs.cmu.edu

While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

ICML
2008
IEEE

135 views · Machine Learning

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 16 days ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

IJRR
2008

151 views

Trajectory Optimization using Reinforcement Learning for Map Exploration

14 years 11 months ago

Download mapleleaf.csail.mit.edu

Automatically building maps from sensor data is a necessary and fundamental skill for mobile robots; as a result, considerable research attention has focused on the technical chall...

Thomas Kollar, Nicholas Roy
