Search Sciweavers | Sciweavers

32 search results - page 7 / 7

» Adaptive autonomous control using online value iteration wit...

137

Voted

RSS
2007

176views Robotics» more RSS 2007»

Active Policy Learning for Robot Planning and Exploration under Uncertainty

15 years 2 months ago

Download www.roboticsproceedings.org

Abstract— This paper proposes a simulation-based active policy learning algorithm for ﬁnite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...

claim paper

Read More »

124

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

15 years 7 months ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

« Prev « First page 7 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers