Search Sciweavers | Sciweavers

58 search results - page 5 / 12

» Using Learned Policies in Heuristic-Search Planning

ATAL
2009
Springer

205 views, Intelligent Agents

Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs

15 years 6 months ago

Download www.aamas-conference.org

Recent scaling up of decentralized partially observable Markov decision process (DEC-POMDP) solvers towards realistic applications is mainly due to approximate methods. Of this fa...

Jilles Steeve Dibangoye, Abdel-Illah Mouaddib, Bra...

ROBOCUP
2007
Springer

153 views, Robotics

Model-Based Reinforcement Learning in a Complex Domain

15 years 5 months ago

Download userweb.cs.utexas.edu

Reinforcement learning is a paradigm under which an agent seeks to improve its policy by making learning updates based on the experiences it gathers through interaction with the en...

Shivaram Kalyanakrishnan, Peter Stone, Yaxin Liu

ICML
1997
Morgan Kaufmann

181 views, Machine Learning

Robot Learning From Demonstration

16 years 16 days ago

Download www-clmc.usc.edu

The goal of robot learning from demonstration is to have a robot learn from watching a demonstration of the task to be performed. In our approach to learning from demonstration th...

Christopher G. Atkeson, Stefan Schaal

NIPS
2003

148 views, Information Technology

Approximate Planning in POMDPs with Macro-Actions

15 years 1 month ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

JAIR
2011

144 views

Non-Deterministic Policies in Markovian Decision Processes

14 years 6 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau
