Search Sciweavers | Sciweavers

46 search results - page 2 / 10

» A Sparse Sampling Algorithm for Near-Optimal Planning in Lar...

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

14 years 5 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

14 years 5 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

click to vote

UAI
2003

104views Artificial Intelligence» more UAI 2003»

Optimal Limited Contingency Planning

13 years 6 months ago

Download ti.arc.nasa.gov

For a given problem, the optimal Markov policy over a ﬁnite horizon is a conditional plan containing a potentially large number of branches. However, there are applications wher...

Nicolas Meuleau, David E. Smith

claim paper

Read More »

click to vote

ICRA
2007
IEEE

154views Robotics» more ICRA 2007»

Oracular Partially Observable Markov Decision Processes: A Very Special Case

13 years 11 months ago

Download www.cs.cmu.edu

— We introduce the Oracular Partially Observable Markov Decision Process (OPOMDP), a type of POMDP in which the world produces no observations; instead there is an “oracle,” ...

Nicholas Armstrong-Crews, Manuela M. Veloso

claim paper

Read More »

click to vote

AAAI
1997

139views Intelligent Agents» more AAAI 1997»

Model Minimization in Markov Decision Processes

13 years 6 months ago

Download www.cs.brown.edu

Many stochastic planning problems can be represented using Markov Decision Processes (MDPs). A difficulty with using these MDP representations is that the common algorithms for so...

Thomas Dean, Robert Givan

claim paper

Read More »

« Prev « First page 2 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers