Search Sciweavers | Sciweavers

280 search results - page 46 / 56

» Planning for Markov Decision Processes with Sparse Stochasti...

click to vote

ICRA
2010
IEEE

133views Robotics» more ICRA 2010»

Variable resolution decomposition for robotic navigation under a POMDP framework

14 years 10 months ago

Download www.cs.mcgill.ca

— Partially Observable Markov Decision Processes (POMDPs) offer a powerful mathematical framework for making optimal action choices in noisy and/or uncertain environments, in par...

Robert Kaplow, Amin Atrash, Joelle Pineau

claim paper

Read More »

127

click to vote

IJRR
2011

218views more IJRR 2011»

Motion planning under uncertainty for robotic tasks with long time horizons

14 years 6 months ago

Download deslab.mit.edu

Abstract Partially observable Markov decision processes (POMDPs) are a principled mathematical framework for planning under uncertainty, a crucial capability for reliable operation...

Hanna Kurniawati, Yanzhu Du, David Hsu, Wee Sun Le...

claim paper

Read More »

click to vote

NIPS
2008

171views Information Technology» more NIPS 2008»

MDPs with Non-Deterministic Policies

15 years 1 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

click to vote

AIPS
2000

107views Artificial Intelligence» more AIPS 2000»

On-line Scheduling via Sampling

15 years 1 months ago

Download www.aaai.org

1 We consider the problem of scheduling an unknown sequence of tasks for a single server as the tasks arrive with the goal off maximizing the total weighted value of the tasks serv...

Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 6 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 46 / 56 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers