Search Sciweavers | Sciweavers

89 search results - page 15 / 18

» Sample-Based Planning for Continuous Action Markov Decision ...

IJRR
2008

Motion Planning Under Uncertainty for Image-guided Medical Needle Steering

14 years 11 months ago

Download dora.cwru.edu

We develop a new motion planning algorithm for a variant of a Dubins car with binary left/right steering and apply it to steerable needles, a new class of flexible bevel-tip medica...

Ron Alterovitz, Michael S. Branicky, Kenneth Y. Go...

ICRA
2010
IEEE

Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs

14 years 10 months ago

Download robotics.ai.uiuc.edu

We propose a planning algorithm that allows user-supplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...

Salvatore Candido, James C. Davidson, Seth Hutchin...

ICTAI
2009
IEEE

TiMDPpoly: An Improved Method for Solving Time-Dependent MDPs

14 years 9 months ago

Download www.montefiore.ulg.ac.be

We introduce TiMDPpoly, an algorithm designed to solve planning problems with durative actions, under probabilistic uncertainty, in a non-stationary, continuous-time context. Miss...

Emmanuel Rachelson, Patrick Fabiani, Fréd...

PRIMA
2007
Springer

Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

15 years 5 months ago

Download lang.is.kyushu-u.ac.jp

Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint pol...

Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki

ECML
2007
Springer

Policy Gradient Critics

15 years 5 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
