Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP i...
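For reference, the belief update at the heart of the POMDP framework (standard notation; the symbols b, T, O below are generic and not taken from this abstract) is:

```latex
b_{a,o}(s') \;=\;
\frac{O(o \mid s', a)\,\sum_{s} T(s' \mid s, a)\, b(s)}
     {\sum_{s''} O(o \mid s'', a)\,\sum_{s} T(s'' \mid s, a)\, b(s)}
```

Here b is the current belief over states, T the transition model, and O the observation model; the denominator normalizes the updated belief.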
For a Markov Decision Process with a finite state space (size S) and finite action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
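To make the algorithm named in the abstract concrete, here is a simplified Python sketch of a delayed, optimistically initialized Q-update. The m-sample batching and the epsilon1 threshold follow the abstract's description of Delayed Q-Learning, but the class name, parameter defaults, and the omission of the LEARN-flag bookkeeping are simplifications for illustration, not the authors' implementation.

```python
import numpy as np

class DelayedQLearner:
    """Simplified sketch of a delayed Q-learning update.

    Q-values start optimistically at 1/(1-gamma). For each (s, a) the agent
    accumulates m sampled Bellman targets and only updates Q(s, a) when the
    batch would lower it by at least epsilon1. The LEARN-flag machinery used
    by the full algorithm to bound attempted updates is omitted here.
    """

    def __init__(self, n_states, n_actions, gamma=0.95, m=20, epsilon1=0.01):
        self.gamma, self.m, self.eps1 = gamma, m, epsilon1
        self.Q = np.full((n_states, n_actions), 1.0 / (1.0 - gamma))  # optimistic init
        self.U = np.zeros((n_states, n_actions))              # accumulated targets
        self.l = np.zeros((n_states, n_actions), dtype=int)   # samples gathered

    def act(self, s):
        # Greedy action selection; optimism in Q drives exploration.
        return int(np.argmax(self.Q[s]))

    def observe(self, s, a, r, s_next):
        # Accumulate one sample of the Bellman target for (s, a).
        self.U[s, a] += r + self.gamma * self.Q[s_next].max()
        self.l[s, a] += 1
        if self.l[s, a] == self.m:
            target = self.U[s, a] / self.m
            # Attempted update: only move Q down, and only by a meaningful margin.
            if self.Q[s, a] - target >= 2 * self.eps1:
                self.Q[s, a] = target + self.eps1
            self.U[s, a] = 0.0
            self.l[s, a] = 0
```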
We propose a framework for policy generation in continuous-time stochastic domains with concurrent actions and events of uncertain duration. We make no assumptions regarding the co...
Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces. Decisions are based on a Particle Filter for estimating the bel...
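As an illustration of belief tracking with a particle filter in such a continuous-state setting, a minimal bootstrap-filter update might look as follows. The functions transition_sample and observation_likelihood are hypothetical caller-supplied models, not interfaces from the paper.

```python
import numpy as np

def particle_filter_update(particles, action, observation,
                           transition_sample, observation_likelihood, rng=None):
    """One bootstrap-filter belief update over a set of state particles.

    transition_sample(state, action, rng)          -> sampled next state
    observation_likelihood(obs, state, action)     -> p(obs | state, action)
    Both are problem-specific models supplied by the caller.
    """
    rng = rng or np.random.default_rng()
    # Propagate every particle through the stochastic transition model.
    propagated = np.array([transition_sample(p, action, rng) for p in particles])
    # Weight particles by how well they explain the new observation.
    weights = np.array([observation_likelihood(observation, p, action) for p in propagated])
    total = weights.sum()
    weights = weights / total if total > 0 else np.full(len(propagated), 1.0 / len(propagated))
    # Resample to obtain an unweighted particle approximation of the new belief.
    idx = rng.choice(len(propagated), size=len(propagated), p=weights)
    return propagated[idx]
```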
We adopt the decision-theoretic principle of expected utility maximization as a paradigm for designing autonomous rational agents operating in multi-agent environments. We use the...
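A minimal rendering of the expected-utility-maximization principle, with prob and utility as illustrative placeholders rather than the paper's formalism:

```python
def best_action(actions, outcomes, prob, utility):
    """Pick the action with the highest expected utility.

    prob(outcome, action) -> probability of the outcome given the action
    utility(outcome)      -> the agent's utility for that outcome
    """
    def expected_utility(a):
        return sum(prob(o, a) * utility(o) for o in outcomes)
    return max(actions, key=expected_utility)
```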