Search Sciweavers | Sciweavers

2005 search results - page 357 / 401

» Decisive Markov Chains

110

click to vote

NIPS
2007

170views Information Technology» more NIPS 2007»

What makes some POMDP problems easy to approximate?

15 years 2 months ago

Download books.nips.cc

Point-based algorithms have been surprisingly successful in computing approximately optimal solutions for partially observable Markov decision processes (POMDPs) in high dimension...

David Hsu, Wee Sun Lee, Nan Rong

claim paper

Read More »

105

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 2 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

116

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

15 years 2 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

104

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 2 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

107

click to vote

NIPS
2001

158views Information Technology» more NIPS 2001»

Multiagent Planning with Factored MDPs

15 years 2 months ago

Download books.nips.cc

We present a principled and efficient planning algorithm for cooperative multiagent dynamic systems. A striking feature of our method is that the coordination and communication be...

Carlos Guestrin, Daphne Koller, Ronald Parr

claim paper

Read More »

« Prev « First page 357 / 401 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers