Search Sciweavers | Sciweavers

200 search results - page 13 / 40

» Point-Based Policy Iteration

180

click to vote

AAAI
2006

157views Intelligent Agents» more AAAI 2006»

Improving Approximate Value Iteration Using Memories and Predictive State Representations

15 years 8 months ago

Download www.aaai.org

Planning in partially-observable dynamical systems is a challenging problem, and recent developments in point-based techniques such as Perseus significantly improve performance as...

Michael R. James, Ton Wessling, Nikos A. Vlassis

claim paper

Read More »

197

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 7 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

170

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

15 years 8 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

168

Voted

NIPS
2000

121views Information Technology» more NIPS 2000»

APRICODD: Approximate Policy Construction Using Decision Diagrams

15 years 8 months ago

Download www.cs.ubc.ca

We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...

Robert St-Aubin, Jesse Hoey, Craig Boutilier

claim paper

Read More »

171

Voted

NIPS
2004

125views Information Technology» more NIPS 2004»

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs

15 years 8 months ago

Download books.nips.cc

Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

« Prev « First page 13 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers