Search Sciweavers | Sciweavers

2990 search results - page 550 / 598

» Hidden Markov processes

164

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 7 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

175

click to vote

AAAI
2006

157views Intelligent Agents» more AAAI 2006»

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning

15 years 7 months ago

Download www.aaai.org

Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...

Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...

claim paper

Read More »

154

click to vote

AIPS
2004

105views Artificial Intelligence» more AIPS 2004»

Decision-Theoretic Military Operations Planning

15 years 7 months ago

Download eprints.pascal-network.org

Military operations planning involves concurrent actions, resource assignment, and conflicting costs. Individual tasks sometimes fail with a known probability, promoting a decisio...

Douglas Aberdeen, Sylvie Thiébaux, Lin Zhan...

claim paper

Read More »

171

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

15 years 7 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

162

click to vote

AIPS
2003

131views Artificial Intelligence» more AIPS 2003»

A Framework for Planning in Continuous-time Stochastic Domains

15 years 7 months ago

Download www.aaai.org

We propose a framework for policy generation in continuoustime stochastic domains with concurrent actions and events of uncertain duration. We make no assumptions regarding the co...

Håkan L. S. Younes, David J. Musliner, Reid ...

claim paper

Read More »

« Prev « First page 550 / 598 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers