Search Sciweavers | Sciweavers

102 search results - page 4 / 21

» MDPs with Non-Deterministic Policies

121

click to vote

AUTOMATICA
2008

74views more AUTOMATICA 2008»

Policy iteration based feedback control

15 years 1 months ago

Download www.cfins.au.tsinghua.edu.cn

It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...

Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao

claim paper

Read More »

123

Voted

NIPS
2003

180views Information Technology» more NIPS 2003»

Bounded Finite State Controllers

15 years 3 months ago

Download books.nips.cc

We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic ﬁni...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

105

Voted

NIPS
2003

147views Information Technology» more NIPS 2003»

Auction Mechanism Design for Multi-Robot Coordination

15 years 3 months ago

Download books.nips.cc

The design of cooperative multi-robot systems is a highly active research area in robotics. Two lines of research in particular have generated interest: the solution of large, wea...

Curt A. Bererton, Geoffrey J. Gordon, Sebastian Th...

claim paper

Read More »

106

click to vote

AAAI
2006

121views Intelligent Agents» more AAAI 2006»

Focused Real-Time Dynamic Programming for MDPs: Squeezing More Out of a Heuristic

15 years 3 months ago

Download www.cs.cmu.edu

Real-time dynamic programming (RTDP) is a heuristic search algorithm for solving MDPs. We present a modified algorithm called Focused RTDP with several improvements. While RTDP ma...

Trey Smith, Reid G. Simmons

claim paper

Read More »

117

Voted

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 3 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

« Prev « First page 4 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers