Search Sciweavers | Sciweavers

116

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

15 years 3 months ago

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

75

Voted

NIPS
2004

128views Information Technology» more NIPS 2004»

A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees

15 years 3 months ago

Download books.nips.cc

We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of p...

Daniela Pucci de Farias, Benjamin Van Roy

claim paper

Read More »

87

click to vote

AAAI
1994

159views Intelligent Agents» more AAAI 1994»

Acting Optimally in Partially Observable Stochastic Domains

15 years 3 months ago

Download www.cs.rutgers.edu

In this paper, we describe the partially observable Markov decision process pomdp approach to nding optimal or near-optimal control strategies for partially observable stochastic ...

Anthony R. Cassandra, Leslie Pack Kaelbling, Micha...

claim paper

Read More »

113

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

15 years 1 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

147

click to vote

ICRA
2010
IEEE

101views Robotics» more ICRA 2010»

Multirobot coordination by auctioning POMDPs

15 years 8 days ago

Download users.isr.ist.utl.pt

— We consider the problem of task assignment and execution in multirobot systems, by proposing a procedure for bid estimation in auction protocols. Auctions are of interest to mu...

Matthijs T. J. Spaan, Nelson Gonçalves, Jo&...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers