Search Sciweavers | Sciweavers

26 search results - page 5 / 6

An approximate algorithm for solving oracular POMDPs

ATAL
2007
Springer

122 views, Intelligent Agents

Letting loose a SPIDER on a network of POMDPs: generating quality guaranteed policies

13 years 11 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are a popular approach for modeling multi-agent systems acting in uncertain domains. Given the signi...

Pradeep Varakantham, Janusz Marecki, Yuichi Yabu, ...

ECML
2005
Springer

101 views, Machine Learning

Model-Based Online Learning of POMDPs

13 years 11 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

ATAL
2006
Springer

157 views, Intelligent Agents

Decentralized planning under uncertainty for teams of communicating agents

13 years 9 months ago

Download www.cs.cmu.edu

Decentralized partially observable Markov decision processes (DEC-POMDPs) form a general framework for planning for groups of cooperating agents that inhabit a stochastic and part...

Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A....

NIPS
1998

140 views, Information Technology

Gradient Descent for General Reinforcement Learning

13 years 6 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

AIPS
2009

161 views, Artificial Intelligence

Navigation Planning in Probabilistic Roadmaps with Uncertainty

13 years 6 months ago

Download www.cs.bham.ac.uk

Probabilistic Roadmaps (PRM) are a commonly used class of algorithms for robot navigation tasks where obstacles are present in the environment. We examine the situation where the ...

Michael Kneebone, Richard Dearden
