Search Sciweavers | Sciweavers

48 search results - page 7 / 10

» Approximate Planning in POMDPs with Macro-Actions

114

Voted

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 1 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

click to vote

NIPS
2007

170views Information Technology» more NIPS 2007»

Theoretical Analysis of Heuristic Search Methods for Online POMDPs

15 years 1 months ago

Download www.cs.mcgill.ca

Planning in partially observable environments remains a challenging problem, despite signiﬁcant recent advances in ofﬂine approximation techniques. A few online methods have a...

Stéphane Ross, Joelle Pineau, Brahim Chaib-...

claim paper

Read More »

Voted

ATAL
2009
Springer

155views Intelligent Agents» more ATAL 2009»

Achieving goals in decentralized POMDPs

15 years 6 months ago

Download anytime.cs.umass.edu

Coordination of multiple agents under uncertainty in the decentralized POMDP model is known to be NEXP-complete, even when the agents have a joint set of goals. Nevertheless, we s...

Christopher Amato, Shlomo Zilberstein

claim paper

Read More »

Voted

AAAI
2007

101views Intelligent Agents» more AAAI 2007»

Purely Epistemic Markov Decision Processes

15 years 2 months ago

Download www.aaai.org

Planning under uncertainty involves two distinct sources of uncertainty: uncertainty about the effects of actions and uncertainty about the current state of the world. The most wi...

Régis Sabbadin, Jérôme Lang, N...

claim paper

Read More »

click to vote

IJCAI
2003

122views Artificial Intelligence» more IJCAI 2003»

Point-based value iteration: An anytime algorithm for POMDPs

15 years 1 months ago

Download www.cs.mcgill.ca

This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution by selecting a small set of represen...

Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun

claim paper

Read More »

« Prev « First page 7 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers