Search Sciweavers | Sciweavers

1239 search results - page 4 / 248

» Communication for Improving Policy Computation in Distribute...


AAAI
2006

146 views, Intelligent Agents

Incremental Least Squares Policy Iteration for POMDPs

15 years 7 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin



PROMAS
2004
Springer

189 views, Intelligent Agents

Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach

15 years 11 months ago

Download teamcore.usc.edu

Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative f...

Ranjit Nair, Milind Tambe



ECML
2005
Springer

101 views, Machine Learning

Model-Based Online Learning of POMDPs

15 years 11 months ago

Download www.cs.bgu.ac.il

Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony



ATAL
2007
Springer

142 views, Intelligent Agents

Q-value functions for decentralized POMDPs

16 years 1 day ago

Download www.science.uva.nl

Planning in single-agent models like MDPs and POMDPs can be carried out by resorting to Q-value functions: a (near-) optimal Q-value function is computed in a recursive manner by ...

Frans A. Oliehoek, Nikos A. Vlassis



ATAL
2009
Springer

109 views, Intelligent Agents

Reward shaping for valuing communications during multi-agent coordination

16 years 13 days ago

Download eprints.ecs.soton.ac.uk

Decentralised coordination in multi-agent systems is typically achieved using communication. However, in many cases, communication is expensive to utilise because there is limited...

Simon A. Williamson, Enrico H. Gerding, Nicholas R...

