Search Sciweavers | Sciweavers

92 search results - page 9 / 19

» Acting Optimally in Partially Observable Stochastic Domains

120

click to vote

GLOBECOM
2009
IEEE

114views Communications» more GLOBECOM 2009»

Minimum-Length Scheduling for Multicast Traffic under Channel Uncertainty

14 years 9 months ago

Download www.ee.oulu.fi

Abstract--We consider a set of multicast sources, each multicasting a finite amount of data to its corresponding destinations. The objective is to minimize the time to deliver all ...

Anna Pantelidou, Anthony Ephremides

claim paper

Read More »

click to vote

ATAL
2007
Springer

122views Intelligent Agents» more ATAL 2007»

Letting loose a SPIDER on a network of POMDPs: generating quality guaranteed policies

15 years 5 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are a popular approach for modeling multi-agent systems acting in uncertain domains. Given the signi...

Pradeep Varakantham, Janusz Marecki, Yuichi Yabu, ...

claim paper

Read More »

104

click to vote

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

15 years 22 days ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

106

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

118

click to vote

ATAL
2008
Springer

150views Intelligent Agents» more ATAL 2008»

Continual collaborative planning for mixed-initiative action and interaction

15 years 1 months ago

Download www.informatik.uni-freiburg.de

Multiagent environments are often highly dynamic and only partially observable which makes deliberative action planning computationally hard. In many such environments, however, a...

Michael Brenner

claim paper

Read More »

« Prev « First page 9 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers