Search Sciweavers | Sciweavers

27 search results - page 6 / 6

» Policy Gradient Method for Team Markov Games

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

14 years 5 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

click to vote

ATAL
2003
Springer

185views Intelligent Agents» more ATAL 2003»

Optimizing information exchange in cooperative multi-agent systems

13 years 10 months ago

Download rbr.cs.umass.edu

Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...

Claudia V. Goldman, Shlomo Zilberstein

claim paper

Read More »

« Prev « First page 6 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers