Search Sciweavers | Sciweavers

202 search results - page 19 / 41

» Comments on the Origin and Application of Markov Decision Pr...

117

click to vote

AAAI
2008

134views Intelligent Agents» more AAAI 2008»

Interaction Structure and Dimensionality Reduction in Decentralized MDPs

15 years 4 months ago

Download www.aaai.org

Decentralized Markov Decision Processes are a powerful general model of decentralized, cooperative multi-agent problem solving. The high complexity of the general problem leads to...

Martin Allen, Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

109

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 2 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

118

click to vote

ATAL
2009
Springer

155views Intelligent Agents» more ATAL 2009»

Planning with continuous resources for agent teams

15 years 8 months ago

Download www.aamas-conference.org

Many problems of multiagent planning under uncertainty require distributed reasoning with continuous resources and resource limits. Decentralized Markov Decision Problems (Dec-MDP...

Janusz Marecki, Milind Tambe

claim paper

Read More »

140

click to vote

CPAIOR
2008
Springer

198views Operations Research» more CPAIOR 2008»

Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization

15 years 3 months ago

Download cs.brown.edu

The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...

Luc Mercier, Pascal Van Hentenryck

claim paper

Read More »

163

click to vote

ICTAI
2006
IEEE

110views Artificial Intelligence» more ICTAI 2006»

A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem

15 years 8 months ago

Download www.loria.fr

We propose a novel algorithm called GA-MDP for solving the frequency assigment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...

Lhassane Idoumghar, René Schott

claim paper

Read More »

« Prev « First page 19 / 41 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers