Search Sciweavers | Sciweavers

294 search results - page 2 / 59

» Learning the structure of Factored Markov Decision Processes...

click to vote

ATAL
2009
Springer

167views Intelligent Agents» more ATAL 2009»

Solving multiagent assignment Markov decision processes

13 years 11 months ago

Download www.aamas-conference.org

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called“Assignmen...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

click to vote

PRICAI
2000
Springer

193views Artificial Intelligence» more PRICAI 2000»

Generating Hierarchical Structure in Reinforcement Learning from State Variables

13 years 8 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

13 years 3 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

13 years 6 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 5 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

« Prev « First page 2 / 59 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers