Search Sciweavers | Sciweavers

5 search results - page 1 / 1

NIPS
2000

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

13 years 7 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

IAT
2003
IEEE

Asymmetric Multiagent Reinforcement Learning

13 years 11 months ago

Download lib.tkk.fi

A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the ...

Ville Könönen

LAMAS
2005
Springer

Multi-agent Relational Reinforcement Learning

13 years 11 months ago

Download dtai.cs.kuleuven.be

In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a r...

Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice ...

TSMC
2008

A Comprehensive Survey of Multiagent Reinforcement Learning

13 years 6 months ago

Download www.dcsc.tudelft.nl

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many task...

Lucian Busoniu, Robert Babuska, Bart De Schutter

ATAL
2005
Springer

Behavior transfer for value-function-based reinforcement learning

13 years 11 months ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone
