Search Sciweavers | Sciweavers

84

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 1 months ago

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

89

click to vote

IUI
1999
ACM

107views Software Engineering» more IUI 1999»

Multi-Agent Learning Approach to WWW Information Retrieval Using Neural Network

15 years 4 months ago

Download www.foibg.com

er has outlined the potential of multiagent framework for decision support. From an abstract point of view, the concept of an agent has been used as modularization principle for th...

Yong S. Choi, Suk I. Yoo

claim paper

Read More »

98

click to vote

ATAL
2007
Springer

128views Intelligent Agents» more ATAL 2007»

Advice taking in multiagent reinforcement learning

15 years 6 months ago

Download homepages.inf.ed.ac.uk

This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) in the stochastic games framework that uses an additional “advice” signal to inform agen...

Michael Rovatsos, Alexandros Belesiotis

claim paper

Read More »

113

click to vote

AAAI
2011

206views Intelligent Agents» more AAAI 2011»

Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs

13 years 11 months ago

Download www.cs.umass.edu

In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

94

click to vote

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

16 years 16 days ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers