Search Sciweavers | Sciweavers

187 search results - page 8 / 38

» Imitation and Reinforcement Learning in Agents with Heteroge...

107

click to vote

KES
2004
Springer

165views Information Technology» more KES 2004»

Coordination in Multiagent Reinforcement Learning Systems

15 years 6 months ago

Download cig.ees.kyushu-u.ac.jp

This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action ...

M. A. S. Kamal, Junichi Murata

claim paper

Read More »

126

click to vote

ICMLA
2010

207views Machine Learning» more ICMLA 2010»

Multi-Agent Inverse Reinforcement Learning

14 years 11 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

claim paper

Read More »

click to vote

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

15 years 6 months ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

click to vote

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 3 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

click to vote

ICML
2002
IEEE

133views Machine Learning» more ICML 2002»

Coordinated Reinforcement Learning

16 years 2 months ago

Download select.cs.cmu.edu

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...

Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...

claim paper

Read More »

« Prev « First page 8 / 38 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers