Search Sciweavers | Sciweavers

79

GECCO
2009
Springer

96views Optimization» more GECCO 2009»

Novelty of behaviour as a basis for the neuro-evolution of operant reward learning

15 years 5 months ago

An agent that deviates from a usual or previous course of action can be said to display novel or varying behaviour. Novelty of behaviour can be seen as the result of real or appar...

Andrea Soltoggio, Ben Jones

claim paper

Read More »

126

click to vote

JMLR
2012

229views Programming Languages» more JMLR 2012»

Hierarchical Relative Entropy Policy Search

13 years 3 months ago

Download www.ias.informatik.tu-darmstadt.de

Many real-world problems are inherently hierarchically structured. The use of this structure in an agent’s policy may well be the key to improved scalability and higher performa...

Christian Daniel, Gerhard Neumann, Jan Peters

claim paper

Read More »

128

Voted

ATAL
2004
Springer

149views Intelligent Agents» more ATAL 2004»

Learning User Preferences for Wireless Services Provisioning

15 years 6 months ago

Download people.csail.mit.edu

The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...

George Lee, Steven Bauer, Peyman Faratin, John Wro...

claim paper

Read More »

96

Voted

ATAL
2008
Springer

99views Intelligent Agents» more ATAL 2008»

Non-linear dynamics in multiagent reinforcement learning algorithms

15 years 2 months ago

Download www.aamas-conference.org

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

102

click to vote

JMLR
2006

153views more JMLR 2006»

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 22 days ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers