Perpetual Learning for Non-Cooperative Multiple Agents

13 years 6 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic games. These sequences of combined agent strategies (joint-policies) can be thought of as a walk through the space of all possible joint-policies. We argue that this walk, while containing random elements, is also driven by each agent's drive to improve their current situation at each point, and posit a learning pressure field across policy space to represent this drive. Different learning choices may skew this learning pressure, and affect the simultaneous joint learning of multiple agents. Motivation Multi-Agent Stochastic Processes are becoming increasingly popular as a modelling paradigm. Game theoretic approaches commonly rely on the participating agents having full access to the process dynamics in advance, and then solve to find the best solution analytically, but with large problems this approach is...

Luke Dickens

Real-time Traffic

AAAI 2008 | Intelligent Agents | Learning Pressure | Multiple Agents | Non-cooperative Restricted-memory Agents |

claim paper

Post Info
More Details (n/a)

Added	02 Oct 2010
Updated	02 Oct 2010
Type	Conference
Year	2008
Where	AAAI
Authors	Luke Dickens

Comments (0)

Sciweavers

Perpetual Learning for Non-Cooperative Multiple Agents

AAAI 2008 | Intelligent Agents | Learning Pressure | Multiple Agents | Non-cooperative Restricted-memory Agents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers