Search Sciweavers | Sciweavers

AAAI
2010

171 views, Intelligent Agents

Multi-Agent Learning with Policy Prediction

15 years 7 months ago

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

NIPS
2008

130 views, Information Technology

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

15 years 7 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

ATAL
2005
Springer

148 views, Intelligent Agents

An integrated framework for adaptive reasoning about conversation patterns

15 years 11 months ago

Download homepages.inf.ed.ac.uk

We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...

Michael Rovatsos, Felix A. Fischer, Gerhard Wei&sz...

FLAIRS
2008

132 views, Artificial Intelligence

Learning Continuous Action Models in a Real-Time Strategy Environment

15 years 8 months ago

Download www.knexusresearch.com

Although several researchers have integrated methods for reinforcement learning (RL) with case-based reasoning (CBR) to model continuous action spaces, existing integrations typic...

Matthew Molineaux, David W. Aha, Philip Moore

ICML
2008
IEEE

162 views, Machine Learning

Automatic discovery and transfer of MAXQ hierarchies

16 years 6 months ago

Download pages.cs.wisc.edu

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...

Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...
