Search Sciweavers | Sciweavers

513 search results - page 19 / 103

» Metric learning for reinforcement learning agents

107

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 4 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

ATAL
2006
Springer

142views Intelligent Agents» more ATAL 2006»

Probabilistic policy reuse in a reinforcement learning agent

15 years 4 months ago

Download www.cs.cmu.edu

We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...

Fernando Fernández, Manuela M. Veloso

claim paper

Read More »

127

click to vote

AAAI
2011

206views Intelligent Agents» more AAAI 2011»

Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs

14 years 14 days ago

Download www.cs.umass.edu

In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

109

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

Markov Games as a Framework for Multi-Agent Reinforcement Learning

15 years 4 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman

claim paper

Read More »

click to vote

AAMAS
2005
Springer

133views Intelligent Agents» more AAMAS 2005»

Advice-Exchange Between Evolutionary Algorithms and Reinforcement Learning Agents: Experiments in the Pursuit Domain

15 years 6 months ago

Download iscte.pt

This research aims at studying the effects of exchanging information during the learning process in Multiagent Systems. The concept of advice-exchange, introduced in (Nunes and Ol...

Luís Nunes, Eugénio C. Oliveira

claim paper

Read More »

« Prev « First page 19 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers