Search Sciweavers | Sciweavers

513 search results - page 35 / 103

» Metric learning for reinforcement learning agents

101

Voted

AAAI
2006

129views Intelligent Agents» more AAAI 2006»

On the Difficulty of Modular Reinforcement Learning for Real-World Partial Programming

15 years 1 months ago

Download www.cc.gatech.edu

In recent years there has been a great deal of interest in "modular reinforcement learning" (MRL). Typically, problems are decomposed into concurrent subgoals, allowing ...

Sooraj Bhat, Charles Lee Isbell Jr., Michael Matea...

claim paper

Read More »

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 1 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

103

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

15 years 1 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

105

click to vote

ATAL
2007
Springer

128views Intelligent Agents» more ATAL 2007»

Advice taking in multiagent reinforcement learning

15 years 6 months ago

Download homepages.inf.ed.ac.uk

This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) in the stochastic games framework that uses an additional “advice” signal to inform agen...

Michael Rovatsos, Alexandros Belesiotis

claim paper

Read More »

102

Voted

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

16 years 1 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

« Prev « First page 35 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers