Search Sciweavers | Sciweavers

1233 search results - page 83 / 247

» Reinforcement Learning in MirrorBot

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 2 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

157

Voted

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 5 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

114

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 3 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

Voted

ECML
1997
Springer

79views Machine Learning» more ECML 1997»

Ibots Learn Genuine Team Solutions

15 years 5 months ago

Download www.idsia.ch

\Ibots" (Integrating roBOTS) is a computer experiment in group learning. It is designed to understand how to use reinforcement learning to program automatically a team of robo...

Cristina Versino, Luca Maria Gambardella

claim paper

Read More »

108

click to vote

ICMAS
1998

157views Intelligent Agents» more ICMAS 1998»

The Moving Target Function Problem in Multi-Agent Learning

15 years 2 months ago

Download jmvidal.cse.sc.edu

We describe a framework that can be used to model and predict the behavior of MASs with learning agents. It uses a difference equation for calculating the progression of an agent&...

José M. Vidal, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 83 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers