Search Sciweavers | Sciweavers

267 search results - page 17 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

106

click to vote

ICML
2005
IEEE

127views Machine Learning» more ICML 2005»

Exploration and apprenticeship learning in reinforcement learning

16 years 16 days ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

109

click to vote

IJCAI
2007

254views Artificial Intelligence» more IJCAI 2007»

Bayesian Inverse Reinforcement Learning

15 years 1 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

claim paper

Read More »

Voted

IJCNN
2006
IEEE

150views Neural Networks» more IJCNN 2006»

Reinforcement Learning Control for Biped Robot Walking on Uneven Surfaces

15 years 5 months ago

Download www.cs.cmu.edu

— Biped robots based on the concept of (passive) dynamic walking are far simpler than the traditional fullycontrolled walking robots, while achieving a more natural gait and cons...

Shouyi Wang, Jelmer Braaksma, Robert Babuska, Daan...

claim paper

Read More »

Voted

IJCNN
2006
IEEE

121views Neural Networks» more IJCNN 2006»

Learning a Rendezvous Task with Dynamic Joint Action Perception

15 years 5 months ago

Download axon.cs.byu.edu

Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

claim paper

Read More »

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

15 years 6 months ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

« Prev « First page 17 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers