Search Sciweavers | Sciweavers

1233 search results - page 148 / 247

» Reinforcement Learning in MirrorBot

click to vote

ATAL
2007
Springer

130views Intelligent Agents» more ATAL 2007»

Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

15 years 4 months ago

Download www.aamas-conference.org

This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...

Liviu Panait, Karl Tuyls

claim paper

Read More »

click to vote

CIMCA
2006
IEEE

147views Intelligent Agents» more CIMCA 2006»

Model-driven Walks for Resource Discovery in Peer-to-Peer Networks

15 years 4 months ago

Download mbakhouya.free.fr

In this paper, a distributed and adaptive approach for resource discovery in peer-to-peer networks is presented. This approach is based on the mobile agent paradigm and the random...

Mohamed Bakhouya, Jaafar Gaber

claim paper

Read More »

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

15 years 10 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

15 years 10 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

15 years 10 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 148 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers