Search Sciweavers | Sciweavers

779 search results - page 13 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

ICML
2001
IEEE

159 views, Machine Learning

Direct Policy Search using Paired Statistical Tests

16 years 7 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

ATAL
2009
Springer

150 views, Intelligent Agents

Learning of coordination: exploiting sparse interactions in multiagent systems

16 years 1 month ago

Download www.cs.cmu.edu

Creating coordinated multiagent policies in environments with uncertainty is a challenging problem, which can be greatly simplified if the coordination needs are known to be limi...

Francisco S. Melo, Manuela M. Veloso

ATAL
2007
Springer

147 views, Intelligent Agents

A reinforcement learning based distributed search algorithm for hierarchical peer-to-peer information retrieval systems

15 years 11 months ago

Download www.haizhengzhang.com

The dominant existing routing strategies employed in peer-to-peer (P2P) based information retrieval (IR) systems are similarity-based approaches. In these approaches, agents depend o...

Haizheng Zhang, Victor R. Lesser

JUCS
2007

98 views

Focus of Attention in Reinforcement Learning

15 years 6 months ago

Download www.research.rutgers.edu

Abstract: Classification-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...

Lihong Li, Vadim Bulitko, Russell Greiner

NIPS
2001

121 views, Information Technology

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar
