Search Sciweavers | Sciweavers

210 search results - page 12 / 42

» An analysis of reinforcement learning with function approxim...

199

Voted

JMLR
2006

153views more JMLR 2006»

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 7 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

claim paper

Read More »

199

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 8 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

198

click to vote

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 9 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

192

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 11 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

200

Voted

ECAI
2008
Springer

83views Artificial Intelligence» more ECAI 2008»

Reinforcement Learning with the Use of Costly Features

15 years 9 months ago

Download people.cs.kuleuven.be

In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...

Robby Goetschalckx, Scott Sanner, Kurt Driessens

claim paper

Read More »

« Prev « First page 12 / 42 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers