Search Sciweavers | Sciweavers

417 search results - page 10 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

click to vote

HPDC
2009
IEEE

108views Distributed And Parallel Com...» more HPDC 2009»

Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning

15 years 3 months ago

Download www.cs.vu.nl

In this paper we describe Maestro, a dataflow computation framework for Ibis, our Java-based grid middleware. The novelty of Maestro is that it is a self-organizing peer-to-peer s...

C. van Reeuwijk

claim paper

Read More »

104

click to vote

NIPS
2003

105views Information Technology» more NIPS 2003»

Gaussian Processes in Reinforcement Learning

15 years 1 months ago

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss

claim paper

Read More »

Voted

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 1 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

16 years 14 days ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

Voted

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

15 years 5 months ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

« Prev « First page 10 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers