Search Sciweavers | Sciweavers

10054 search results - page 429 / 2011

» On the Complexity of Function Learning

145

click to vote

ROBOCUP
2009
Springer

134views Robotics» more ROBOCUP 2009»

Learning Complementary Multiagent Behaviors: A Case Study

15 years 11 months ago

Download teamcore.usc.edu

As the reach of multiagent reinforcement learning extends to more and more complex tasks, it is likely that the diverse challenges posed by some of these tasks can only be address...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

143

click to vote

ICML
2005
IEEE

135views Machine Learning» more ICML 2005»

Finite time bounds for sampling based fitted value iteration

16 years 5 months ago

Download www.machinelearning.org

In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...

Csaba Szepesvári, Rémi Munos

claim paper

Read More »

137

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

15 years 6 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

108

click to vote

ECML
2000
Springer

74views Machine Learning» more ECML 2000»

Layered Learning

15 years 9 months ago

Download www-lrn.cs.umass.edu

We examine how a network of many knowledge layers can be constructed in an on-line manner, such that the learned units represent building blocks of knowledge that serve to compres...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

145

click to vote

CEC
2005
IEEE

138views Artificial Intelligence» more CEC 2005»

A note on the population based incremental learning with infinite population size

15 years 6 months ago

Download ceit.aut.ac.ir

In this paper, we study the dynamical properties of the population based incremental learning (PBIL) algorithm when it uses truncation, proportional, and Boltzmann selection schema...

Reza Rastegar, Mohammad Reza Meybodi

claim paper

Read More »

« Prev « First page 429 / 2011 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers