Search Sciweavers | Sciweavers

417 search results - page 29 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

138

click to vote

ICS
2010
Tsinghua U.

145views Distributed And Parallel Com...» more ICS 2010»

Space-Efficient Estimation of Robust Statistics and Distribution Testing

16 years 1 months ago

Download www.cs.umass.edu

: The generic problem of estimation and inference given a sequence of i.i.d. samples has been extensively studied in the statistics, property testing, and learning communities. A n...

Steve Chien, Katrina Ligett, Andrew McGregor

claim paper

Read More »

122

click to vote

ICML
2004
IEEE

128views Machine Learning» more ICML 2004»

Learning random walk models for inducing word dependency distributions

16 years 4 months ago

Download nlp.stanford.edu

Many NLP tasks rely on accurately estimating word dependency probabilities P(w1|w2), where the words w1 and w2 have a particular relationship (such as verb-object). Because of the...

Kristina Toutanova, Christopher D. Manning, Andrew...

claim paper

Read More »

151

click to vote

ATAL
2010
Springer

172views Intelligent Agents» more ATAL 2010»

Distributed multiagent learning with a broadcast adaptive subgradient method

15 years 5 months ago

Download www.aamas-conference.org

Many applications in multiagent learning are essentially convex optimization problems in which agents have only limited communication and partial information about the function be...

Renato L. G. Cavalcante, Alex Rogers, Nicholas R. ...

claim paper

Read More »

132

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

15 years 5 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

143

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Bayesian actor-critic algorithms

16 years 4 months ago

Download www.machinelearning.org

We1 present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

claim paper

Read More »

« Prev « First page 29 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers