Search Sciweavers | Sciweavers

4 search results - page 1 / 1

ICML
2010
IEEE

222 views, Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

13 years 2 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

ICML
2007
IEEE

180 views, Machine Learning

Bayesian actor-critic algorithms

14 years 5 months ago

Download www.machinelearning.org

We present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning, is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

CORR
2010
Springer

118 views, Education

Large scale probabilistic available bandwidth estimation

13 years 4 months ago

Download www.tsp.ece.mcgill.ca

The common utilization-based definition of available bandwidth and many of the existing tools to estimate it suffer from several important weaknesses: i) most tools report a point...

Frederic Thouin, Mark Coates, Michael G. Rabbat

ATAL
2010
Springer

181 views, Intelligent Agents

Planning against fictitious players in repeated normal form games

13 years 6 months ago

Download www.aamas-conference.org

Planning how to interact against bounded memory and unbounded memory learning opponents needs different treatment. Thus far, however, work in this area has shown how to design pla...

Enrique Munoz de Cote, Nicholas R. Jennings
