Search Sciweavers | Sciweavers

417 search results - page 20 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

108

click to vote

ICDCS
2007
IEEE

111views Distributed And Parallel Com...» more ICDCS 2007»

Distributed Density Estimation Using Non-parametric Statistics

15 years 8 months ago

Download research.microsoft.com

Learning the underlying model from distributed data is often useful for many distributed systems. In this paper, we study the problem of learning a non-parametric model from distr...

Yusuo Hu, Hua Chen, Jian-Guang Lou, Jiang Li

claim paper

Read More »

106

click to vote

CEC
2007
IEEE

126views Artificial Intelligence» more CEC 2007»

Bayesian inference in estimation of distribution algorithms

15 years 8 months ago

Download www.itee.uq.edu.au

— Metaheuristics such as Estimation of Distribution Algorithms and the Cross-Entropy method use probabilistic modelling and inference to generate candidate solutions in optimizat...

Marcus Gallagher, Ian Wood, Jonathan M. Keith, Geo...

claim paper

Read More »

141

Voted

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

14 years 9 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

130

click to vote

IJKESDP
2010

94views more IJKESDP 2010»

Rule acquisition for cognitive agents by using estimation of distribution algorithms

14 years 11 months ago

Download ir.lib.hiroshima-u.ac.jp

Cognitive Agents must be able to decide their actions based on their recognized states. In general, learning mechanisms are equipped for such agents in order to realize intellgent ...

Tokue Nishimura, Hisashi Handa

claim paper

Read More »

131

click to vote

ICML
2010
IEEE

222views Machine Learning» more ICML 2010»

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 6 days ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

claim paper

Read More »

« Prev « First page 20 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers