Sciweavers

417 search results - page 20 / 84
» Reinforcement Learning Estimation of Distribution Algorithm
Sort
View
ICDCS
2007
IEEE
15 years 8 months ago
Distributed Density Estimation Using Non-parametric Statistics
Learning the underlying model from distributed data is often useful for many distributed systems. In this paper, we study the problem of learning a non-parametric model from distr...
Yusuo Hu, Hua Chen, Jian-Guang Lou, Jiang Li
CEC
2007
IEEE
15 years 8 months ago
Bayesian inference in estimation of distribution algorithms
— Metaheuristics such as Estimation of Distribution Algorithms and the Cross-Entropy method use probabilistic modelling and inference to generate candidate solutions in optimizat...
Marcus Gallagher, Ian Wood, Jonathan M. Keith, Geo...
141
Voted
JMLR
2010
119views more  JMLR 2010»
14 years 9 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
IJKESDP
2010
94views more  IJKESDP 2010»
14 years 11 months ago
Rule acquisition for cognitive agents by using estimation of distribution algorithms
Cognitive Agents must be able to decide their actions based on their recognized states. In general, learning mechanisms are equipped for such agents in order to realize intellgent ...
Tokue Nishimura, Hisashi Handa
ICML
2010
IEEE
15 years 6 days ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner