Sciweavers

135 search results - page 18 / 27
» Using Reinforcement Learning to Coordinate Better
Sort
View
ICML
2010
IEEE
14 years 7 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
CCIA
2009
Springer
14 years 10 months ago
Interaction, observance or both? Study of the effects on convention emergence
Abstract. Social conventions are useful self-sustaining protocols for groups to coordinate behavior without a centralized entity enforcing coordination. The emergence of such conve...
Daniel Villatoro, Jordi Sabater-Mir, Sandip Sen
IDEAL
2004
Springer
15 years 2 months ago
Learning Users' Interests in a Market-Based Recommender System
Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...
Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings
NIPS
1992
14 years 10 months ago
Explanation-Based Neural Network Learning for Robot Control
How can artificial neural nets generalize better from fewer examples? In order to generalize successfully, neural network learning methods typically require large training data se...
Tom M. Mitchell, Sebastian Thrun
GECCO
2005
Springer
153views Optimization» more  GECCO 2005»
15 years 3 months ago
Evolving neural network ensembles for control problems
In neuroevolution, a genetic algorithm is used to evolve a neural network to perform a particular task. The standard approach is to evolve a population over a number of generation...
David Pardoe, Michael S. Ryoo, Risto Miikkulainen