Sciweavers

1880 search results - page 122 / 376
» Robust Learning - Rich and Poor
Sort
View
VISAPP
2008
15 years 7 months ago
Towards the Estimation of Conspicuity with Visual Priors
Traffic signs are designed to be clearly seen by drivers. However a little is known about the visual influence of the traffic sign environment on how it will be perceived. Computer...
Ludovic Simon, Jean-Philippe Tarel, Roland Bremond
184
Voted
JMLR
2010
139views more  JMLR 2010»
15 years 27 days ago
Tempered Markov Chain Monte Carlo for training of Restricted Boltzmann Machines
Alternating Gibbs sampling is the most common scheme used for sampling from Restricted Boltzmann Machines (RBM), a crucial component in deep architectures such as Deep Belief Netw...
Guillaume Desjardins, Aaron C. Courville, Yoshua B...
156
Voted
GECCO
2009
Springer
135views Optimization» more  GECCO 2009»
16 years 20 days ago
Neuroevolutionary reinforcement learning for generalized helicopter control
Helicopter hovering is an important challenge problem in the field of reinforcement learning. This paper considers several neuroevolutionary approaches to discovering robust cont...
Rogier Koppejan, Shimon Whiteson
194
Voted
AAMAS
2007
Springer
16 years 9 days ago
Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game
Abstract. The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...
Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...
172
Voted
ICML
1996
IEEE
15 years 10 months ago
Discovering Structure in Multiple Learning Tasks: The TC Algorithm
Recently, there has been an increased interest in "lifelong" machine learning methods, that transfer knowledge across multiple learning tasks. Such methods have repeated...
Sebastian Thrun, Joseph O'Sullivan