Sciweavers

» Finding Structure in Reinforcement Learning
GECCO
2006
Springer
Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure
The learning classifier system XCS is an iterative rule-learning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...
Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson
PKDD
2009
Springer
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...
NIPS
2004
Similarity and Discrimination in Classical Conditioning: A Latent Variable Account
We propose a probabilistic, generative account of configural learning phenomena in classical conditioning. Configural learning experiments probe how animals discriminate and gener...
Aaron C. Courville, Nathaniel D. Daw, David S. Tou...
IROS
2008
IEEE
Mutual development of behavior acquisition and recognition based on value system
Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...
Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada
NIPS
1996
Why did TD-Gammon Work?
Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...
Jordan B. Pollack, Alan D. Blair