Sciweavers

697 search results - page 68 / 140
» Plasticity-Mediated Competitive Learning
AR
2008
15 years 25 days ago
Efficient Behavior Learning Based on State Value Estimation of Self and Others
Existing reinforcement learning methods suffer seriously from the curse of dimensionality, especially when they are applied to multiagent dynamic environments. ...
Yasutake Takahashi, Kentarou Noma, Minoru Asada
ICDM
2009
IEEE
15 years 7 months ago
Fast Online Training of Ramp Loss Support Vector Machines
A fast online algorithm, OnlineSVMR, for training Ramp-Loss Support Vector Machines (SVMRs) is proposed. It finds the optimal SVMR for t+1 training examples using SVMR built on t...
Zhuang Wang, Slobodan Vucetic
ECML
2006
Springer
15 years 4 months ago
Transductive Gaussian Process Regression with Automatic Model Selection
In contrast to the standard inductive inference setting of predictive machine learning, in real-world learning problems the test instances are often already available at ...
Quoc V. Le, Alexander J. Smola, Thomas Gärtne...
NIPS
1996
15 years 2 months ago
Why did TD-Gammon Work?
Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...
Jordan B. Pollack, Alan D. Blair
NN
2006
Springer
15 years 20 days ago
The misbehavior of value and the discipline of the will
Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...
Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...