Sciweavers

ICML
2003
IEEE
15 years 10 months ago
Using Linear-threshold Algorithms to Combine Multi-class Sub-experts
We present a new type of multi-class learning algorithm called a linear-max algorithm. Linear-max algorithms learn with a special type of attribute called a sub-expert. A sub-exper...
Chris Mesterharm
ICML
2003
IEEE
15 years 10 months ago
TD(0) Converges Provably Faster than the Residual Gradient Algorithm
In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...
Ralf Schoknecht, Artur Merke
ICML
2003
IEEE
15 years 10 months ago
Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution
Feature selection, as a preprocessing step to machine learning, has been effective in reducing dimensionality, removing irrelevant data, increasing learning accuracy, and improvin...
Lei Yu, Huan Liu
ICML
2003
IEEE
15 years 10 months ago
Learning Mixture Models with the Latent Maximum Entropy Principle
We present a new approach to estimating mixture models based on a new inference principle we have proposed: the latent maximum entropy principle (LME). LME is different both from ...
Shaojun Wang, Dale Schuurmans, Fuchun Peng, Yunxin...
ICML
2003
IEEE
15 years 2 months ago
The Significance of Temporal-Difference Learning in Self-Play Training: TD-Rummy versus EVO-Rummy
Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...
Clifford Kotnik, Jugal K. Kalita