Sciweavers

ICML
2003
IEEE
15 years 10 months ago
The Cross Entropy Method for Fast Policy Search
We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...
Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
ICML
2003
IEEE
15 years 10 months ago
Action Elimination and Stopping Conditions for Reinforcement Learning
We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning upper and lower estimates of th...
Eyal Even-Dar, Shie Mannor, Yishay Mansour
ICML
2003
IEEE
15 years 10 months ago
Representational Issues in Meta-Learning
To address the problem of algorithm selection for the classification task, we equip a relational case base with new similarity measures that are able to cope with multirelational ...
Alexandros Kalousis, Melanie Hilario
ICML
2003
IEEE
15 years 10 months ago
Low Bias Bagged Support Vector Machines
Theoretical and experimental analyses of bagging indicate that it is primarily a variance reduction technique. This suggests that bagging should be applied to learning algorithms ...
Giorgio Valentini, Thomas G. Dietterich
ICML
2003
IEEE
15 years 10 months ago
Semi-Supervised Learning of Mixture Models
This paper analyzes the performance of semi-supervised learning of mixture models. We show that unlabeled data can lead to an increase in classification error even in situations wh...
Fabio Gagliardi Cozman, Ira Cohen, Marcelo Cesar C...