Sciweavers

966 search results - page 54 / 194
» A Two-Level Learning Method for Generalized Multi-instance P...
Sort
View
ATAL
2009
Springer
15 years 9 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
133
Voted
ICML
2007
IEEE
16 years 3 months ago
A transductive framework of distance metric learning by spectral dimensionality reduction
Distance metric learning and nonlinear dimensionality reduction are two interesting and active topics in recent years. However, the connection between them is not thoroughly studi...
Fuxin Li, Jian Yang, Jue Wang
ICML
2006
IEEE
16 years 3 months ago
Relational temporal difference learning
We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...
Nima Asgharbeygi, David J. Stracuzzi, Pat Langley
ML
2002
ACM
121views Machine Learning» more  ML 2002»
15 years 2 months ago
Choosing Multiple Parameters for Support Vector Machines
The problem of automatically tuning multiple parameters for pattern recognition Support Vector Machines (SVMs) is considered. This is done by minimizing some estimates of the gener...
Olivier Chapelle, Vladimir Vapnik, Olivier Bousque...
AMT
2006
Springer
147views Multimedia» more  AMT 2006»
15 years 6 months ago
Semi-Supervised Text Classification Using Positive and Unlabeled Data
Text classification using positive and unlabeled data refers to the problem of building text classifier using positive documents (P) of one class and unlabeled documents (U) of man...
Shuang Yu, Xueyuan Zhou, Chunping Li