Sciweavers

1078 search results - page 134 / 216
» Using Machine Learning to Support Debugging with Tarantula
KDD
2003
ACM
16 years 3 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
KDD
2002
ACM
16 years 3 months ago
Optimizing search engines using clickthrough data
This paper presents an approach to automatically optimizing the retrieval quality of search engines using clickthrough data. Intuitively, a good information retrieval system shoul...
Thorsten Joachims
ICML
2004
IEEE
16 years 3 months ago
Improving SVM accuracy by training on auxiliary data sources
The standard model of supervised learning assumes that training and test data are drawn from the same underlying distribution. This paper explores an application in which a second...
Pengcheng Wu, Thomas G. Dietterich
ECML
2005
Springer
15 years 8 months ago
A Distance-Based Approach for Action Recommendation
Rule induction has attracted a great deal of attention in Machine Learning and Data Mining. However, generating rules is not an end in itself because their applicability ...
Ronan Trepos, Ansaf Salleb, Marie-Odile Cordier, V...
ACL
2010
15 years 23 days ago
It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text
Word sense disambiguation (WSD) systems based on supervised learning achieved the best performance in SensEval and SemEval workshops. However, there are few publicly available ope...
Zhi Zhong, Hwee Tou Ng