Sciweavers

6715 search results - page 200 / 1343
» Learning from a Test Set
Sort
View
146
Voted
KAIS
2010
144views more  KAIS 2010»
15 years 1 months ago
Boosting support vector machines for imbalanced data sets
Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers t...
Benjamin X. Wang, Nathalie Japkowicz
137
Voted
EMNLP
2009
15 years 1 months ago
Web-Scale Distributional Similarity and Entity Set Expansion
Computing the pairwise semantic similarity between all words on the Web is a computationally challenging task. Parallelization and optimizations are necessary. We propose a highly...
Patrick Pantel, Eric Crestan, Arkady Borkovsky, An...
127
Voted
ICDM
2009
IEEE
200views Data Mining» more  ICDM 2009»
15 years 1 months ago
Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
Abstract--Imbalanced data sets present a particular challenge to the data mining community. Often, it is the rare event that is of interest and the cost of misclassifying the rare ...
Suzan Koknar-Tezel, Longin Jan Latecki
128
Voted
EMNLP
2006
15 years 4 months ago
Domain Adaptation with Structural Correspondence Learning
Discriminative learning methods are widely used in natural language processing. These methods work best when their training and test data are drawn from the same distribution. For...
John Blitzer, Ryan T. McDonald, Fernando Pereira
144
Voted
IJCAI
2003
15 years 4 months ago
Integrating Background Knowledge Into Text Classification
We present a description of three different algorithms that use background knowledge to improve text classifiers. One uses the background knowledge as an index into the set of tra...
Sarah Zelikovitz, Haym Hirsh