Sciweavers

523 search results - page 4 / 105
» Metric Learning for Text Documents
Sort
View
ICML
2002
IEEE
14 years 7 months ago
Partially Supervised Classification of Text Documents
We investigate the following problem: Given a set of documents of a particular topic or class ?, and a large set ? of mixed documents that contains documents from class ? and othe...
Bing Liu, Wee Sun Lee, Philip S. Yu, Xiaoli Li
ICDM
2010
IEEE
147views Data Mining» more  ICDM 2010»
13 years 4 months ago
Location and Scatter Matching for Dataset Shift in Text Mining
Dataset shift from the training data in a source domain to the data in a target domain poses a great challenge for many statistical learning methods. Most algorithms can be viewed ...
Bo Chen, Wai Lam, Ivor W. Tsang, Tak-Lam Wong
ECML
2007
Springer
13 years 10 months ago
Semi-supervised Collaborative Text Classification
Most text categorization methods require text content of documents that is often difficult to obtain. We consider "Collaborative Text Categorization", where each document...
Rong Jin, Ming Wu, Rahul Sukthankar
ECIR
2009
Springer
14 years 3 months ago
Evaluation of Text Clustering Algorithms with N-Gram-Based Document Fingerprints
This paper presents a new approach designed to reduce the computational load of the existing clustering algorithms by trimming down the documents size using fingerprinting methods...
Javier Parapar, Alvaro Barreiro
COLING
2000
13 years 7 months ago
Automatic Text Categorization by Unsupervised Learning
The goal of text categorization is to classify documents into a certain number of pre-defined categories. The previous works in this area have used a large number of labeled train...
Youngjoong Ko, Jungyun Seo