Sciweavers

ICML
2002
IEEE

Partially Supervised Classification of Text Documents

14 years 5 months ago
Partially Supervised Classification of Text Documents
We investigate the following problem: Given a set of documents of a particular topic or class ?, and a large set ? of mixed documents that contains documents from class ? and other types of documents, identify the documents from class ? in ?. The key feature of this problem is that there is no labeled non? document, which makes traditional machine learning techniques inapplicable, as they all need labeled documents of both classes. We call this problem partially supervised classification. In this paper, we show that this problem can be posed as a constrained optimization problem and that under appropriate conditions, solutions to the constrained optimization problem will give good solutions to the partially supervised classification problem. We present a novel technique to solve the problem and demonstrate the effectiveness of the technique through extensive experimentation.
Bing Liu, Wee Sun Lee, Philip S. Yu, Xiaoli Li
Added 17 Nov 2009
Updated 17 Nov 2009
Type Conference
Year 2002
Where ICML
Authors Bing Liu, Wee Sun Lee, Philip S. Yu, Xiaoli Li
Comments (0)