Sciweavers

» Using WordNet for Text Categorization
ANLP
2000
14 years 10 months ago
Using Corpus-derived Name Lists for Named Entity Recognition
This paper describes experiments to establish the performance of a named entity recognition system which builds categorized lists of names from manually annotated training data. N...
Mark Stevenson, Robert J. Gaizauskas
ADCS
2004
14 years 10 months ago
Phrases and Feature Selection in E-Mail Classification
In this paper we study the effectiveness of using a phrase-based representation in e-mail classification, and the effect this approach has on a number of machine learning algorithm...
Elisabeth Crawford, Irena Koprinska, Jon Patrick
SIGIR
2002
ACM
14 years 9 months ago
Automatic classification in product catalogs
In this paper, we present the AutoCat system for product classification. AutoCat uses a vector space model, modified to consider product attributes unavailable in traditional docu...
Ben Wolin
SIGIR
2006
ACM
15 years 3 months ago
Identifying comparative sentences in text documents
This paper studies the problem of identifying comparative sentences in text documents. The problem is related to but quite different from sentiment/opinion sentence identification...
Nitin Jindal, Bing Liu
ICDAR
1997
IEEE
15 years 1 month ago
Representing OCRed documents in HTML
OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari