Sciweavers

366 search results - page 59 / 74
» Using WordNet for Text Categorization
ANLP
2000
15 years 3 months ago
Using Corpus-derived Name Lists for Named Entity Recognition
This paper describes experiments to establish the performance of a named entity recognition system which builds categorized lists of names from manually annotated training data. N...
Mark Stevenson, Robert J. Gaizauskas
ADCS
2004
15 years 3 months ago
Phrases and Feature Selection in E-Mail Classification
In this paper we study the effectiveness of using a phrase-based representation in e-mail classification, and the effect this approach has on a number of machine learning algorithm...
Elisabeth Crawford, Irena Koprinska, Jon Patrick
SIGIR
2002
ACM
15 years 1 month ago
Automatic classification in product catalogs
In this paper, we present the AutoCat system for product classification. AutoCat uses a vector space model, modified to consider product attributes unavailable in traditional docu...
Ben Wolin
SIGIR
2006
ACM
15 years 7 months ago
Identifying comparative sentences in text documents
This paper studies the problem of identifying comparative sentences in text documents. The problem is related to but quite different from sentiment/opinion sentence identification...
Nitin Jindal, Bing Liu
ICDAR
1997
IEEE
15 years 6 months ago
Representing OCRed documents in HTML
OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari