Sciweavers

34 search results - page 4 / 7
» Scalable Term Selection for Text Categorization
Sort
View
KDD
2008
ACM
120views Data Mining» more  KDD 2008»
14 years 5 months ago
Entity categorization over large document collections
Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...
Arnd Christian König, Rares Vernica, Venkates...
DOCENG
2006
ACM
13 years 11 months ago
NEWPAR: an automatic feature selection and weighting schema for category ranking
Category ranking provides a way to classify plain text documents into a pre-determined set of categories. This work proposes to have a look at typical document collections and ana...
Fernando Ruiz-Rico, José Luis Vicedo Gonz&a...
ICDE
2007
IEEE
115views Database» more  ICDE 2007»
14 years 6 months ago
SPRITE: A Learning-Based Text Retrieval System in DHT Networks
In this paper, we propose SPRITE (Selective PRogressive Index Tuning by Examples), a scalable system for text retrieval in a structured P2P network. Under SPRITE, each peer is res...
Yingguang Li, H. V. Jagadish, Kian-Lee Tan
KDD
2007
ACM
139views Data Mining» more  KDD 2007»
14 years 5 months ago
Raising the baseline for high-precision text classifiers
Many important application areas of text classifiers demand high precision and it is common to compare prospective solutions to the performance of Naive Bayes. This baseline is us...
Aleksander Kolcz, Wen-tau Yih
ICAIL
2003
ACM
13 years 10 months ago
Concept Extraction from Legal Cases: The Use of a Statistic of Coincidence
Effective retrieval of court decisions is important. Automatically identifying legal concepts in the decision texts would be very helpful. In this paper we investigate how a stat...
Marie-Francine Moens, Roxana Angheluta