Sciweavers

77 search results - page 2 / 16
» Building a dynamic classifier for large text data collection...
DL
2000
Springer
13 years 9 months ago
Snowball: extracting relations from large plain-text collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Luis Gravano
CIKM
2000
Springer
13 years 9 months ago
Scalable association-based text classification
The Naïve Bayes (NB) classifier has long been considered a core methodology in text classification, mainly due to its simplicity and computational efficiency. There is an increasing n...
Dimitris Meretakis, Dimitris Fragoudis, Hongjun Lu...
SIGIR
1999
ACM
13 years 9 months ago
Efficient Distributed Algorithms to Build Inverted Files
We present three distributed algorithms to build global inverted files for very large text collections. The distributed environment we use is a high-bandwidth network of workstati...
Berthier A. Ribeiro-Neto, Edleno Silva de Moura, M...
IJDLS
2010
13 years 2 months ago
Sampling the Web as Training Data for Text Classification
Data acquisition is a major concern in text classification. The excessive human effort required by conventional methods to build up a quality training collection might not always b...
Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, Pu-Jen ...
KDD
2002
ACM
14 years 5 months ago
A parallel learning algorithm for text classification
Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify te...
Canasai Kruengkrai, Chuleerat Jaruskulchai