Sciweavers

587 search results - page 24 / 118
» New Algorithms for Text Fingerprinting
TKDE
2008
14 years 9 months ago
Text Clustering with Feature Selection by Using Statistical Data
Feature selection is an important method for improving the efficiency and accuracy of text categorization algorithms by removing redundant and irrelevant terms from the ...
Yanjun Li, Congnan Luo, Soon M. Chung
JMLR
2002
14 years 9 months ago
Text Chunking based on a Generalization of Winnow
This paper describes a text chunking system based on a generalization of the Winnow algorithm. We propose a general statistical model for text chunking which we then convert into ...
Tong Zhang, Fred Damerau, David Johnson
CPM
1999
Springer
15 years 2 months ago
A General Practical Approach to Pattern Matching over Ziv-Lempel Compressed Text
We address the problem of string matching on Ziv-Lempel compressed text. The goal is to search a pattern in a text without uncompressing it. This is a highly relevant issue to keep...
Gonzalo Navarro, Mathieu Raffinot
EMNLP
2007
14 years 11 months ago
Incremental Text Structuring with Online Hierarchical Ranking
Many emerging applications require documents to be repeatedly updated. Such documents include newsfeeds, webpages, and shared community resources such as Wikipedia. In this paper ...
Erdong Chen, Benjamin Snyder, Regina Barzilay
ECIR
2006
Springer
14 years 11 months ago
Generating Search Term Variants for Text Collections with Historic Spellings
In this paper, we describe a new approach for retrieval in texts with non-standard spelling, which is important for historic texts in English or German. For this purpose, we presen...
Andrea Ernst-Gerlach, Norbert Fuhr