Sciweavers

74 search results - page 8 / 15
» Improved Approximate String Matching Using Compressed Suffix...
Sort
View
SIGMOD
2008
ACM
142views Database» more  SIGMOD 2008»
15 years 9 months ago
Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
Approximate queries on a collection of strings are important in many applications such as record linkage, spell checking, and Web search, where inconsistencies and errors exist in...
Xiaochun Yang, Bin Wang, Chen Li
CSB
2005
IEEE
151views Bioinformatics» more  CSB 2005»
15 years 3 months ago
Lossless Compression of DNA Microarray Images
Microarray experiments are characterized by a massive amount of data, usually in the form of an image. Based on the nature of microarray images, we consider the microarray in term...
Yong Zhang, Rahul Parthe, Donald A. Adjeroh
SIGMOD
2008
ACM
100views Database» more  SIGMOD 2008»
14 years 9 months ago
Incorporating string transformations in record matching
Today's record matching infrastructure does not allow a flexible way to account for synonyms such as "Robert" and "Bob" which refer to the same name, and ...
Arvind Arasu, Surajit Chaudhuri, Kris Ganjam, Ragh...
VLDB
1993
ACM
138views Database» more  VLDB 1993»
15 years 1 months ago
Searching Large Lexicons for Partially Specified Terms using Compressed Inverted Files
There are many advantages to be gained by storing the lexicon of a full text database in main memory. In this paper we describe how to use a compressed inverted file index to sear...
Justin Zobel, Alistair Moffat, Ron Sacks-Davis
ALMOB
2006
102views more  ALMOB 2006»
14 years 9 months ago
Mining, compressing and classifying with extensible motifs
Background: Motif patterns of maximal saturation emerged originally in contexts of pattern discovery in biomolecular sequences and have recently proven a valuable notion also in t...
Alberto Apostolico, Matteo Comin, Laxmi Parida