Sciweavers

540 search results - page 35 / 108
» Efficient Similarity Search for Hierarchical Data in Large D...
Sort
View
ICDE
2004
IEEE
151views Database» more  ICDE 2004»
15 years 11 months ago
Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks
We study the problem of maintaining large replicated collections of files or documents in a distributed environment with limited bandwidth. This problem arises in a number of impo...
Torsten Suel, Patrick Noel, Dimitre Trendafilov
DASFAA
2005
IEEE
136views Database» more  DASFAA 2005»
15 years 3 months ago
Indexing DNA Sequences Using q-Grams
We have observed in recent years a growing interest in similarity search on large collections of biological sequences. Contributing to the interest, this paper presents a method fo...
Xia Cao, Shuai Cheng Li, Anthony K. H. Tung
CORR
2006
Springer
178views Education» more  CORR 2006»
14 years 9 months ago
A tool set for the quick and efficient exploration of large document collections
: We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...
SIGMOD
2001
ACM
200views Database» more  SIGMOD 2001»
15 years 10 months ago
Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering
In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or ra...
Markus M. Breunig, Hans-Peter Kriegel, Peer Kr&oum...
DEXA
2004
Springer
136views Database» more  DEXA 2004»
15 years 3 months ago
PC-Filter: A Robust Filtering Technique for Duplicate Record Detection in Large Databases
: In this paper, we will propose PC-Filter (PC stands for Partition Comparison), a robust data filter for approximately duplicate record detection in large databases. PC-Filter dis...
Ji Zhang, Tok Wang Ling, Robert M. Bruckner, Han L...