Sciweavers

915 search results - page 73 / 183
» Template Based Structured Collections
Sort
View
153
Voted
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
15 years 10 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
ECAI
2008
Springer
15 years 4 months ago
Pedigree Tracking in the Face of Ancillary Content
The accurate tracking and retrieval of content pedigree is a quickly growing requirement as our abilities to create information assets increases exponentially. Plagiarism detection...
Eugene Creswick, Emi Fujioka, Terrance Goan
BMCBI
2005
119views more  BMCBI 2005»
15 years 3 months ago
The distance-profile representation and its application to detection of distantly related protein families
Background: Detecting homology between remotely related protein families is an important problem in computational biology since the biological properties of uncharacterized protei...
Chin-Jen Ku, Golan Yona
ICDE
2009
IEEE
135views Database» more  ICDE 2009»
16 years 4 months ago
Space-Constrained Gram-Based Indexing for Efficient Approximate String Search
Abstract-- Answering approximate queries on string collections is important in applications such as data cleaning, query relaxation, and spell checking, where inconsistencies and e...
Alexander Behm, Shengyue Ji, Chen Li, Jiaheng Lu
134
Voted
RECOMB
2009
Springer
16 years 3 months ago
Storage and Retrieval of Individual Genomes
A repetitive sequence collection is one where portions of a base sequence of length n are repeated many times with small variations, forming a collection of total length N. Example...
Gonzalo Navarro, Jouni Sirén, Niko Väl...