Sciweavers

341 search results - page 8 / 69
» Data Cleaning and Semantic Improvement in Biological Databas...
Sort
View
138
Voted
NAR
2011
267views Computer Vision» more  NAR 2011»
14 years 4 months ago
EcoCyc: a comprehensive database of Escherichia coli biology
EcoCyc (http://EcoCyc.org) is a comprehensive model organism database for Escherichia coli K-12 MG1655. From the scientific literature, EcoCyc captures the functions of individual...
Ingrid M. Keseler, Julio Collado-Vides, Alberto Sa...
161
Voted
ICDE
2009
IEEE
121views Database» more  ICDE 2009»
15 years 11 months ago
Large-Scale Deduplication with Constraints Using Dedupalog
We present a declarative framework for collective deduplication of entity references in the presence of constraints. Constraints occur naturally in many data cleaning domains and c...
Arvind Arasu, Christopher Ré, Dan Suciu
LREC
2008
166views Education» more  LREC 2008»
14 years 11 months ago
A lexicon for biology and bioinformatics: the BOOTStrep experience
This paper describes the design, implementation and population of a lexical resource for biology and bioinformatics (the BioLexicon) developed within an ongoing European project. ...
Valeria Quochi, Monica Monachini, Riccardo Del Gra...
DANTE
1999
IEEE
119views Database» more  DANTE 1999»
15 years 2 months ago
A Semantic Caching Method Based on Linear Constraints
Because performance is a crucial issue in database systems, data caching techniques have been studied in database research field, especially in client-server databases and distrib...
Yoshiharu Ishikawa, Hiroyuki Kitagawa
ICDE
2008
IEEE
147views Database» more  ICDE 2008»
15 years 11 months ago
Fast Indexes and Algorithms for Set Similarity Selection Queries
Data collections often have inconsistencies that arise due to a variety of reasons, and it is desirable to be able to identify and resolve them efficiently. Set similarity queries ...
Marios Hadjieleftheriou, Amit Chandel, Nick Koudas...