Sciweavers

71 search results - page 1 / 15
IJCAI
2003
13 years 6 months ago
Frequency-Based Coverage Statistics Mining for Data Integration
Recent work in data integration has shown the importance of statistical information about the coverage and overlap of data sources for efficient query processing. Gathering and s...
Zaiqing Nie, Subbarao Kambhampati
ICDE
2004
IEEE
14 years 6 months ago
A Frequency-based Approach for Mining Coverage Statistics in Data Integration
Query optimization in data integration requires source coverage and overlap statistics. Gathering and storing the required statistics presents many challenges, not the least of wh...
Zaiqing Nie, Subbarao Kambhampati
BIBE
2005
IEEE
13 years 10 months ago
Using Data Mining Techniques to Learn Layouts of Flat-File Biological Datasets
One of the major problems in biological data integration is that many data sources are stored as flat-files, with a variety of different layouts. Integrating data from such sour...
Kaushik Sinha, Xuan Zhang, Ruoming Jin, Gagan Agra...
ICDM
2003
IEEE
13 years 10 months ago
Statistical Relational Learning for Document Mining
A major obstacle to fully integrated deployment of many data mining algorithms is the assumption that data sits in a single table, even though most real-world databases have compl...
Alexandrin Popescul, Lyle H. Ungar, Steve Lawrence...
BMCBI
2008
13 years 5 months ago
EST2uni: an open, parallel tool for automated EST analysis and database creation, with a data mining web interface and microarra
Background: Expressed sequence tag (EST) collections are composed of a high number of single-pass, redundant, partial sequences, which need to be processed, clustered, and annotat...
Javier Forment, Francisco Gilabert Villamón...