Sciweavers

1315 search results - page 96 / 263
» Discovering Classification from Data of Multiple Sources
Sort
View
COMAD
2008
15 years 3 months ago
Querying for Information Integration: How to go from an Imprecise Intent to a Precise Query?
In this paper, we address the problem of query formulation in the context of multi-domain integration of heterogeneous data on the Web. We argue that effectively tackling this pro...
Aditya Telang, Sharma Chakravarthy, Chengkai Li
SSDBM
2007
IEEE
105views Database» more  SSDBM 2007»
15 years 8 months ago
Maintaining K-Anonymity against Incremental Updates
K-anonymity is a simple yet practical mechanism to protect privacy against attacks of re-identifying individuals by joining multiple public data sources. All existing methods achi...
Jian Pei, Jian Xu, Zhibin Wang, Wei Wang 0009, Ke ...
WWW
2007
ACM
16 years 2 months ago
U-REST: an unsupervised record extraction system
In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...
Yuan Kui Shen, David R. Karger
SDM
2011
SIAM
243views Data Mining» more  SDM 2011»
14 years 4 months ago
Data Integration via Constrained Clustering: An Application to Enzyme Clustering
When multiple data sources are available for clustering, an a priori data integration process is usually required. This process may be costly and may not lead to good clusterings,...
Elisa Boari de Lima, Raquel Cardoso de Melo Minard...
INFOSCALE
2006
ACM
15 years 7 months ago
PENS: an algorithm for density-based clustering in peer-to-peer systems
Huge amounts of data are available in large-scale networks of autonomous data sources dispersed over a wide area. Data mining is an essential technology for obtaining hidden and v...
Mei Li, Guanling Lee, Wang-Chien Lee, Anand Sivasu...