Sciweavers

1875 search results - page 218 / 375
» Data Mining for Web Intelligence
Sort
View
KDD
2005
ACM
149views Data Mining» more  KDD 2005»
15 years 11 months ago
A distributed learning framework for heterogeneous data sources
We present a probabilistic model-based framework for distributed learning that takes into account privacy restrictions and is applicable to scenarios where the different sites ha...
Srujana Merugu, Joydeep Ghosh
KDD
2009
ACM
180views Data Mining» more  KDD 2009»
16 years 6 months ago
Consensus group stable feature selection
Stability is an important yet under-addressed issue in feature selection from high-dimensional and small sample data. In this paper, we show that stability of feature selection ha...
Steven Loscalzo, Lei Yu, Chris H. Q. Ding
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
16 years 6 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
16 years 6 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
KDD
1998
ACM
80views Data Mining» more  KDD 1998»
15 years 10 months ago
Human Performance on Clustering Web Pages: A Preliminary Study
With the increase in information on the World Wide Web it has become difficult to quickly find desired information without using multiple queries or using a topic-specific search ...
Sofus A. Macskassy, Arunava Banerjee, Brian D. Dav...