Sciweavers

KDD
2008
ACM
Mining multi-faceted overviews of arbitrary topics in a text collection
A common task in many text mining applications is to generate a multi-faceted overview of a topic in a text collection. Such an overview not only directly serves as an informative...
Xu Ling, Qiaozhu Mei, ChengXiang Zhai, Bruce R. Sc...
KDD
2008
ACM
Privacy-preserving Cox regression for survival analysis
Privacy-preserving data mining (PPDM) is an emergent research area that addresses the incorporation of privacy preserving concerns to data mining techniques. In this paper we prop...
Shipeng Yu, Glenn Fung, Rómer Rosales, Srir...
KDD
2008
ACM
A unified approach for schema matching, coreference and canonicalization
Michael L. Wick, Khashayar Rohanimanesh, Karl Schu...
KDD
2008
ACM
A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances
This work introduces a new family of link-based dissimilarity measures between nodes of a weighted directed graph. This measure, called the randomized shortest-path (RSP) dissimil...
Luh Yen, Marco Saerens, Amin Mantrach, Masashi Shi...
KDD
2008
ACM
Customer targeting models using actively-selected web content
Prem Melville, Saharon Rosset, Richard D. Lawrence
KDD
2008
ACM
Anonymizing transaction databases for publication
Yabo Xu, Ke Wang, Ada Wai-Chee Fu, Philip S. Yu
KDD
2008
ACM
Fast collapsed Gibbs sampling for latent Dirichlet allocation
Ian Porteous, David Newman, Alexander T. Ihler, Ar...
KDD
2008
ACM
Composition attacks and auxiliary information in data privacy
Privacy is an increasingly important aspect of data publishing. Reasoning about privacy, however, is fraught with pitfalls. One of the most significant is the auxiliary informatio...
Srivatsava Ranjit Ganta, Shiva Prasad Kasiviswanat...
KDD
2008
ACM
Hypergraph spectral learning for multi-label classification
A hypergraph is a generalization of the traditional graph in which the edges are arbitrary non-empty subsets of the vertex set. It has been applied successfully to capture highord...
Liang Sun, Shuiwang Ji, Jieping Ye
KDD
2008
ACM
Volatile correlation computation: a checkpoint view
Recent years have witnessed increased interest in computing strongly correlated pairs in very large databases. Most previous studies have been focused on static data sets. However...
Wenjun Zhou, Hui Xiong