Sciweavers

SDM
2007
SIAM
190views Data Mining» more  SDM 2007»
13 years 5 months ago
AC-Framework for Privacy-Preserving Collaboration
The secure multi-party computation (SMC) model provides means for balancing the use and confidentiality of distributed data. Increasing security concerns have led to a surge in w...
Wei Jiang, Chris Clifton
SDM
2007
SIAM
106views Data Mining» more  SDM 2007»
13 years 5 months ago
Approximating Representations for Large Numerical Databases
The paper introduces a notion of support for realvalued functions. It is shown how to approximate supports of a large class of functions based on supports of so called polynomial ...
Szymon Jaroszewicz, Marcin Korzen
SDM
2007
SIAM
133views Data Mining» more  SDM 2007»
13 years 5 months ago
Change-Point Detection using Krylov Subspace Learning
We propose an efficient algorithm for principal component analysis (PCA) that is applicable when only the inner product with a given vector is needed. We show that Krylov subspace...
Tsuyoshi Idé, Koji Tsuda
SDM
2007
SIAM
98views Data Mining» more  SDM 2007»
13 years 5 months ago
Lattice based Clustering of Temporal Gene-Expression Matrices
Individuals show different cell classes when they are in the different stages of a disease, have different disease subtypes, or have different response to a treatment or envir...
Yang Huang, Martin Farach-Colton
SDM
2007
SIAM
103views Data Mining» more  SDM 2007»
13 years 5 months ago
A System for Keyword Search on Textual Streams
An increasing amount of data is produced in the form of text streams − these can be RSS news feeds, TV closed captions, emails, etc. We study the problem of answering keyword qu...
Vagelis Hristidis, Oscar Valdivia, Michail Vlachos...
SDM
2007
SIAM
204views Data Mining» more  SDM 2007»
13 years 5 months ago
Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach
k-anonymity is a popular measure of privacy for data publishing: It measures the risk of identity-disclosure of individuals whose personal information are released in the form of ...
Bijit Hore, Ravi Chandra Jammalamadaka, Sharad Meh...
SDM
2007
SIAM
177views Data Mining» more  SDM 2007»
13 years 5 months ago
Bursty Feature Representation for Clustering Text Streams
Text representation plays a crucial role in classical text mining, where the primary focus was on static text. Nevertheless, well-studied static text representations including TFI...
Qi He, Kuiyu Chang, Ee-Peng Lim, Jun Zhang
SDM
2007
SIAM
152views Data Mining» more  SDM 2007»
13 years 5 months ago
HP2PC: Scalable Hierarchically-Distributed Peer-to-Peer Clustering
In distributed data mining models, adopting a flat node distribution model can affect scalability. To address the problem of modularity, flexibility and scalability, we propose...
Khaled M. Hammouda, Mohamed S. Kamel
SDM
2007
SIAM
104views Data Mining» more  SDM 2007»
13 years 5 months ago
Boosting Optimal Logical Patterns Using Noisy Data
We consider the supervised learning of a binary classifier from noisy observations. We use smooth boosting to linearly combine abstaining hypotheses, each of which maps a subcube...
Noam Goldberg, Chung-chieh Shan