Sciweavers

947 search results - page 146 / 190
» Evaluation of Sampling for Data Mining of Association Rules
Sort
View
75
Voted
ICDM
2008
IEEE
113views Data Mining» more  ICDM 2008»
15 years 7 months ago
Online Reliability Estimates for Individual Predictions in Data Streams
Several predictive systems are nowadays vital for operations and decision support. The quality of these systems is most of the time defined by their average accuracy which has lo...
Pedro Pereira Rodrigues, João Gama, Zoran B...
95
Voted
SDM
2003
SIAM
134views Data Mining» more  SDM 2003»
15 years 1 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester
79
Voted
DASFAA
2007
IEEE
163views Database» more  DASFAA 2007»
15 years 6 months ago
Protecting Individual Information Against Inference Attacks in Data Publishing
In many data-publishing applications, the data owner needs to protect sensitive information pertaining to individuals. Meanwhile, certain information is required to be published. T...
Chen Li, Houtan Shirani-Mehr, Xiaochun Yang
118
Voted
DEXA
2006
Springer
129views Database» more  DEXA 2006»
15 years 4 months ago
Selectively Storing XML Data in Relations
This paper presents a new framework for users to select relevant data from an XML document and store it in an existing relational database, as opposed to previous approaches that s...
Wenfei Fan, Lisha Ma
KDD
2008
ACM
165views Data Mining» more  KDD 2008»
16 years 28 days ago
Colibri: fast mining of large static and dynamic graphs
Low-rank approximations of the adjacency matrix of a graph are essential in finding patterns (such as communities) and detecting anomalies. Additionally, it is desirable to track ...
Hanghang Tong, Spiros Papadimitriou, Jimeng Sun, P...