Sciweavers

28821 search results - page 5640 / 5765
» Distributed and parallel systems
Sort
View
137
Voted
KDD
2007
ACM
178views Data Mining» more  KDD 2007»
16 years 3 months ago
Practical learning from one-sided feedback
In many data mining applications, online labeling feedback is only available for examples which were predicted to belong to the positive class. Such applications include spam filt...
D. Sculley
147
Voted
KDD
2007
ACM
153views Data Mining» more  KDD 2007»
16 years 3 months ago
Exploiting duality in summarization with deterministic guarantees
Summarization is an important task in data mining. A major challenge over the past years has been the efficient construction of fixed-space synopses that provide a deterministic q...
Panagiotis Karras, Dimitris Sacharidis, Nikos Mamo...
150
Voted
KDD
2007
ACM
193views Data Mining» more  KDD 2007»
16 years 3 months ago
Joint optimization of wrapper generation and template detection
Many websites have large collections of pages generated dynamically from an underlying structured source like a database. The data of a category are typically encoded into similar...
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu
153
Voted
KDD
2007
ACM
182views Data Mining» more  KDD 2007»
16 years 3 months ago
A fast algorithm for finding frequent episodes in event streams
Frequent episode discovery is a popular framework for mining data available as a long sequence of events. An episode is essentially a short ordered sequence of event types and the...
Srivatsan Laxman, P. S. Sastry, K. P. Unnikrishnan
KDD
2006
ACM
130views Data Mining» more  KDD 2006»
16 years 3 months ago
Efficient anonymity-preserving data collection
The output of a data mining algorithm is only as good as its inputs, and individuals are often unwilling to provide accurate data about sensitive topics such as medical history an...
Justin Brickell, Vitaly Shmatikov
« Prev « First page 5640 / 5765 Last » Next »