Sciweavers

1768 search results - page 211 / 354
» Mining Very Large Databases
Sort
View
139
Voted
KDD
1999
ACM
166views Data Mining» more  KDD 1999»
15 years 8 months ago
CACTUS - Clustering Categorical Data Using Summaries
Clustering is an important data mining problem. Most of the earlier work on clustering focussed on numeric attributes which have a natural ordering on their attribute values. Rece...
Venkatesh Ganti, Johannes Gehrke, Raghu Ramakrishn...
121
Voted
KDD
2008
ACM
164views Data Mining» more  KDD 2008»
16 years 4 months ago
Microscopic evolution of social networks
We present a detailed study of network evolution by analyzing four large online social networks with full temporal information about node and edge arrivals. For the first time at ...
Jure Leskovec, Lars Backstrom, Ravi Kumar, Andrew ...
155
Voted
KDD
2007
ACM
152views Data Mining» more  KDD 2007»
16 years 4 months ago
Relational data pre-processing techniques for improved securities fraud detection
Commercial datasets are often large, relational, and dynamic. They contain many records of people, places, things, events and their interactions over time. Such datasets are rarel...
Andrew Fast, Lisa Friedland, Marc Maier, Brian Tay...
151
Voted
SGAI
2009
Springer
15 years 10 months ago
Parallel Rule Induction with Information Theoretic Pre-Pruning
In a world where data is captured on a large scale the major challenge for data mining algorithms is to be able to scale up to large datasets. There are two main approaches to indu...
Frederic T. Stahl, Max Bramer, Mo Adda
143
Voted
NCA
2006
IEEE
15 years 9 months ago
Parallelization of Geant4 Using TOP-C and Marshalgen
Geant4 is a very large, highly accurate toolkit for Monte Carlo simulation of particle-matter interaction. It has been applied to high-energy physics, cosmic ray modeling, radiati...
Gene Cooperman, Viet Ha Nguyen, Igor Malioutov