Sciweavers

319 search results - page 29 / 64
» Algorithms for Mining Distance-Based Outliers in Large Datas...
Sort
View
PAKDD
2011
ACM
419views Data Mining» more  PAKDD 2011»
14 years 24 days ago
DISC: Data-Intensive Similarity Measure for Categorical Data
Abstract. The concept of similarity is fundamentally important in almost every scientific field. Clustering, distance-based outlier detection, classification, regression and sea...
Aditya Desai, Himanshu Singh, Vikram Pudi
ICDM
2005
IEEE
122views Data Mining» more  ICDM 2005»
15 years 3 months ago
Finding Representative Set from Massive Data
In the information age, data is pervasive. In some applications, data explosion is a significant phenomenon. The massive data volume poses challenges to both human users and comp...
Feng Pan, Wei Wang 0010, Anthony K. H. Tung, Jiong...
DASFAA
2010
IEEE
176views Database» more  DASFAA 2010»
15 years 4 months ago
Mining Diversity on Networks
Abstract. Despite the recent emergence of many large-scale networks in different application domains, an important measure that captures a participant’s diversity in the network ...
Lu Liu, Feida Zhu, Chen Chen, Xifeng Yan, Jiawei H...
ICDM
2006
IEEE
138views Data Mining» more  ICDM 2006»
15 years 4 months ago
Adaptive Blocking: Learning to Scale Up Record Linkage
Many information integration tasks require computing similarity between pairs of objects. Pairwise similarity computations are particularly important in record linkage systems, as...
Mikhail Bilenko, Beena Kamath, Raymond J. Mooney
MLG
2007
Springer
15 years 4 months ago
Weighted Substructure Mining for Image Analysis
1 In web-related applications of image categorization, it is desirable to derive an interpretable classification rule with high accuracy. Using the bag-of-words representation and...
Sebastian Nowozin, Koji Tsuda, Takeaki Uno, Taku K...