Sciweavers

Search results for: Active Mining in a Distributed Setting
KDD
2010
ACM
Semi-supervised feature selection for graph classification
The problem of graph classification has attracted great interest in the last decade. Current research on graph classification assumes the existence of large amounts of labeled tra...
Xiangnan Kong, Philip S. Yu
WSDM
2010
ACM
SBotMiner: Large Scale Search Bot Detection
In this paper, we study search bot traffic from search engine query logs at a large scale. Although bots that generate search traffic aggressively can be easily detected, a large ...
Fang Yu, Yinglian Xie, Qifa Ke
IPPS
1999
IEEE
High-Performance Knowledge Extraction from Data on PC-Based Networks of Workstations
The automatic construction of classifiers, programs able to correctly classify data collected from the real world, is one of the major problems in pattern recognition and in a wide ar...
Cosimo Anglano, Attilio Giordana, Giuseppe Lo Bell...
KDD
2006
ACM
Hierarchical topic segmentation of websites
In this paper, we consider the problem of identifying and segmenting topically cohesive regions in the URL tree of a large website. Each page of the website is assumed to have a t...
Ravi Kumar, Kunal Punera, Andrew Tomkins
KDD
2005
ACM
A general model for clustering binary data
Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. This p...
Tao Li