Sciweavers

1085 search results - page 124 / 217
» Active Mining in a Distributed Setting
Sort
View
117
Voted
WSDM
2009
ACM
163views Data Mining» more  WSDM 2009»
15 years 7 months ago
Tagging with Queries: How and Why?
Web search queries capture the information need of search engine users. Search engines store these queries in their logs and analyze them to guide their search results. In this wo...
Ioannis Antonellis, Hector Garcia-Molina, Jawed Ka...
88
Voted
ICDM
2007
IEEE
129views Data Mining» more  ICDM 2007»
15 years 6 months ago
A Generalization of Proximity Functions for K-Means
K-means is a widely used partitional clustering method. A large amount of effort has been made on finding better proximity (distance) functions for K-means. However, the common c...
Junjie Wu, Hui Xiong, Jian Chen, Wenjun Zhou
PAKDD
2009
ACM
225views Data Mining» more  PAKDD 2009»
15 years 5 months ago
Accurate Synthetic Generation of Realistic Personal Information
A large proportion of the massive amounts of data that are being collected by many organisations today is about people, and often contains identifying information like names, addre...
Peter Christen, Agus Pudjijono
106
Voted
KDD
2010
ACM
250views Data Mining» more  KDD 2010»
15 years 4 months ago
Modeling relational events via latent classes
Many social networks can be characterized by a sequence of dyadic interactions between individuals. Techniques for analyzing such events are of increasing interest. In this paper,...
Christopher DuBois, Padhraic Smyth
GECCO
2006
Springer
180views Optimization» more  GECCO 2006»
15 years 4 months ago
Improving cooperative GP ensemble with clustering and pruning for pattern classification
A boosting algorithm based on cellular genetic programming to build an ensemble of predictors is proposed. The method evolves a population of trees for a fixed number of rounds an...
Gianluigi Folino, Clara Pizzuti, Giandomenico Spez...