Sciweavers

555 search results - page 79 / 111
» An Empirical Study on Web Mining of Parallel Data
Sort
View
SDM
2008
SIAM
177views Data Mining» more  SDM 2008»
14 years 11 months ago
Cluster Ensemble Selection
This paper studies the ensemble selection problem for unsupervised learning. Given a large library of different clustering solutions, our goal is to select a subset of solutions t...
Xiaoli Z. Fern, Wei Lin
SDM
2007
SIAM
126views Data Mining» more  SDM 2007»
14 years 11 months ago
Scalable Name Disambiguation using Multi-level Graph Partition
When non-unique values are used as the identifier of entities, due to their homonym, confusion can occur. In particular, when (part of) “names” of entities are used as their ...
Byung-Won On, Dongwon Lee
SDM
2007
SIAM
130views Data Mining» more  SDM 2007»
14 years 11 months ago
Maximizing the Area under the ROC Curve with Decision Lists and Rule Sets
Decision lists (or ordered rule sets) have two attractive properties compared to unordered rule sets: they require a simpler classification procedure and they allow for a more co...
Henrik Boström
ICDM
2010
IEEE
167views Data Mining» more  ICDM 2010»
14 years 7 months ago
Averaged Stochastic Gradient Descent with Feedback: An Accurate, Robust, and Fast Training Method
On large datasets, the popular training approach has been stochastic gradient descent (SGD). This paper proposes a modification of SGD, called averaged SGD with feedback (ASF), tha...
Xu Sun, Hisashi Kashima, Takuya Matsuzaki, Naonori...
GIS
2007
ACM
15 years 10 months ago
Environmental scenario search and visualization
We have developed Environmental Scenario Search Engine (ESSE) for parallel data mining of a set of conditions inside distributed, very large databases from multiple environmental ...
Mikhail N. Zhizhin, Eric A. Kihn, Vassily Lyutsare...