Sciweavers

1061 search results - page 104 / 213
» Massive Data Pre-Processing with a Cluster Based Approach
Sort
View
BMCBI
2005
142views more  BMCBI 2005»
15 years 4 months ago
CLU: A new algorithm for EST clustering
Background: The continuous flow of EST data remains one of the richest sources for discoveries in modern biology. The first step in EST data mining is usually associated with EST ...
Andrey A. Ptitsyn, Winston Hide
ICML
2007
IEEE
16 years 5 months ago
Graph clustering with network structure indices
Graph clustering has become ubiquitous in the study of relational data sets. We examine two simple algorithms: a new graphical adaptation of the k-medoids algorithm and the Girvan...
Matthew J. Rattigan, Marc Maier, David Jensen
ACL
2007
15 years 5 months ago
Sparse Information Extraction: Unsupervised Language Models to the Rescue
Even in a massive corpus such as the Web, a substantial fraction of extractions appear infrequently. This paper shows how to assess the correctness of sparse extractions by utiliz...
Doug Downey, Stefan Schoenmackers, Oren Etzioni
ICASSP
2010
IEEE
15 years 4 months ago
Acceleration of sequence kernel computation for real-time speaker identification
The sequence kernel has been shown to be a promising kernel function for learning from sequential data such as speech and DNA. However, it is not scalable to massive datasets due ...
Makoto Yamada, Masashi Sugiyama, Gordon Wichern, T...
SDM
2008
SIAM
129views Data Mining» more  SDM 2008»
15 years 5 months ago
Statistical Density Prediction in Traffic Networks
Recently, modern tracking methods started to allow capturing the position of massive numbers of moving objects. Given this information, it is possible to analyze and predict the t...
Hans-Peter Kriegel, Matthias Renz, Matthias Schube...