Sciweavers

» Clustering by pattern similarity in large data sets
DATAMINE
2006
13 years 6 months ago
Accelerated EM-based clustering of large data sets
Motivated by the poor performance (linear complexity) of the EM algorithm in clustering large data sets, and inspired by the successful accelerated versions of related algorithms l...
Jakob J. Verbeek, Jan Nunnink, Nikos A. Vlassis
VLDB
2005
ACM
13 years 11 months ago
Selectivity Estimation for Fuzzy String Predicates in Large Data Sets
Many database applications have the emerging need to support fuzzy queries that ask for strings that are similar to a given string, such as “name similar to smith” and “tele...
Liang Jin, Chen Li
BPM
2009
Springer
14 years 26 days ago
Trace Clustering Based on Conserved Patterns: Towards Achieving Better Process Models
Process mining refers to the extraction of process models from event logs. Real-life processes tend to be less structured and more flexible. Traditional process mining algorithms ...
R. P. Jagadeesh Chandra Bose, Wil M. P. van der Aa...
BMCBI
2006
13 years 6 months ago
Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological featur
Background: Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data ...
J. R. Semeiks, A. Rizki, Mina J. Bissell, I. Saira...
BMCBI
2010
13 years 3 months ago
A grammar-based distance metric enables fast and accurate clustering of large sets of 16S sequences
Background: We propose a sequence clustering algorithm and compare the partition quality and execution time of the proposed algorithm with those of a popular existing algorithm. T...
David J. Russell, Samuel F. Way, Andrew K. Benson,...