Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

126

DEXAW
2009
IEEE

131views Database» more DEXAW 2009»

Clustering of Short Strings in Large Databases

15 years 11 months ago

Clustering of Short Strings in Large Databases

Download www.uni-weimar.de

—A novel method CLOSS intended for textual databases is proposed. It successfully identiﬁes misspelled string clusters, even if the cluster border is not prominent. The method uses q-gram approach to represent data and a string proximity graph to ﬁnd the cluster. Contribution refers to short string clustering in text mining, when the proximity graph has multiple horizontal lines or the line is not present.

Michail Kazimianec, Arturas Mazeika

Real-time Traffic

Cluster Border | Database | DEXAW 2009 | Short String Clustering | String Proximity Graph |

claim paper

Related Content

» Selectivity Estimation for Fuzzy String Predicates in Large Data Sets

» Approximate String Matching in DNA Sequences

» Approximate String Joins

» The edtree An Index for Large DNA Sequence Databases

» A LowCost Parallel KMeans VQ Algorithm Using Cluster Computing

» KMeans VQ algorithm using a lowcost parallel cluster computing

» Clustering wavelets to speedup data dissemination in structured P2P MANETs

» Fuzzy Decomposition of Spatially Extended Objects

» CorrelationBased Web Document Clustering for Adaptive Web Interface Design

Post Info
More Details (n/a)

Added	20 May 2010
Updated	20 May 2010
Type	Conference
Year	2009
Where	DEXAW
Authors	Michail Kazimianec, Arturas Mazeika

Comments (0)