IIS
2004

Lingo: Search Results Clustering Algorithm Based on Singular Value Decomposition

The search results clustering problem is defined as the automatic, on-line grouping of similar documents in a list of results returned by a search engine. In this paper we present Lingo, a novel algorithm for clustering search results that emphasizes cluster description quality. We describe the methods used in the algorithm: algebraic transformations of the term-document matrix and frequent phrase extraction using suffix arrays. Finally, we discuss the results of an empirical evaluation of the algorithm.

"Knowledge is of two kinds: we know a subject ourselves, or we know where we can find information about it." -- Samuel Johnson, 1775
Stanislaw Osinski, Jerzy Stefanowski, Dawid Weiss
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2004
Where IIS
Authors Stanislaw Osinski, Jerzy Stefanowski, Dawid Weiss