Sciweavers

DEXA
2006
Springer

Understanding and Enhancing the Folding-In Method in Latent Semantic Indexing

Abstract. Latent Semantic Indexing (LSI) has been shown to be effective at capturing the semantic structure of document collections, and it is widely used in content-based text retrieval. However, in many real-world applications dealing with very large document collections, LSI suffers from high computational complexity, which comes from the Singular Value Decomposition (SVD). As a result, the folding-in method is widely used in practice as an approximation to LSI. However, the folding-in method is generally implemented "as is", and detailed analysis of its effectiveness and performance is left out. Consequently, the performance of the folding-in method cannot be guaranteed. In this paper, we first illustrated the underlying principle of the folding-in method from a linear algebra point of view and analyzed some commonly used existing techniques. Based on the theoretical analysis, we proposed a novel algorithm to guide the implementation of...
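For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of the standard textbook form of folding-in, not the enhanced algorithm the paper proposes. It assumes a small, made-up term-document matrix `A` (terms as rows, documents as columns) and uses NumPy; the helper names `lsi_fit` and `fold_in` are hypothetical. A new document vector d is projected into the existing k-dimensional latent space as d^T U_k S_k^{-1}, which avoids recomputing the SVD.

```python
import numpy as np

def lsi_fit(A, k):
    """Rank-k truncated SVD of a term-document matrix A (terms x documents)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]

def fold_in(d, U_k, s_k):
    """Project a term vector d into the latent space: d^T U_k S_k^{-1}.

    The SVD is not recomputed, so the latent space itself is left unchanged.
    """
    return (d @ U_k) / s_k

# Illustrative data only: 5 terms x 4 documents.
A = np.array([
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
], dtype=float)

U_k, s_k, Vt_k = lsi_fit(A, k=2)

# Folding in a document that was already in the collection
# recovers its existing latent coordinates (a column of Vt_k).
d0_hat = fold_in(A[:, 0], U_k, s_k)

# Folding in a previously unseen document gives its approximate coordinates.
new_doc = np.array([1.0, 1.0, 0.0, 0.0, 0.0])
new_hat = fold_in(new_doc, U_k, s_k)
```

Because the basis `U_k` is fixed, documents folded in this way do not influence the latent space, which is the usual reason the approximation can drift as many new documents are added.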
Xiang Wang 0002, Xiaoming Jin
Added 22 Aug 2010
Updated 22 Aug 2010
Type Conference
Year 2006
Where DEXA
Authors Xiang Wang 0002, Xiaoming Jin