Sciweavers

DATESO
2004

LSI vs. Wordnet Ontology in Dimension Reduction for Information Retrieval

13 years 5 months ago
LSI vs. Wordnet Ontology in Dimension Reduction for Information Retrieval
Abstract. In the area of information retrieval, the dimension of document vectors plays an important role. Firstly, with higher dimensions index structures suffer the "curse of dimensionality" and their efficiency rapidly decreases. Secondly, we may not use exact words when looking for a document, thus we miss some relevant documents. LSI (Latent Semantic Indexing) is a numerical method, which discovers latent semantic in documents by creating concepts from existing terms. However, it is hard to compute LSI. In this article, we offer a replacement of LSI with a projection matrix created from WordNet hierarchy and compare it with LSI.
Pavel Moravec, Michal Kolovrat, Václav Sn&a
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2004
Where DATESO
Authors Pavel Moravec, Michal Kolovrat, Václav Snásel
Comments (0)