Sciweavers

TSD
2009
Springer

Combining Text Vector Representations for Information Retrieval

13 years 9 months ago
Combining Text Vector Representations for Information Retrieval
Abstract. This paper suggests a novel representation for documents that is intended to improve precision. This representation is generated by combining two central techniques: Random Indexing; and Holographic Reduced Representations (HRRs). Random indexing uses co-occurrence information among words to generate semantic context vectors that are the sum of randomly generated term identity vectors. HRRs are used to encode textual structure which can directly capture relations between words (e.g., compound terms, subject-verb, and verb-object). By using the random vectors to capture semantic information, and then employing HRRs to capture structural relations extracted from the text, document vectors are generated by summing all such representations in a document. In this paper, we show that these representations can be successfully used in information retrieval, can effectively incorporate relations, and can reduce the dimensionality of the traditional vector space model (VSM). The result...
Maya Carrillo, Chris Eliasmith, Aurelio Lóp
Added 27 Jul 2010
Updated 27 Jul 2010
Type Conference
Year 2009
Where TSD
Authors Maya Carrillo, Chris Eliasmith, Aurelio López-López
Comments (0)