Sciweavers

AAAI
2007

Improving Similarity Measures for Short Segments of Text

13 years 7 months ago
Improving Similarity Measures for Short Segments of Text
In this paper we improve previous work on measuring the similarity of short segments of text in two ways. First, we introduce a Web-relevance similarity measure and demonstrate its effectiveness. This measure extends the Web-kernel similarity function introduced by Sahami and Heilman (2006) by using relevance weighted inner-product of term occurrences rather than TF×IDF. Second, we show that one can further improve the accuracy of similarity measures by using a machine learning approach. Our methods outperform other state-of-the-art methods in a general query suggestion task for multiple evaluation metrics.
Wen-tau Yih, Christopher Meek
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2007
Where AAAI
Authors Wen-tau Yih, Christopher Meek
Comments (0)