Sciweavers

CEAS
2005
Springer

Implicit Queries for Email

Implicit query systems examine a document and automatically conduct searches for the most relevant information. In this paper, we offer three contributions to implicit query research. First, we show how to use query logs from a search engine: by constraining results to commonly issued queries, we can get dramatic improvements. Second, we describe a method for optimizing parameters for an implicit query system, by using logistic regression training. The method is designed to estimate the probability that any particular suggested query is a good one. Third, we show which features beyond standard TF-IDF features are most helpful in our logistic regression model: query frequency information, capitalization information, subject line information, and message length information. Using the optimization method and the additional features, we are able to produce a system with up to 6 times better results on top-1 score than a simple TF-IDF system.
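To make the pipeline in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes the simplest reading of each step: candidate queries are word n-grams of the message that also occur in a query log, each candidate gets one feature per signal the abstract names, and a logistic regression trained by plain stochastic gradient ascent estimates the probability that a candidate is a good query. Every function name, the feature definitions, the toy message, the query-log counts, the document frequencies, and the labels are hypothetical and chosen only for illustration.

```python
import math
import re

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())

def candidates(subject, body, query_log, max_len=3):
    """Word n-grams of the message that also appear in the query log."""
    words = tokenize(subject + " " + body)
    found = set()
    for n in range(1, max_len + 1):
        for i in range(len(words) - n + 1):
            phrase = " ".join(words[i:i + n])
            if phrase in query_log:  # constrain to logged queries
                found.add(phrase)
    return sorted(found)

def features(phrase, subject, body, query_log, doc_freq, n_docs):
    """One crude feature per signal named in the abstract, plus a bias."""
    full = subject + " " + body
    text = " ".join(tokenize(full))
    tf = text.count(phrase)  # substring count over normalized text
    idf = math.log(n_docs / (1 + doc_freq.get(phrase, 0)))
    return [
        1.0,                                               # bias
        tf * idf,                                          # TF-IDF
        math.log(1 + query_log.get(phrase, 0)),            # query frequency
        1.0 if phrase.title() in full else 0.0,            # capitalization
        1.0 if phrase in " ".join(tokenize(subject)) else 0.0,  # subject line
        math.log(1 + len(tokenize(body))),                 # message length
    ]

def score(w, x):
    """Estimated probability that the candidate is a good query."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)))

def train(X, y, lr=0.1, epochs=500):
    """Logistic regression fitted by stochastic gradient ascent."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            err = yi - score(w, xi)
            for j, xj in enumerate(xi):
                w[j] += lr * err * xj
    return w

# Hypothetical toy data (assumptions, not taken from the paper).
subject = "Trip to Seattle"
body = "Flights to Seattle are cheap in spring and the Space Needle is open"
query_log = {"seattle": 900, "space needle": 400,
             "flights to seattle": 250, "weather": 1000}
doc_freq = {"seattle": 50, "space needle": 5, "flights to seattle": 2}
n_docs = 1000

# Hypothetical human judgments: 1 = good suggested query, 0 = not.
labeled = [("space needle", 1), ("flights to seattle", 1), ("seattle", 0)]
X = [features(p, subject, body, query_log, doc_freq, n_docs) for p, _ in labeled]
y = [label for _, label in labeled]
w = train(X, y)

# Rank the log-constrained candidates by estimated probability.
ranked = sorted(
    candidates(subject, body, query_log),
    key=lambda p: score(w, features(p, subject, body, query_log, doc_freq, n_docs)),
    reverse=True,
)
for phrase in ranked:
    x = features(phrase, subject, body, query_log, doc_freq, n_docs)
    print(f"{phrase}: {score(w, x):.3f}")
```

Filtering against the query log happens before scoring, so phrases nobody searches for never reach the model; the query-frequency feature then lets the model weigh how common each surviving candidate is. A real system would train on many labeled messages rather than the three toy examples used here.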
Joshua Goodman, Vitor R. Carvalho
Added: 26 Jun 2010
Updated: 26 Jun 2010
Type: Conference
Year: 2005
Where: CEAS
Authors: Joshua Goodman, Vitor R. Carvalho