Sciweavers

ECIR
2009
Springer
14 years 1 months ago
Correlation of Term Count and Document Frequency for Google N-Grams
For bounded datasets such as the TREC Web Track (WT10g) the computation of term frequency (TF) and inverse document frequency (IDF) is not difficult. However, when the corpus is th...
Martin Klein, Michael L. Nelson
ECIR
2009
Springer
14 years 1 months ago
A Probabilistic Retrieval Model for Semistructured Data
Retrieving semistructured (XML) data typically requires either a structured query such as XPath, or a keyword query that does not take structure into account. In this pap...
Jinyoung Kim, Xiaobing Xue, W. Bruce Croft
ECIR
2009
Springer
14 years 1 months ago
Information Extraction and Linking in a Retrieval Context
Marie-Francine Moens, Djoerd Hiemstra
ECIR
2009
Springer
14 years 1 months ago
Joint Ranking for Multilingual Web Search
Ranking for multilingual information retrieval (MLIR) is a task to rank documents of different languages solely based on their relevancy to the query regardless of query’s langu...
Wei Gao, Cheng Niu, Ming Zhou, Kam-Fai Wong
ECIR
2009
Springer
14 years 1 months ago
Design and Evaluation of a University-Wide Expert Search Engine
We present an account of designing and evaluating a university-wide expert search engine. We performed system-based evaluation to determine the optimal retrieval settings and an ex...
Ruud Liebregts, Toine Bogers
ECIR
2009
Springer
14 years 1 months ago
On Automatic Plagiarism Detection Based on n-Grams Comparison
When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the pla...
Alberto Barrón-Cedeño, Paolo Rosso
ECIR
2009
Springer
14 years 1 months ago
A Framework of Evaluation for Question-Answering Systems
Evaluating a complex system is a complex task. Evaluation campaigns are organized each year to test different systems on global results, but they do not evaluate the relevance of th...
Sarra El Ayari, Brigitte Grau
ECIR
2009
Springer
14 years 1 months ago
Investigating Learning Approaches for Blog Post Opinion Retrieval
Blog post opinion retrieval is the problem of identifying posts which express an opinion about a particular topic. Usually the problem is solved using a 3-step process in which rel...
Shima Gerani, Mark James Carman, Fabio Crestani
ECIR
2009
Springer
14 years 1 months ago
Classifying and Characterizing Query Intent
Understanding the intent underlying user queries may help personalize search results and improve user satisfaction. In this paper, we develop a methodology for using ad clickthroug...
Azin Ashkan, Charles L. A. Clarke, Eugene Agichtei...
ECIR
2009
Springer
14 years 1 months ago
A Comparative Study of Utilizing Topic Models for Information Retrieval
We explore the utility of different types of topic models for retrieval purposes. Based on prior work, we describe several ways that topic models can be integrated into the retrie...
Xing Yi, James Allan