Search Sciweavers | Sciweavers

INEX
2005
Springer

107 views, Information Technology

Parameter Estimation for a Simple Hierarchical Generative Model for XML Retrieval

15 years 10 months ago

Abstract. This paper explores the possibility of using a modified Expectation-Maximization algorithm to estimate parameters for a simple hierarchical generative model for XML retr...

Paul Ogilvie, Jamie Callan

CIKM
2003
Springer

98 views, Information Technology

Multi-resolution disambiguation of term occurrences

15 years 10 months ago

Download einat.webir.org

We describe a system for extracting mentions of terms, such as company and product names, in a large and noisy corpus of documents such as the World Wide Web. Since natural langua...

Einat Amitay, Rani Nelken, Wayne Niblack, Ron Siva...

ECIR
2008
Springer

117 views, Information Technology

Using Coherence-Based Measures to Predict Query Difficulty

15 years 6 months ago

Download staff.science.uva.nl

Abstract. We investigate the potential of coherence-based scores to predict query difficulty. The coherence of a document set associated with each query word is used to capture the...

Jiyin He, Martha Larson, Maarten de Rijke

LREC
2008

132 views, Education

Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

15 years 6 months ago

Download www.lrec-conf.org

This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gather...

Michael Mohler, Rada Mihalcea

EACL
2006
ACL Anthology

103 views, Natural Language Processing

Classifying Biological Full-Text Articles for Multi-Database Curation

15 years 6 months ago

Download acl.ldc.upenn.edu

In this paper, we propose an approach for identifying curatable articles from a large document set. This system considers three parts of an article (title and abstract, MeSH terms, and ca...

Wen-Juan Hou, Chih Lee, Hsin-Hsi Chen
