Sciweavers

63 search results - page 2 / 13
» Large Linguistically-Processed Web Corpora for Multiple Lang...
Sort
View
ECIR
2006
Springer
13 years 6 months ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
SIGIR
2006
ACM
13 years 11 months ago
Improving the estimation of relevance models using large external corpora
Information retrieval algorithms leverage various collection statistics to improve performance. Because these statistics are often computed on a relatively small evaluation corpus...
Fernando Diaz, Donald Metzler
EMNLP
2009
13 years 2 months ago
A Rich Feature Vector for Protein-Protein Interaction Extraction from Multiple Corpora
Because of the importance of proteinprotein interaction (PPI) extraction from text, many corpora have been proposed with slightly differing definitions of proteins and PPI. Since ...
Makoto Miwa, Rune Sætre, Yusuke Miyao, Jun-i...
SIGIR
2004
ACM
13 years 10 months ago
Translating unknown queries with web corpora for cross-language information retrieval
It is crucial for cross-language information retrieval (CLIR) systems to deal with the translation of unknown queries1 due to that real queries might be short. The purpose of this...
Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-...
WSDM
2012
ACM
236views Data Mining» more  WSDM 2012»
12 years 20 days ago
Effective query formulation with multiple information sources
Most standard information retrieval models use a single source of information (e.g., the retrieval corpus) for query formulation tasks such as term and phrase weighting and query ...
Michael Bendersky, Donald Metzler, W. Bruce Croft