Sciweavers

943 search results - page 61 / 189
» Statistical Language Models for Information Retrieval
Sort
View
CICLING
2008
Springer
14 years 11 months ago
A Semantics-Enhanced Language Model for Unsupervised Word Sense Disambiguation
An N-gram language model aims at capturing statistical word order dependency information from corpora. Although the concept of language models has been applied extensively to handl...
Shou-de Lin, Karin Verspoor
ECIR
2006
Springer
14 years 11 months ago
An Efficient Computation of the Multiple-Bernoulli Language Model
Abstract. The Multiple Bernoulli (MB) Language Model has been generally considered too computationally expensive for practical purposes and superseded by the more efficient multino...
Leif Azzopardi, David E. Losada
ICML
1998
IEEE
15 years 10 months ago
Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus
Cross-language latent semantic indexing is a method that learns useful languageindependent vector representations of terms through a statistical analysis of a documentaligned text...
Michael L. Littman, Fan Jiang, Greg A. Keim
SIGMOD
1998
ACM
115views Database» more  SIGMOD 1998»
15 years 2 months ago
Providing Database-like Access to the Web Using Queries Based on Textual Similarity
Most databases contain “name constants” like course numbers, personal names, and place names that correspond to entities in the real world. Previous work in integration of het...
William W. Cohen
KES
2005
Springer
15 years 3 months ago
An OCR Post-processing Approach Based on Multi-knowledge
This paper proposes an OCR post-processing approach based on multi-knowledge, which integrates language knowledge and candidate distance information given by the OCR engine. In thi...
Li Zhuang, Xiaoyan Zhu