Sciweavers

» Corpora and data preparation
ACL
2012
13 years 10 days ago
A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining
We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efficient, language pair independent and mines transliteration pairs i...
Hassan Sajjad, Alexander Fraser, Helmut Schmid
EMNLP
2008
14 years 11 months ago
Mining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora
We present a novel method for discovering and modeling the relationship between informal Chinese expressions (including colloquialisms and instant-messaging slang) and their forma...
Zhifei Li, David Yarowsky
COLING
1996
14 years 11 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages such as Thai, Japanese and Korean that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka
BMCBI
2006
14 years 10 months ago
Statistical modeling of biomedical corpora: mining the Caenorhabditis Genetic Center Bibliography for genes related to life span
Background: The statistical modeling of biomedical corpora could yield integrated, coarse-to-fine views of biological phenomena that complement discoveries made from analysis of m...
David M. Blei, K. Franks, Michael I. Jordan, I. Sa...
MIE
2008
14 years 11 months ago
Mining Knowledge from Corpora: an Application to Retrieval and Indexing
The present work aims at discovering new associations between medical concepts to be exploited as input in retrieval and indexing. Material and Methods: Association rules method is...
Lina Fatima Soualmia, Badisse Dahamna, Stéf...