Sciweavers

88 search results - page 1 / 18
» Process Model for Composing High-quality Text Corpora
LREC
2008
13 years 6 months ago
Process Model for Composing High-quality Text Corpora
The Teko corpus composing model offers a decentralized, dynamic way of collecting high-quality text corpora for linguistic research. The resulting corpus consists of independent t...
Mikko Lounela
EMNLP
2011
12 years 5 months ago
Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text-to-Text Generation
Previous work has shown that high quality phrasal paraphrases can be extracted from bilingual parallel corpora. However, it is not clear whether bitexts are an appropriate resourc...
Juri Ganitkevitch, Chris Callison-Burch, Courtney ...
ECIR
2006
Springer
13 years 6 months ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
FSMNLP
2008
Springer
13 years 6 months ago
Finite State Models for the Generation of Large Corpora of Natural Language Texts
Domenico Cantone, Salvatore Cristofaro, Simone Far...
LREC
2010
13 years 6 months ago
Building a Web Corpus of Czech
Large corpora are essential to modern methods of computational linguistics and natural language processing. In this paper, we describe an ongoing project whose aim is to build a l...
Drahomíra "johanka" Spoustová, Miros...