Sciweavers

43 search results - page 1 / 9
» Creating a Persian-English Comparable Corpus
Sort
View
CLEF
2010
Springer
13 years 5 months ago
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However the Persian language does not have rich multi...
Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...
IJDAR
2011
143views more  IJDAR 2011»
12 years 11 months ago
Grammar-based techniques for creating ground-truthed sketch corpora
Although publicly-available, ground-truthed corpora have proven useful for training, evaluating, and comparing recognition systems in many domains, the availability of such corpor...
Scott MacLean, George Labahn, Edward Lank, Mirette...
ACL
2010
13 years 2 months ago
Creating Robust Supervised Classifiers via Web-Scale N-Gram Data
In this paper, we systematically assess the value of using web-scale N-gram data in state-of-the-art supervised NLP classifiers. We compare classifiers that include or exclude fea...
Shane Bergsma, Emily Pitler, Dekang Lin
ECTEL
2007
Springer
13 years 11 months ago
Categorizing Learning Objects Based On Wikipedia as Substitute Corpus
As metadata is often not sufficiently provided by authors of Learning Resources, automatic metadata generation methods are used to create metadata afterwards. One kind of metadata ...
Marek Meyer, Christoph Rensing, Ralf Steinmetz
LREC
2010
169views Education» more  LREC 2010»
13 years 6 months ago
Using Comparable Corpora to Adapt a Translation Model to Domains
Statistical machine translation (SMT) requires a large parallel corpus, which is available only for restricted language pairs and domains. To expand the language pairs and domains...
Hiroyuki Kaji, Takashi Tsunakawa, Daisuke Okada