Search Sciweavers | Sciweavers


CLEF
2010
Springer


Creating a Persian-English Comparable Corpus

13 years 5 months ago

Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However, the Persian language does not have rich multi...

Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...


IJDAR
2011


Grammar-based techniques for creating ground-truthed sketch corpora

12 years 11 months ago


Although publicly-available, ground-truthed corpora have proven useful for training, evaluating, and comparing recognition systems in many domains, the availability of such corpor...

Scott MacLean, George Labahn, Edward Lank, Mirette...


ACL
2010


Creating Robust Supervised Classifiers via Web-Scale N-Gram Data

13 years 2 months ago


In this paper, we systematically assess the value of using web-scale N-gram data in state-of-the-art supervised NLP classifiers. We compare classifiers that include or exclude fea...

Shane Bergsma, Emily Pitler, Dekang Lin


ECTEL
2007
Springer


Categorizing Learning Objects Based On Wikipedia as Substitute Corpus

13 years 11 months ago


As metadata is often not sufficiently provided by authors of Learning Resources, automatic metadata generation methods are used to create metadata afterwards. One kind of metadata ...

Marek Meyer, Christoph Rensing, Ralf Steinmetz


LREC
2010


Using Comparable Corpora to Adapt a Translation Model to Domains

13 years 6 months ago


Statistical machine translation (SMT) requires a large parallel corpus, which is available only for restricted language pairs and domains. To expand the language pairs and domains...

Hiroyuki Kaji, Takashi Tsunakawa, Daisuke Okada
