Search Sciweavers | Sciweavers

14 search results - page 1 / 3

EACL
2009
ACL Anthology

95 views, Natural Language Processing

Feature-Based Method for Document Alignment in Comparable News Corpora

14 years 5 months ago

Download www.aclweb.org

Thuy Vu, AiTi Aw, Min Zhang

EACL
2009
ACL Anthology

109 views, Natural Language Processing

MINT: A Method for Effective and Scalable Mining of Named Entity Transliterations from Large Comparable Corpora

13 years 2 months ago

Download research.microsoft.com

In this paper, we address the problem of mining transliterations of Named Entities (NEs) from large comparable corpora. We leverage the empirical fact that multilingual news artic...

Raghavendra Udupa, K. Saravanan, A. Kumaran, Jagad...

NAACL
2010

182 views, Computational Linguistics

Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment

13 years 2 months ago

Download research.microsoft.com

The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...

Jason R. Smith, Chris Quirk, Kristina Toutanova

COLING
2010

191 views, Computational Linguistics

Mining Large-scale Comparable Corpora from Chinese-English News Collections

12 years 11 months ago

Download www.aclweb.org

In this paper, we explore a CLIR-based approach to construct large-scale Chinese-English comparable corpora, which is valuable for translation knowledge mining. The initial source...

Degen Huang, Lian Zhao, Lishuang Li, Haitao Yu

CLEF
2010
Springer

257 views, Information Technology

Creating a Persian-English Comparable Corpus

13 years 4 months ago

Download khorshid.ut.ac.ir

Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However, the Persian language does not have rich multi...

Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...
