Sciweavers

138 search results - page 2 / 28
» Data Cleaning for Word Alignment
KDD
2005
ACM
14 years 5 months ago
Email data cleaning
This paper addresses the issue of 'email data cleaning' for text mining. Many text mining applications need to take emails as input. Email data is usually noisy and thus i...
Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang
ACL
2006
13 years 6 months ago
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data
This paper proposes a semi-supervised boosting approach to improve statistical word alignment with limited labeled data and large amounts of unlabeled data. The proposed approach ...
Hua Wu, Haifeng Wang, Zhan-yi Liu
ACL
2004
13 years 6 months ago
Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora
The parameters of statistical translation models are typically estimated from sentence-aligned parallel corpora. We show that significant improvements in the alignment and transla...
Chris Callison-Burch, David Talbot, Miles Osborne
LREC
2008
13 years 6 months ago
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Parallel corpora are critical resources for machine translation research and development since parallel corpora contain translation equivalences of various granularities. Manual a...
Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing...
EACL
2003
ACL Anthology
13 years 6 months ago
Combining Clues for Word Alignment
In this paper, a word alignment approach is presented which is based on a combination of clues. Word alignment clues indicate associations between words and phrases. They can be b...
Jörg Tiedemann