Sciweavers

151 search results - page 9 / 31
» A Chinese Corpus for Linguistic Research
Sort
View
COLING
2002
14 years 9 months ago
An Agent-based Approach to Chinese Named Entity Recognition
Chinese NE (Named Entity) recognition is a difficult problem because of the uncertainty in word segmentation and flexibility in language structure. This paper proposes the use of ...
Shiren Ye, Tat-Seng Chua, Jimin Liu
ACL
2010
14 years 6 months ago
Learning Lexicalized Reordering Models from Reordering Graphs
Lexicalized reordering models play a crucial role in phrase-based translation systems. They are usually learned from the word-aligned bilingual corpus by examining the reordering ...
Jinsong Su, Yang Liu, Yajuan Lü, Haitao Mi, Q...
LREC
2008
105views Education» more  LREC 2008»
14 years 11 months ago
Linguistic Resources for Reconstructing Spontaneous Speech Text
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accompl...
Erin Fitzgerald, Frederick Jelinek
IRI
2008
IEEE
15 years 3 months ago
Curate a transliteration corpus from transliteration/translation pairs
Transliteration of new named entity is important for information retrieval that crosses two or multiple language. Rule-based machine transliteration is not satisfactory, since dif...
Shih-Hung Wu, Yu-Te Li
LREC
2010
143views Education» more  LREC 2010»
14 years 11 months ago
Towards a Large Parallel Corpus of Cleft Constructions
We present our efforts to create a large-scale, semi-automatically annotated parallel corpus of cleft constructions. The corpus is intended to reduce or make more effective the ma...
Gerlof Bouma, Lilja Øvrelid, Jonas Kuhn