Sciweavers

151 search results - page 9 / 31
» A Chinese Corpus for Linguistic Research
COLING
2002
14 years 11 months ago
An Agent-based Approach to Chinese Named Entity Recognition
Chinese NE (Named Entity) recognition is a difficult problem because of the uncertainty in word segmentation and flexibility in language structure. This paper proposes the use of ...
Shiren Ye, Tat-Seng Chua, Jimin Liu
ACL
2010
14 years 9 months ago
Learning Lexicalized Reordering Models from Reordering Graphs
Lexicalized reordering models play a crucial role in phrase-based translation systems. They are usually learned from the word-aligned bilingual corpus by examining the reordering ...
Jinsong Su, Yang Liu, Yajuan Lü, Haitao Mi, Q...
LREC
2008
15 years 1 months ago
Linguistic Resources for Reconstructing Spontaneous Speech Text
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accompl...
Erin Fitzgerald, Frederick Jelinek
IRI
2008
IEEE
15 years 6 months ago
Curate a transliteration corpus from transliteration/translation pairs
Transliteration of new named entities is important for information retrieval that crosses two or more languages. Rule-based machine transliteration is not satisfactory, since dif...
Shih-Hung Wu, Yu-Te Li
LREC
2010
15 years 1 months ago
Towards a Large Parallel Corpus of Cleft Constructions
We present our efforts to create a large-scale, semi-automatically annotated parallel corpus of cleft constructions. The corpus is intended to reduce or make more effective the ma...
Gerlof Bouma, Lilja Øvrelid, Jonas Kuhn