Sciweavers

ACL
2008
13 years 6 months ago
Measure Word Generation for English-Chinese SMT Systems
Measure words in Chinese are used to indicate the count of nouns. Conventional statistical machine translation (SMT) systems do not perform well on measure word generation due to ...
Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, Ming Z...
ACL
2008
13 years 6 months ago
Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion
We present a discriminative structureprediction model for the letter-to-phoneme task, a crucial step in text-to-speech processing. Our method encompasses three tasks that have bee...
Sittichai Jiampojamarn, Colin Cherry, Grzegorz Kon...
ACL
2008
13 years 6 months ago
Inducing Gazetteers for Named Entity Recognition by Large-Scale Clustering of Dependency Relations
We propose using large-scale clustering of dependency relations between verbs and multiword nouns (MNs) to construct a gazetteer for named entity recognition (NER). Since dependen...
Jun'ichi Kazama, Kentaro Torisawa
ACL
2008
13 years 6 months ago
Phrase Table Training for Precision and Recall: What Makes a Good Phrase and a Good Phrase Pair?
In this work, the problem of extracting phrase translation is formulated as an information retrieval process implemented with a log-linear model aiming for a balanced precision an...
Yonggang Deng, Jia Xu, Yuqing Gao
ACL
2008
13 years 6 months ago
Semi-Supervised Sequential Labeling and Segmentation Using Giga-Word Scale Unlabeled Data
This paper provides evidence that the use of more unlabeled data in semi-supervised learning can improve the performance of Natural Language Processing (NLP) tasks, such as part-o...
Jun Suzuki, Hideki Isozaki
ACL
2007
13 years 6 months ago
Construction of Domain Dictionary for Fundamental Vocabulary
For natural language understanding, it is essential to reveal semantic relations between words. To date, only the IS-A relation has been publicly available. Toward deeper natural ...
Chikara Hashimoto, Sadao Kurohashi
ACL
2008
13 years 6 months ago
Learning Bilingual Lexicons from Monolingual Corpora
We present a method for learning bilingual translation lexicons from monolingual corpora. Word types in each language are characterized by purely monolingual features, such as con...
Aria Haghighi, Percy Liang, Taylor Berg-Kirkpatric...
ACL
2008
13 years 6 months ago
A Critical Reassessment of Evaluation Baselines for Speech Summarization
We assess the current state of the art in speech summarization, by comparing a typical summarizer on two different domains: lecture data and the SWITCHBOARD corpus. Our results ca...
Gerald Penn, Xiaodan Zhu
ACL
2008
13 years 6 months ago
Cohesive Phrase-Based Decoding for Statistical Machine Translation
Phrase-based decoding produces state-of-theart translations with no regard for syntax. We add syntax to this process with a cohesion constraint based on a dependency tree for the ...
Colin Cherry