ACL
2009
13 years 2 months ago
Using Generation for Grammar Analysis and Error Detection
We demonstrate that the bidirectionality of deep grammars, allowing them to generate as well as parse sentences, can be used to automatically and effectively identify errors in th...
Michael Goodman, Francis Bond
ACL
2009
13 years 2 months ago
Capturing Errors in Written Chinese Words
A collection of 3208 reported errors of Chinese words was analyzed. Among these, 7.2% involved rarely used characters, and 98.4% were assigned common classifications of their caus...
Chao-Lin Liu, Kan-Wen Tien, Min-Hua Lai, Yi-Hsuan ...
ACL
2009
13 years 2 months ago
Automatic Satire Detection: Are You Having a Laugh?
We introduce the novel task of determining whether a newswire article is "true" or satirical. We experiment with SVMs, feature scaling, and a number of lexical and seman...
Clint Burfoot, Timothy Baldwin
ACL
2009
13 years 2 months ago
Optimizing Word Alignment Combination For Phrase Table Training
Combining word alignments trained in two translation directions has mostly relied on heuristics that are not directly motivated by intended applications. We propose a novel method...
Yonggang Deng, Bowen Zhou
ACL
2009
13 years 2 months ago
Syntax is from Mars while Semantics from Venus! Insights from Spectral Analysis of Distributional Similarity Networks
We study the global topology of the syntactic and semantic distributional similarity networks for English through the technique of spectral analysis. We observe that while the syn...
Chris Biemann, Monojit Choudhury, Animesh Mukherje...
ACL
2009
13 years 2 months ago
Validating the web-based evaluation of NLG systems
The GIVE Challenge is a recent shared task in which NLG systems are evaluated over the Internet. In this paper, we validate this novel NLG evaluation methodology by comparing the ...
Alexander Koller, Kristina Striegnitz, Donna Byron...
ACL
2009
13 years 2 months ago
A Succinct N-gram Language Model
Efficient processing of tera-scale text data is an important research topic. This paper proposes lossless compression of Ngram language models based on LOUDS, a succinct data stru...
Taro Watanabe, Hajime Tsukada, Hideki Isozaki
ACL
2009
13 years 2 months ago
Homophones and Tonal Patterns in English-Chinese Transliteration
The abundance of homophones in Chinese significantly increases the number of similarly acceptable candidates in English-to-Chinese transliteration (E2C). The dialectal factor also...
Oi Yee Kwong
ACL
2009
13 years 2 months ago
Bridging Morpho-Syntactic Gap between Source and Target Sentences for English-Korean Statistical Machine Translation
Often, Statistical Machine Translation (SMT) between English and Korean suffers from null alignment. Previous studies have attempted to resolve this problem by removing unnecessar...
Gum-Won Hong, Seung-Wook Lee, Hae-Chang Rim