Sciweavers

ACL
2012

Chinese Comma Disambiguation for Discourse Analysis

11 years 6 months ago
Chinese Comma Disambiguation for Discourse Analysis
The Chinese comma signals the boundary of discourse units and also anchors discourse relations between adjacent text spans. In this work, we propose a discourse structureoriented classification of the comma that can be automatically extracted from the Chinese Treebank based on syntactic patterns. We then experimented with two supervised learning methods that automatically disambiguate the Chinese comma based on this classification. The first method integrates comma classification into parsing, and the second method adopts a “post-processing” approach that extracts features from automatic parses to train a classifier. The experimental results show that the second approach compares favorably against the first approach.
Yaqin Yang, Nianwen Xue
Added 29 Sep 2012
Updated 29 Sep 2012
Type Journal
Year 2012
Where ACL
Authors Yaqin Yang, Nianwen Xue
Comments (0)