Sciweavers

EMNLP
2010

Automatic Comma Insertion for Japanese Text Generation

13 years 2 months ago
Automatic Comma Insertion for Japanese Text Generation
This paper proposes a method for automatically inserting commas into Japanese texts. In Japanese sentences, commas play an important role in explicitly separating the constituents, such as words and phrases, of a sentence. The method can be used as an elemental technology for natural language generation such as speech recognition and machine translation, or in writing-support tools for non-native speakers. We categorized the usages of commas and investigated the appearance tendency of each category. In this method, the positions where commas should be inserted are decided based on a machine learning approach. We conducted a comma insertion experiment using a text corpus and confirmed the effectiveness of our method.
Masaki Murata, Tomohiro Ohno, Shigeki Matsubara
Added 11 Feb 2011
Updated 11 Feb 2011
Type Journal
Year 2010
Where EMNLP
Authors Masaki Murata, Tomohiro Ohno, Shigeki Matsubara
Comments (0)