Sciweavers

IJCNLP
2004
Springer

Detecting Sentence Boundaries in Japanese Speech Transcriptions Using a Morphological Analyzer

13 years 10 months ago
Detecting Sentence Boundaries in Japanese Speech Transcriptions Using a Morphological Analyzer
We present a method to automatically detect sentence boundaries(SBs) in Japanese speech transcriptions. Our method uses a Japanese morphological analyzer that is based on a cost calculation and selects as the best result the one with the minimum cost. The idea behind using a morphological analyzer to identify candidates for SBs is that the analyzer outputs lower costs for better sequences of morphemes. After the candidate SBs have been identified, the unsuitable candidates are deleted by using lexical information acquired from the training corpus. Our method had a 77.24% precision, 88.00% recall, and 0.8277 F-Measure, for a corpus consisting of lecture speech transcriptions in which the SBs are not given.
Sachie Tajima, Hidetsugu Nanba, Manabu Okumura
Added 02 Jul 2010
Updated 02 Jul 2010
Type Conference
Year 2004
Where IJCNLP
Authors Sachie Tajima, Hidetsugu Nanba, Manabu Okumura
Comments (0)