Impact of automatic sentence segmentation on meeting summarization

This paper investigates the impact of automatic sentence segmentation on speech summarization using the ICSI meeting corpus. We use a hidden Markov model (HMM) for sentence segmentation that integrates an N-gram language model and pause information, and a maximum marginal relevance (MMR) based extractive summarization method. The system-generated summaries are compared to multiple human summaries using ROUGE scores. The decision thresholds of the segmentation system are varied to examine the impact of different segmentations on summarization. We find that (1) using system-generated utterance segments degrades summarization performance compared to using human-annotated sentences; (2) segmentation needs to be optimized for summarization rather than for the segmentation task itself, although the patterns are slightly different from prior work on other tasks such as parsing; and (3) there are effects from different summarization evaluation metrics as well as from speech recognition errors.
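To make the pipeline in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a segmenter has already produced, for each word, a posterior probability that a sentence boundary follows it (the HMM itself is not shown), and it uses plain bag-of-words cosine similarity for MMR. The function names `segment_by_threshold` and `mmr_summarize`, the parameter `lam`, and the toy data are all hypothetical.

```python
import math
from collections import Counter


def segment_by_threshold(words, boundary_probs, threshold):
    """Cut the word stream after every word whose boundary
    probability is at least `threshold` (assumed segmenter output)."""
    segments, current = [], []
    for word, prob in zip(words, boundary_probs):
        current.append(word)
        if prob >= threshold:
            segments.append(" ".join(current))
            current = []
    if current:  # keep any trailing words as a final segment
        segments.append(" ".join(current))
    return segments


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_summarize(segments, k=2, lam=0.7):
    """Greedy MMR: at each step pick the segment maximizing
    lam * sim(segment, document) - (1 - lam) * max sim(segment, selected)."""
    vecs = [Counter(s.lower().split()) for s in segments]
    doc = Counter()
    for v in vecs:
        doc.update(v)

    selected = []
    candidates = list(range(len(segments)))
    while candidates and len(selected) < k:
        def score(i):
            redundancy = max(
                (cosine(vecs[i], vecs[j]) for j in selected), default=0.0
            )
            return lam * cosine(vecs[i], doc) - (1 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    # Return the chosen segments in their original order.
    return [segments[i] for i in sorted(selected)]


if __name__ == "__main__":
    words = "we should fix the budget the demo is on friday".split()
    probs = [0.1, 0.1, 0.2, 0.1, 0.9, 0.1, 0.4, 0.1, 0.1, 0.95]
    # Varying the threshold changes the units the summarizer can extract.
    for threshold in (0.3, 0.5):
        segs = segment_by_threshold(words, probs, threshold)
        print(threshold, mmr_summarize(segs, k=1))
```

Lowering the threshold yields more, shorter segments, while raising it yields fewer, longer ones. Because the summarizer can only extract whole segments, this is the mechanism through which the choice of threshold can affect summary quality.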
Yang Liu, Shasha Xie
Added 30 May 2010
Updated 30 May 2010
Type Conference
Year 2008