Semi-supervised extractive speech summarization via co-training algorithm

14 years 11 months ago

Download ssli.ee.washington.edu

Supervised methods for extractive speech summarization require a large training set. Summary annotation is often expensive and time consuming. In this paper, we exploit semisupervised approaches to leverage unlabeled data. In particular, we investigate co-training algorithm for the task of extractive meeting summarization. Compared with text summarization, speech summarization task has its unique characteristic in that the features naturally split into two sets: textual features and prosodic/acoustic features. Such characteristic makes co-training an appropriate approach for semi-supervised speech summarization. Our experiments on ICSI meeting corpus show that by utilizing the unlabeled data, co-training algorithm significantly improves summarization performance when only a small amount of labeled data is available.

Shasha Xie, Hui Lin, Yang Liu

Real-time Traffic

Extractive Meeting Summarization | Extractive Speech Summarization | INTERSPEECH 2010 | Signal Processing | Speech Summarization |

claim paper

Post Info
More Details (n/a)

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Shasha Xie, Hui Lin, Yang Liu

Comments (0)

Sciweavers

Semi-supervised extractive speech summarization via co-training algorithm

Extractive Meeting Summarization | Extractive Speech Summarization | INTERSPEECH 2010 | Signal Processing | Speech Summarization |

Explore & Download

Productivity Tools

Sciweavers