SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents

9 years 12 months ago

Download marksanderson.org

Topic models such as Latent Dirichlet Allocation (LDA) [3] have been extensively used for characterizing text collections according to the topics discussed in documents. Organizing documents according to topic can be applied to different information access tasks such as document clustering, content-based recommendation or summarization. Spoken documents such as podcasts typically involve more than one speaker (e.g., meetings, interviews, chat shows or news with reporters). This paper presents a work-inprogress based on a variation of LDA that includes in the model the different speakers participating in conversational audio transcripts. Intuitively, each speaker has her own background knowledge which generates different topic and word distributions. We believe that informing a topic model with speaker segmentation (e.g., using existing speaker diarization techniques) may enhance discovery of topics in multi-speaker audio content. Categories and Subject Descriptors H.5.1 [Multimedia In...

Damiano Spina, Johanne R. Trippas, Lawrence Cavedo

Real-time Traffic

MM 2015 | Multimedia |

claim paper

Post Info
More Details (n/a)

Added	14 Apr 2016
Updated	14 Apr 2016
Type	Journal
Year	2015
Where	MM
Authors	Damiano Spina, Johanne R. Trippas, Lawrence Cavedon, Mark Sanderson

Comments (0)

Sciweavers

SpeakerLDA: Discovering Topics in Transcribed Multi-Speaker Audio Contents

MM 2015 | Multimedia |

Explore & Download

Productivity Tools

Sciweavers