Sciweavers

SIGIR
2008
ACM

Term clouds as surrogates for user generated speech

13 years 4 months ago
Term clouds as surrogates for user generated speech
User generated spoken audio remains a challenge for Automatic Speech Recognition (ASR) technology and content-based audio surrogates derived from ASR-transcripts must be error robust. An investigation of the use of term clouds as surrogates for podcasts demonstrates that ASR term clouds closely approximate term clouds derived from human-generated transcripts across a range of cloud sizes. A user study confirms the conclusion that ASR-clouds are viable surrogates for depicting the content of podcasts. Categories and Subject Descriptors H.3 [Information Storage and Retrieval]: H.3.1 Content Analysis and Indexing; H.3.3 Information Search and Retrieval General Terms Algorithms, Measurement, Performance, Experimentation
Manos Tsagkias, Martha Larson, Maarten de Rijke
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2008
Where SIGIR
Authors Manos Tsagkias, Martha Larson, Maarten de Rijke
Comments (0)