Sciweavers

8 search results - page 1 / 2
TSD
2009
Springer
13 years 11 months ago
The Czech Broadcast Conversation Corpus
This paper presents the final version of the Czech Broadcast Conversation Corpus released at the Linguistic Data Consortium (LDC). The corpus contains 72 recordings of a...
Jáchym Kolář, Jan Švec
LREC
2008
13 years 6 months ago
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations
Structural metadata extraction (MDE) research aims to develop techniques for automatic conversion of raw speech recognition output to forms that are more useful to humans and to d...
Jáchym Kolář, Jan Švec
ICASSP
2009
IEEE
13 years 11 months ago
Genre effects on automatic sentence segmentation of speech: A comparison of broadcast news and broadcast conversations
We investigate genre effects on the task of automatic sentence segmentation, focusing on two important domains – broadcast news (BN) and broadcast conversation (BC). We employ a...
Jáchym Kolář, Yang Liu, Elizabeth Sh...
LREC
2010
13 years 6 months ago
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
This paper presents the EPAC corpus, which is composed of 100 hours of manually transcribed conversational speech and of the outputs of automatic tools (automatic segmenta...
Yannick Estève, Thierry Bazillon, Jean-Yves...
ICASSP
2011
IEEE
12 years 8 months ago
Robust speaker turn role labeling of TV Broadcast News shows
Speaker role recognition in TV Broadcast News shows is addressed in this paper with a particular focus on speaker turn role labeling. A mixed approach combining speaker clustering...
Géraldine Damnati, Delphine Charlet