Sciweavers

LREC
2008
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
This paper describes the Norwegian broadcast news speech corpus RUNDKAST. The corpus contains recordings of approximately 77 hours of broadcast news shows from the Norwegian broad...
Ingunn Amdal, Ole Morten Strand, Jørn Almbe...
LREC
2010
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
This paper presents the EPAC corpus, which is composed of a set of 100 hours of manually transcribed conversational speech and of the outputs of automatic tools (automatic segmenta...
Yannick Estève, Thierry Bazillon, Jean-Yves...
CORR
2000
Springer
Many uses, many annotations for large speech corpora: Switchboard and TDT as case studies
This paper discusses the challenges that arise when large speech corpora receive an ever-broadening range of diverse and distinct annotations. Two case studies of this process are...
David Graff, Steven Bird
LREC
2010
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
The Arabic Treebank (ATB) Project at the Linguistic Data Consortium (LDC) has embarked on a large corpus of Broadcast News (BN) transcriptions, and this has led to a number of new...
Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zag...
LREC
2010
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain
Typical broadcast material contains not only studio-recorded texts read by trained speakers, but also spontaneous and dialect speech, debates with cross-talk, voice-overs, and on-...
Doris Baum, Daniel Schneider, Rolf Bardeli, Jochen...