KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems

15 years 2 months ago

Download www.lrec-conf.org

A speech database, named KALAKA, was created to support the Albayzin 2008 Evaluation of Language Recognition Systems, organized by the Spanish Network on Speech Technologies from May to November 2008. This evaluation, designed according to the criteria and methodology applied in the NIST Language Recognition Evaluations, involved four target languages: Basque, Catalan, Galician and Spanish (official languages in Spain), and included speech signals in other (unknown) languages to allow open-set verification trials. In this paper, the process of designing, collecting data and building the train, development and evaluation datasets of KALAKA is described. Results attained in the Albayzin 2008 LRE are presented as a means of evaluating the database. The performance of a state-of-the-art language recognition system on a closed-set evaluation task is also presented for reference. Future work includes extending KALAKA by adding Portuguese and English as target languages and renewing the set ...

Luis Javier Rodríguez-Fuentes, Mikel Pe&nti

Real-time Traffic

Albayzin 2008 Evaluation | Education | Language Recognition | LREC 2010 | Target Languages |

claim paper

» The Vera am Mittag German audiovisual emotional speech database

» Thai Broadcast News Corpus Construction and Evaluation

» Language and variety verification on broadcast news for Portuguese

» Advances in the CMUInteract Arabic GALE Transcription System

» A robust scene recognition system for baseball broadcast using datadriven approach

» A Study of the Influence of Speech Type on Automatic Language Recognition Performance

» Onthefly lattice rescoring for realtime automatic speech recognition

» SUBPAL A Device for Reading Aloud Subtitles from Television and Cinema

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Germán Bordel, Amparo Varona, Mireia Díez

Comments (0)

Sciweavers

KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems

Albayzin 2008 Evaluation | Education | Language Recognition | LREC 2010 | Target Languages |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers