Sciweavers

TSD
2007
Springer

Information Retrieval Test Collection for Searching Spontaneous Czech Speech

13 years 10 months ago
Information Retrieval Test Collection for Searching Spontaneous Czech Speech
Abstract. This paper describes the design of the first large-scale IR test collection built for the Czech language. The creation of this collection also happens to be very challenging, as it is based on a continuous text stream from automatic transcription of spontaneous speech and thus lacks clearly defined document boundaries. All aspects of the collection building are presented, together with some general findings of initial experiments.
Pavel Ircing, Pavel Pecina, Douglas W. Oard, Jianq
Added 09 Jun 2010
Updated 09 Jun 2010
Type Conference
Year 2007
Where TSD
Authors Pavel Ircing, Pavel Pecina, Douglas W. Oard, Jianqiang Wang, Ryen W. White, Jan Hoidekr
Comments (0)