Sciweavers

41 search results - page 8 / 9
» Predicting Automatic Speech Recognition Performance Using Pr...
Sort
View
MIR
2004
ACM
176views Multimedia» more  MIR 2004»
13 years 11 months ago
Analysing the performance of visual, concept and text features in content-based video retrieval
This paper describes revised content-based search experiments in the context of TRECVID 2003 benchmark. Experiments focus on measuring content-based video retrieval performance wi...
Mika Rautiainen, Timo Ojala, Tapio Seppänen
ICASSP
2009
IEEE
14 years 8 days ago
Language model transformation applied to lightly supervised training of acoustic model for congress meetings
For effective training of acoustic and language models for spontaneous speech such as meetings, it is significant to exploit the texts available in a large scale, which may not b...
Tatsuya Kawahara, Masato Mimura, Yuka Akita
NIPS
2003
13 years 6 months ago
A Classification-based Cocktail-party Processor
At a cocktail party, a listener can selectively attend to a single voice and filter out other acoustical interferences. How to simulate this perceptual ability remains a great cha...
Nicoleta Roman, DeLiang L. Wang, Guy J. Brown
IEICET
2008
136views more  IEICET 2008»
13 years 5 months ago
Bilingual Cluster Based Models for Statistical Machine Translation
We propose a domain specific model for statistical machine translation. It is wellknown that domain specific language models perform well in automatic speech recognition. We show ...
Hirofumi Yamamoto, Eiichiro Sumita
CLEAR
2007
Springer
271views Biometrics» more  CLEAR 2007»
13 years 11 months ago
The AIT Multimodal Person Identification System for CLEAR 2007
This paper presents the person identification system developed at Athens Information Technology and its performance in the CLEAR 2007 evaluations. The system operates on the audiov...
Andreas Stergiou, Aristodemos Pnevmatikakis, Lazar...