Sciweavers

127 search results - page 25 / 26
» Automatic Recognition and Evaluation of Tracheoesophageal Sp...
Sort
View
CIVR
2005
Springer
160views Image Analysis» more  CIVR 2005»
13 years 11 months ago
Story Segmentation in News Videos Using Visual and Text Cues
Abstract. In this paper, we present a framework for segmenting the news programs into different story topics. The proposed method utilizes both visual and text information of the v...
Yun Zhai, Alper Yilmaz, Mubarak Shah
ICMI
2009
Springer
129views Biometrics» more  ICMI 2009»
13 years 10 months ago
Cache-based language model adaptation using visual attention for ASR in meeting scenarios
In a typical group meeting involving discussion and collaboration, people look at one another, at shared information resources such as presentation material, and also at nothing i...
Neil Cooke, Martin J. Russell
CIVR
2008
Springer
166views Image Analysis» more  CIVR 2008»
13 years 7 months ago
A probabilistic ranking framework using unobservable binary events for video search
Recent content-based video retrieval systems combine output of concept detectors (also known as high-level features) with text obtained through automatic speech recognition. This ...
Robin Aly, Djoerd Hiemstra, Arjen P. de Vries, Fra...
CIKM
2010
Springer
13 years 4 months ago
Generating advertising keywords from video content
With the proliferation of online distribution methods for videos, content owners require easier and more effective methods for monetization through advertising. Matching advertis...
Michael J. Welch, Junghoo Cho, Walter Chang
ICASSP
2011
IEEE
12 years 9 months ago
Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil
This paper presents our work on rapid language adaptation of acoustic models based on multilingual cross-language bootstrapping and unsupervised training. We used Automatic Speech...
Ngoc Thang Vu, Franziska Kraus, Tanja Schultz