We show how the recognition performance of a speech recognition component in a speech retrieval system affects the retrieval effectiveness. A speech retrieval system facilitates c...
We describe a novel technique for multi-sensory speech processing for enhancing noisy speech and for improved noise-robust speech recognition. Both air- and bone-conductive microph...
Amarnag Subramanya, Li Deng, Zicheng Liu, Zhengyou...
The standard method for making the full content of audio and video material searchable is to annotate it with human-generated meta-data that describes the content in a way that...
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophis...
This paper proposes a novel framework to index and retrieve audio content from a broadcast database that contains both speech and music. In this framework, we model the acoustic eve...