Sciweavers

» Analysis-by-synthesis features for speech recognition
CSL
2007
Springer
14 years 9 months ago
Soft indexing of speech content for search in spoken documents
The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy representation of automatic speech recognition lattices that naturally lends itself to efficient ...
Ciprian Chelba, Jorge Silva, Alex Acero
ICASSP
2009
IEEE
15 years 4 months ago
On the phonetic information in ultrasonic microphone signals
We study the phonetic information in the signal from an ultrasonic “microphone”, a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-s...
Karen Livescu, Bo Zhu, James R. Glass
ICASSP
2010
IEEE
14 years 10 months ago
Transcription-based video genre classification
In this paper, we present a new method for video genre identification based on linguistic content analysis. This approach relies on the analysis of the most frequent words in...
Stanislas Oger, Mickael Rouvier, Georges Linares
SAC
2010
ACM
14 years 10 months ago
Visual processing-inspired fern-audio features for noise-robust speaker verification
In this paper, we consider the problem of speaker verification as a two-class object detection problem in computer vision, where the object instances are 1-D short-time spectral v...
Anindya Roy, Sébastien Marcel
ICMCS
2007
IEEE
15 years 4 months ago
Alignment of Speech to Highly Imperfect Text Transcriptions
We introduce a novel and inexpensive approach for the temporal alignment of speech to highly imperfect transcripts from automatic speech recognition (ASR). Transcripts are generat...
Alexander Haubold, John R. Kender