Sciweavers

366 search results - page 44 / 74
» Canonical state models for automatic speech recognition
TASLP
2010
14 years 4 months ago
Efficient and Robust Music Identification With Weighted Finite-State Transducers
We present an approach to music identification based on weighted finite-state transducers and Gaussian mixture models, inspired by techniques used in large-vocabulary speech recogn...
Mehryar Mohri, Pedro Moreno, Eugene Weinstein
LREC
2010
14 years 11 months ago
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network
Due to upcoming mobile telephony services with higher speech quality, a wideband (50 Hz to 7 kHz) mobile telephony derivative of TIMIT has been recorded called WTIMIT. It allows a...
Patrick Bauer, David Scheler, Tim Fingscheidt
ICASSP
2008
IEEE
15 years 4 months ago
Impact of automatic sentence segmentation on meeting summarization
This paper investigates the impact of automatic sentence segmentation on speech summarization using the ICSI meeting corpus. We use a hidden Markov model (HMM) for sentence segmen...
Yang Liu, Shasha Xie
ICASSP
2011
IEEE
14 years 1 month ago
Deep Belief Networks using discriminative features for phone recognition
Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of fea...
Abdel-rahman Mohamed, Tara N. Sainath, George Dahl...
CSL
2011
Springer
14 years 4 months ago
Detecting emotional state of a child in a conversational computer game
The automatic recognition of user’s communicative style within a spoken dialog system framework, including the affective aspects, has received increased attention in the past f...
Serdar Yildirim, Shrikanth Narayanan, Alexandros P...