Sciweavers

ICASSP
2008
IEEE
13 years 11 months ago
Effective error prediction using decision tree for ASR grammar network in call system
CALL (Computer Assisted Language Learning) systems using ASR (Automatic Speech Recognition) for second language learning have received increasing interest recently. However, it st...
Hongcui Wang, Tatsuya Kawahara
ICASSP
2008
IEEE
13 years 11 months ago
Adaptation of compressed HMM parameters for resource-constrained speech recognition
Recently, we successfully developed and reported a new unsupervised online adaptation technique, which jointly compensates for additive and convolutive distortions with vector Tay...
Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, A...
MHCI
2009
Springer
13 years 11 months ago
Contextual push-to-talk: a new technique for reducing voice dialog duration
We present a technique in which physical controls have both normal and voice-enabled activation styles. In the case of the latter, knowledge of which physical control was activate...
Garrett Weinberg
ICMI
2009
Springer
95views Biometrics» more  ICMI 2009»
13 years 11 months ago
Multimodal inference for driver-vehicle interaction
In this paper we present a novel system for driver-vehicle interaction which combines speech recognition with facialexpression recognition to increase intention recognition accura...
Tevfik Metin Sezgin, Ian Davies, Peter Robinson
ICCPOL
2009
Springer
13 years 11 months ago
Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue
Abstract. In a spoken dialogue system, the speech recognition performance accounts for the largest part of the overall system performance. Yet spontaneous speech recognition has an...
Sangwoo Kang, Songwook Lee, Jungyun Seo
IEEEIAS
2009
IEEE
13 years 11 months ago
Privacy Protection for Speech Information
—Ubiquitous network society will be achieved soon. In the society, all electronic equipments including “sensors” are connected to the network and communicate each other to sh...
Kazumasa Yamamoto, Seiichi Nakagawa
SEMCO
2009
IEEE
13 years 11 months ago
Enhanced Multimedia Content Access and Exploitation Using Semantic Speech Retrieval
—Techniques for automatic annotation of spoken content making use of speech recognition technology have long been characterized as holding unrealized promise to provide access to...
Roeland Ordelman, Franciska de Jong, Martha Larson
ICASSP
2009
IEEE
13 years 11 months ago
Filtering web text to match target genres
In language modeling for speech recognition, both the amount of training data and the match to the target task impact the goodness of the model, with the trade-off usually favorin...
Marius A. Marin, Sergey Feldman, Mari Ostendorf, M...
ICASSP
2009
IEEE
13 years 11 months ago
Affine invariant features and their application to speech recognition
This paper proposes a set of affine invariant features (AIFs) for sequence data. The proposed AIFs can be calculated directly from the sequence data, and their invariance to af...
Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu
ICASSP
2009
IEEE
13 years 11 months ago
Training and adapting MLP features for Arabic speech recognition
Features derived from Multi-Layer Perceptrons (MLPs) are becoming increasingly popular for speech recognition. This paper describes various schemes for applying these features to ...
J. Park, Frank Diehl, M. J. F. Gales, Marcus Tomal...