Sciweavers

1423 search results - page 129 / 285
» Polyphase speech recognition
Sort
View
INTERSPEECH
2010
14 years 10 months ago
Improved neural network based language modelling and adaptation
Neural network language models (NNLM) have become an increasingly popular choice for large vocabulary continuous speech recognition (LVCSR) tasks, due to their inherent generalisa...
Junho Park, Xunying Liu, Mark J. F. Gales, Philip ...
ISMIR
2000
Springer
168views Music» more  ISMIR 2000»
15 years 7 months ago
Mel Frequency Cepstral Coefficients for Music Modeling
We examine in some detail Mel Frequency Cepstral Coefficients (MFCCs) - the dominant features used for speech recognition - and investigate their applicability to modeling music. ...
Beth Logan
LREC
2010
196views Education» more  LREC 2010»
15 years 5 months ago
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
In this paper, we describe a new multi-purpose audio-visual database on the context of speech interfaces for controlling household electronic devices. The database comprises speec...
Fernando F. Fernández-Martínez, Juan...
IJON
1998
46views more  IJON 1998»
15 years 3 months ago
Self-organizing maps of symbol strings
SOM and LVQ algorithms for symbol strings have been introduced and applied to isolatedword recognition, for the construction of an optimal pronunciation dictionary for a given spe...
Teuvo Kohonen, Panu Somervuo
CHI
2004
ACM
16 years 4 months ago
Semantic speech editing
Editing speech data is currently time-consuming and errorprone. Speech editors rely on acoustic waveform representations, which force users to repeatedly sample the underlying spe...
Steve Whittaker, Brian Amento