Sciweavers

NOLISP
2005
Springer
13 years 10 months ago
MLP Internal Representation as Discriminative Features for Improved Speaker Recognition
Feature projection by non-linear discriminant analysis (NLDA) can substantially increase classification performance. In automatic speech recognition (ASR) the projection provided b...
Dalei Wu, Andrew C. Morris, Jacques C. Koreman
NOLISP
2005
Springer
13 years 10 months ago
The COST-277 European Action: An Overview
This paper summarizes the rationale for proposing the COST-277 “nonlinear speech processing” action, and the work done during these last four years. In addition, future perspec...
Marcos Faúndez-Zanuy, Unto Laine, Gernot Ku...
NOLISP
2005
Springer
13 years 10 months ago
Advanced Methods for Glottal Wave Extraction
Abstract. Glottal inverse filtering is a technique used to derive the glottal waveform during voiced speech. Closed phase inverse filtering (CPIF) is a common approach for achiev...
Jacqueline Walker, Peter J. Murphy
NOLISP
2005
Springer
13 years 10 months ago
Pseudo Cepstral Analysis of Czech Vowels
Real generalized cepstral analysis is introduced and applied to speech deconvolution. Real pseudo cepstrum of the vocal tract model impulse response is defined and applied to the a...
Robert Vích
NOLISP
2005
Springer
13 years 10 months ago
On the Acoustic-to-Electropalatographic Mapping
Electropalatography is a well established technique for recording information on the patterns of contact between the tongue and the hard palate during speech. It leads to a stream ...
Asterios Toutios, Konstantinos G. Margaritis
NOLISP
2005
Springer
13 years 10 months ago
Segment Boundaries in Low Latency Phonetic Recognition
This study analyses how the reduction of the look-ahead length of a two pass phonetic decoder influences the alignment of the segment boundaries. It is shown how the optimization ...
Giampiero Salvi
NOLISP
2005
Springer
13 years 10 months ago
Weighting Scores to Improve Speaker-Dependent Threshold Estimation in Text-Dependent Speaker Verification
The difficulty of obtaining data from impostors and the scarcity of data are two factors that have a large influence in the estimation of speakerdependent thresholds in text-depend...
Javier R. Saeta, Javier Hernando
NOLISP
2005
Springer
13 years 10 months ago
A Simple, Quasi-linear, Discrete Model of Vocal Fold Dynamics
In current speech technology, linear prediction dominates. The linear vocal tract model is well justified biomechanically, and linear prediction is a simple and well understood si...
Max Little, Patrick McSharry, Irene Moroz, Stephen...
NOLISP
2005
Springer
13 years 10 months ago
F0 and Intensity Distributions of Marsec Speakers: Types of Speaker Prosody
Most research on F0 has attempted to model the behaviour of an entire linguistic community (e.g of speakers of US or UK English, French, Japanese etc). In this research, we attempt...
Brigitte Zellner Keller
NOLISP
2005
Springer
13 years 10 months ago
Third-Order Moments of Filtered Speech Signals for Robust Speech Recognition
Novel speech features calculated from third-order statistics of subband-filtered speech signals are introduced and studied for robust speech recognition. These features have the p...
Kevin M. Indrebo, Richard J. Povinelli, Michael T....