Sciweavers

ICASSP
2009
IEEE
13 years 8 months ago
Acoustic compensation methods for body transmitted speech conversion
Statistical voice conversion is very effective for enhancing body transmitted speech recorded with Non-Audible Murmur (NAM) microphone. In this method, a probabilistic model to co...
Daisuke Miyamoto, Keigo Nakamura, Tomoki Toda, Hir...
ICASSP
2009
IEEE
13 years 8 months ago
Graphical Models: Statistical inference vs. determination
Using discrete Hidden-Markov-Models (HMMs) for recognition requires the quantization of the continuous feature vectors. In handwritten whiteboard note recognition it turns out tha...
Joachim Schenk, Benedikt Hörnler, Artur Braun...
ICASSP
2009
IEEE
13 years 8 months ago
High resolution audio synchronization using chroma onset features
The general goal of music synchronization is to automatically align the multiple information sources such as audio recordings, MIDI files, or digitized sheet music related to a gi...
Sebastian Ewert, Meinard Müller, Peter Grosch...
ICASSP
2009
IEEE
13 years 8 months ago
Timing and frequency synchronization for OFDM based cooperative systems
In this paper, we investigate the timing and carrier frequency offset (CFO) synchronization problem in decode and forward cooperative systems operating over frequency selective ch...
Qinfei Huang, Mounir Ghogho, Jibo Wei, Philippe Ci...
ICASSP
2009
IEEE
13 years 8 months ago
Dirichlet process mixture models with multiple modalities
The Dirichlet process can be used as a nonparametric prior for an infinite-dimensional probability mass function on the parameter space of a mixture model. The set of parameters o...
John William Paisley, Lawrence Carin
ICASSP
2009
IEEE
13 years 8 months ago
Improved lattice-based spoken document retrieval by directly learning from the evaluation measures
Lattice-based approaches have been widely used in spoken document retrieval to handle the speech recognition uncertainty and errors. Position Specific Posterior Lattices (PSPL) an...
Chao-hong Meng, Hung-yi Lee, Lin-shan Lee
ICASSP
2009
IEEE
13 years 8 months ago
Neural network based language models for highly inflective languages
Speech recognition of inflectional and morphologically rich languages like Czech is currently quite a challenging task, because simple n-gram techniques are unable to capture impo...
Tomas Mikolov, Jirí Kopecký, Lukas B...
ICASSP
2009
IEEE
13 years 8 months ago
The expected amplitude of overlapping partials of harmonic sounds
In analyzing polyphonic signals, the handling of overlapping partials is one important problem. The assumptions usually made for partial overlaps are the additivity of the linear ...
Chunghsin Yeh, Axel Roebel
ICASSP
2009
IEEE
13 years 8 months ago
Robust word boundary detection in spontaneous speech using acoustic and lexical cues
We consider the problem of word boundary detection in spontaneous speech utterances. Acoustic features have been well explored in the literature in the context of word boundary de...
Andreas Tsiartas, Prasanta K. Ghosh, Panayiotis G....
ICASSP
2009
IEEE
13 years 8 months ago
Strategies for modeling reverberant speech in the feature domain
The length of the room impulse response characterizing the acoustic path between speaker and microphone is significantly larger than the length of the analysis window used for fea...
Armin Sehr, Walter Kellermann