Search Sciweavers | Sciweavers

135 search results - page 6 / 27

» Temporal Feature Selection for Noisy Speech Recognition

108

click to vote

ISCAS
2008
IEEE

139views Hardware» more ISCAS 2008»

Missing feature speech recognition in a meeting situation with maximum SNR beamforming

15 years 8 months ago

Download www.tara.tsukuba.ac.jp

Abstract— Especially for tasks like automatic meeting transcription, it would be useful to automatically recognize speech also while multiple speakers are talking simultaneously....

Dorothea Kolossa, Shoko Araki, Marc Delcroix, Tomo...

claim paper

Read More »

115

Voted

ICASSP
2008
IEEE

137views Signal Processing» more ICASSP 2008»

Robust speaker identification using combined feature selection and missing data recognition

15 years 8 months ago

Download www.ee.uwa.edu.au

Missing data techniques have been recently applied to speaker recognition to increase performance in noisy environments. The drawback of these techniques is the vulnerability of t...

Daniel Pullella, Marco Kühne, Roberto Togneri

claim paper

Read More »

Voted

ICML
2006
IEEE

205views Machine Learning» more ICML 2006»

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

16 years 2 months ago

Download www6.in.tum.de

Many real-world sequence learning tasks require the prediction of sequences of labels from noisy, unsegmented input data. In speech recognition, for example, an acoustic signal is...

Alex Graves, Faustino J. Gomez, Jürgen Schmid...

claim paper

Read More »

100

Voted

ICASSP
2008
IEEE

101views Signal Processing» more ICASSP 2008»

Cepstral domain feature compensation based on diagonal approximation

15 years 8 months ago

Download hi.snu.ac.kr

In this paper, we propose a novel approach to feature compensation performed in the cepstral domain. We apply the linear approximation method in the cepstral domain to simplify th...

Woohyung Lim, Chang Woo Han, Jong Won Shin, Nam So...

claim paper

Read More »

121

click to vote

ICPR
2010
IEEE

219views Computer Vision» more ICPR 2010»

Crossmodal Matching of Speakers Using Lip and Voice Features in Temporally Non-Overlapping Audio and Video Streams

15 years 5 months ago

Download infoscience.epfl.ch

Person identification using audio (speech) and visual (facial appearance, static or dynamic) modalities, either independently or jointly, is a thoroughly investigated problem in pa...

Anindya Roy, Sebastien Marcel

claim paper

Read More »

« Prev « First page 6 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers