Search Sciweavers | Sciweavers

145 search results - page 17 / 29

» Factor analysed hidden Markov models for speech recognition

116

click to vote

ICMI
2004
Springer

159views Biometrics» more ICMI 2004»

A segment-based audio-visual speech recognizer: data collection, development, and initial experiments

15 years 7 months ago

Download groups.csail.mit.edu

This paper presents the development and evaluation of a speaker-independent audio-visual speech recognition (AVSR) system that utilizes a segment-based modeling strategy. To suppo...

Timothy J. Hazen, Kate Saenko, Chia-Hao La, James ...

claim paper

Read More »

100

click to vote

NAACL
2010

197views Computational Linguistics» more NAACL 2010»

Investigations into the Crandem Approach to Word Recognition

14 years 11 months ago

Download www.aclweb.org

We suggest improvements to a previously proposed framework for integrating Conditional Random Fields and Hidden Markov Models, dubbed a Crandem system (2009). The previous authors...

Rohit Prabhavalkar, Preethi Jyothi, William Hartma...

claim paper

Read More »

118

click to vote

BIOADIT
2004
Springer

137views Information Technology» more BIOADIT 2004»

Biologically Plausible Speech Recognition with LSTM Neural Nets

15 years 5 months ago

Download www.informatik.uni-ulm.de

Abstract. Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) are local in space and time and closely related to a biological model of memory in the prefrontal cortex. N...

Alex Graves, Douglas Eck, Nicole Beringer, Jü...

claim paper

Read More »

122

click to vote

ICASSP
2011
IEEE

184views Signal Processing» more ICASSP 2011»

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition

14 years 5 months ago

Download mirlab.org

Current hidden Markov acoustic modeling for large vocabulary continuous speech recognition (LVCSR) relies on the availability of abundant labeled transcriptions. Given that speech...

Xiaodong Cui, Jing Huang, Jen-Tzung Chien

claim paper

Read More »

click to vote

PAMI
2002

98views more PAMI 2002»

Extraction of Visual Features for Lipreading

15 years 1 months ago

Download www.ri.cmu.edu

The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motion, such as those of the head, convey additional information...

Iain Matthews, Timothy F. Cootes, J. Andrew Bangha...

claim paper

Read More »

« Prev « First page 17 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers