High-level spoken document analysis is required in many applications seeking access to the semantic content of audio data, such as information retrieval, machine translation or au...
Julien Fayolle, Fabienne Moreau, Christian Raymond...
This paper proposes a new method for bimodal information fusion in audio-visual speech recognition, where cross-modal association is considered at two levels. First, the acoustic a...
Discriminatory information about person identity is multimodal. Yet, most person recognition systems are unimodal, e.g., relying on facial appearance alone. With a view to exploiting the ...
Niall A. Fox, Ralph Gross, Jeffrey F. Cohn, Richar...
This paper introduces a method to train an error-corrective model for Automatic Speech Recognition (ASR) without using audio data. In existing techniques, it is assumed that suf...
Accessing specific or salient parts of multimedia recordings remains a challenge, as there is no obvious way of structuring and representing a mix of space-based and time-based med...