INTERSPEECH 2010 | Sciweavers

9

INTERSPEECH
2010

99views Signal Processing» more INTERSPEECH 2010»

Multimodal speaker diarization using oriented optical flow histograms

12 years 11 months ago

Speaker diarization is the task of partitioning an input stream into speaker homogeneous regions, or in other words, to determine "who spoke when." While approaches to t...

Mary Tai Knox, Gerald Friedland

claim paper

Read More »

15

click to vote

INTERSPEECH
2010

98views Signal Processing» more INTERSPEECH 2010»

Lexical entrainment of real users in the let's go spoken dialog system

12 years 11 months ago

Download www.cs.cmu.edu

This paper examines the lexical entrainment of real users in the Let's Go spoken dialog system. First it presents a study of the presence of entrainment in a year of human-tr...

Gabriel Parent, Maxine Eskenazi

claim paper

Read More »

18

click to vote

INTERSPEECH
2010

166views Signal Processing» more INTERSPEECH 2010»

Emotion recognition using imperfect speech recognition

12 years 11 months ago

Download www5.informatik.uni-erlangen.de

This paper investigates the use of speech-to-text methods for assigning an emotion class to a given speech utterance. Previous work shows that an emotion extracted from text can c...

Florian Metze, Anton Batliner, Florian Eyben, Tim ...

claim paper

Read More »

12

click to vote

INTERSPEECH
2010

82views Signal Processing» more INTERSPEECH 2010»

Detection of hot spots in poster conversations based on reactive tokens of audience

12 years 11 months ago

Download www.ar.media.kyoto-u.ac.jp

We present a novel scheme for indexing "hot spots" in conversations, such as poster sessions, based on the reaction of the audience. Specifically, we focus on laughters ...

Tatsuya Kawahara, Kouhei Sumi, Zhi-Qiang Chang, Ka...

claim paper

Read More »

5

click to vote

INTERSPEECH
2010

90views Signal Processing» more INTERSPEECH 2010»

Exploring web-browser based runtimes engines for creating ubiquitous speech interfaces

12 years 11 months ago

Download www.furui.cs.titech.ac.jp

Paul R. Dixon, Sadaoki Furui

claim paper

Read More »

12

click to vote

INTERSPEECH
2010

101views Signal Processing» more INTERSPEECH 2010»

Evaluation of speaker mimic technology for personalizing SGD voices

12 years 11 months ago

Download www.cslu.ogi.edu

In this paper, we demonstrate the use of state-of-the-art speech technology to transform speech from a source speaker to mimic a particular target speaker with the intention of pr...

Esther Klabbers, Alexander Kain, Jan P. H. van San...

claim paper

Read More »

15

click to vote

INTERSPEECH
2010

123views Signal Processing» more INTERSPEECH 2010»

Investigation of full-sequence training of deep belief networks for speech recognition

12 years 11 months ago

Download research.microsoft.com

Recently, Deep Belief Networks (DBNs) have been proposed for phone recognition and were found to achieve highly competitive performance. In the original DBNs, only framelevel info...

Abdel-rahman Mohamed, Dong Yu, L. Deng

claim paper

Read More »

8

click to vote

INTERSPEECH
2010

92views Signal Processing» more INTERSPEECH 2010»

Improving monaural speaker identification by double-talk detection

12 years 11 months ago

Download cs.joensuu.fi

This paper describes a novel approach to improve monoaural speaker identification where two speakers are present in a single-microphone recording. The goal is to identify both of ...

Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng...

claim paper

Read More »

4

click to vote

INTERSPEECH
2010

130views Signal Processing» more INTERSPEECH 2010»

Combination of probabilistic and possibilistic language models

12 years 11 months ago

Download lia.univ-avignon.fr

In a previous paper we proposed Web-based language models relying on the possibility theory. These models explicitly represent the possibility of word sequences. In this paper we ...

Stanislas Oger, Vladimir Popescu, Georges Linar&eg...

claim paper

Read More »

6

click to vote

INTERSPEECH
2010

71views Signal Processing» more INTERSPEECH 2010»

Time conditioned search in automatic speech recognition reconsidered

12 years 11 months ago