Sciweavers

BIOID
2008

Multimodal Speaker Identification Based on Text and Speech

Abstract. This paper proposes a novel method for speaker identification based on both speech utterances and their transcribed text. The transcribed text of each speaker's utterance is processed by probabilistic latent semantic indexing (PLSI), which offers a powerful means of modeling each speaker's vocabulary using a number of hidden topics that are closely related to his/her identity, function, or expertise. Mel-frequency cepstral coefficients (MFCCs) are extracted from each speech frame, and their dynamic range is quantized into a number of predefined bins in order to compute local MFCC histograms for each speech utterance, which is time-aligned with the transcribed text. Two identity scores are computed independently: one by PLSI applied to the text, and one by a nearest neighbor classifier applied to the local MFCC histograms. It is demonstrated that a convex combination of the two scores is more accurate than the individual scores on speaker identification experimen...
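
The speech branch and the score fusion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the equal-width binning, the L1 distance, the 1/(1+d) similarity, and all function names (mfcc_histogram, nearest_neighbor_scores, fuse_scores, identify) are assumptions made for illustration. MFCC extraction and the PLSI text model are not shown; the text scores are taken as given inputs.

```python
from typing import Dict, List, Sequence


def mfcc_histogram(frames: Sequence[Sequence[float]], n_bins: int,
                   lo: float, hi: float) -> List[float]:
    """Quantize each MFCC dimension into n_bins equal-width bins over
    [lo, hi] and return the concatenated, normalized histograms.
    Equal-width binning is an assumption, not taken from the paper."""
    if not frames:
        raise ValueError("need at least one frame")
    n_coeffs = len(frames[0])
    width = (hi - lo) / n_bins
    hist = [0.0] * (n_coeffs * n_bins)
    for frame in frames:
        for k, value in enumerate(frame):
            # Clamp out-of-range values into the first or last bin.
            b = min(max(int((value - lo) / width), 0), n_bins - 1)
            hist[k * n_bins + b] += 1.0
    return [count / len(frames) for count in hist]


def nearest_neighbor_scores(query: List[float],
                            references: Dict[str, List[float]]) -> Dict[str, float]:
    """Score each enrolled speaker by closeness to the query histogram.
    Assumed here: L1 distance d, similarity 1 / (1 + d), normalized to sum to 1."""
    sims = {}
    for speaker, ref in references.items():
        dist = sum(abs(q - r) for q, r in zip(query, ref))
        sims[speaker] = 1.0 / (1.0 + dist)
    total = sum(sims.values())
    return {speaker: s / total for speaker, s in sims.items()}


def fuse_scores(text_scores: Dict[str, float],
                speech_scores: Dict[str, float],
                alpha: float) -> Dict[str, float]:
    """Convex combination: alpha * text + (1 - alpha) * speech, 0 <= alpha <= 1."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return {s: alpha * text_scores[s] + (1.0 - alpha) * speech_scores[s]
            for s in text_scores}


def identify(scores: Dict[str, float]) -> str:
    """Return the speaker with the highest score."""
    return max(scores, key=scores.get)


# Toy usage with made-up numbers (two speakers, two coefficients, two bins).
references = {"a": [1.0, 0.0, 0.0, 1.0], "b": [0.0, 1.0, 1.0, 0.0]}
query = mfcc_histogram([[0.1, 0.9], [0.2, 0.8]], n_bins=2, lo=0.0, hi=1.0)
speech_scores = nearest_neighbor_scores(query, references)
text_scores = {"a": 0.4, "b": 0.6}  # placeholder for PLSI-based scores
fused = fuse_scores(text_scores, speech_scores, alpha=0.5)
print(identify(fused))
```

The weight alpha controls how much the text modality contributes; alpha = 1 uses only the text scores and alpha = 0 only the speech scores. How the weight is chosen in the paper is not stated in the abstract.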

Panagiotis Moschonas, Constantine Kotropoulos

BIOID 2008 | Biometrics | Cepstral Coefficients | Latent Semantic Indexing | Speaker Identification

Related Content

» Speaker Identification Using Instantaneous Frequencies

» Scalability Analysis of Audio-Visual Person Identity Verification

» Language Identification via Large Vocabulary Speaker Independent Continuous Speech Recogni...

» Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments

» Speaker Discriminative Weighting Method for VQ-Based Speaker Identification

» Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals

» Speaker identification by combining MFCC and phase information in noisy environments

» Audiovisual speaker identification using coupled hidden Markov models

» Cross-modal Matching of Speakers Using Lip and Voice Features in Temporally Non-Overlapping ...

Post Info

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	BIOID
Authors	Panagiotis Moschonas, Constantine Kotropoulos
