Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

160

ICASSP
2010
IEEE

239views Signal Processing» more ICASSP 2010»

Towards multi-speaker unsupervised speech pattern discovery

15 years 6 months ago

Towards multi-speaker unsupervised speech pattern discovery

Download people.csail.mit.edu

In this paper, we explore the use of a Gaussian posteriorgram based representation for unsupervised discovery of speech patterns. Compared with our previous work, the new approach provides signiﬁcant improvement towards speaker independence. The framework consists of three main procedures: a Gaussian posteriorgram generation procedure which learns an unsupervised Gaussian mixture model and labels each speech frame with a Gaussian posteriorgram representation; a segmental dynamic time warping procedure which locates pairs of similar sequences of Gaussian posteriorgram vectors; and a graph clustering procedure which groups similar sequences into clusters. We demonstrate the viability of using the posteriorgram approach to handle many talkers by ﬁnding clusters of words in the TIMIT corpus.

Yaodong Zhang, James R. Glass

Real-time Traffic

Gaussian Posteriorgram | ICASSP 2010 | Procedure | Signal Processing | Unsupervised Gaussian Mixture |

claim paper

Related Content

» Unsupervised Pattern Discovery in Speech

» Towards robust word discovery by selfsimilarity matrix comparison

» Analysis of Head Gesture and Prosody Patterns for ProsodyDriven HeadGesture Animation

» Unsupervised content discovery in composite audio

» Discovering meaningful multimedia patterns with audiovisual concepts and associated text

» Discovering Multivariate Motifs using Subsequence Density Estimation and Greedy Mixture Le...

» Discovering voter preferences in blogs using mixtures of topic models

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Yaodong Zhang, James R. Glass

Comments (0)