Sciweavers

TASLP
2011

A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

12 years 11 months ago
A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models
—We present a simple and efficient feature modeling approach for tracking the pitch of two simultaneously active speakers. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. To obtain a probabilistic representation for the speech mixture spectrogram features of both speakers, we employ the mixture maximization model (MIXMAX) and, as an alternative, a linear interaction model. A factorial hidden Markov model is applied for tracking pitch over time. This statistical model can be used for applications beyond speech, whenever the interaction between individual sources can be represented as MIXMAX or linear model. For tracking, we use the loopy max-sum algorithm, and provide empirical comparisons to exact methods. Furthermore, we discuss a scheduling mechanism of loopy belief propagation for online tracking. We demonstrate experimental results using Mocha-TIMIT as well as data fr...
Michael Wohlmayr, Michael Stark, Franz Pernkopf
Added 15 May 2011
Updated 15 May 2011
Type Journal
Year 2011
Where TASLP
Authors Michael Wohlmayr, Michael Stark, Franz Pernkopf
Comments (0)