Sciweavers

TASLP
2008

Semantic Annotation and Retrieval of Music and Sound Effects

We present a computer audition system that can both annotate novel audio tracks with semantically meaningful words and retrieve relevant tracks from a database of unlabeled audio content given a text-based query. We consider the related tasks of content-based audio annotation and retrieval as one supervised multiclass, multilabel problem in which we model the joint probability of acoustic features and words. We collect a data set of 1700 human-generated annotations that describe 500 Western popular music tracks. For each word in a vocabulary, we use this data to train a Gaussian mixture model (GMM) over an audio feature space. We estimate the parameters of the model using the weighted mixture hierarchies expectation maximization algorithm. This algorithm is more scalable to large data sets and produces better density estimates than standard parameter estimation techniques. The quality of the music annotations produced by our system is comparable with the performance of humans on the sa...
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2008
Where TASLP
Authors Douglas Turnbull, Luke Barrington, D. Torres, Gert R. G. Lanckriet
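
The per-word modeling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: each word is modeled here by a single diagonal Gaussian (a one-component stand-in for the GMM) fitted by plain maximum likelihood, not by the weighted mixture hierarchies EM algorithm. All function names, the toy two-dimensional features, and the words "calm" and "loud" are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def fit_diag_gaussian(vectors, var_floor=1e-3):
    """Maximum-likelihood diagonal Gaussian (simplified stand-in for a GMM)."""
    n = len(vectors)
    dim = len(vectors[0])
    mean = [sum(v[d] for v in vectors) / n for d in range(dim)]
    # Floor the variance so a dimension with no spread does not divide by zero.
    var = [
        max(sum((v[d] - mean[d]) ** 2 for v in vectors) / n, var_floor)
        for d in range(dim)
    ]
    return mean, var


def log_density(x, model):
    """Log-density of one feature vector under a diagonal Gaussian."""
    mean, var = model
    return sum(
        -0.5 * (math.log(2 * math.pi * s) + (xi - m) ** 2 / s)
        for xi, m, s in zip(x, mean, var)
    )


def train_word_models(tracks, annotations):
    """Pool the feature vectors of every track labeled with a word, then fit."""
    pooled = defaultdict(list)
    for track_id, words in annotations.items():
        for word in words:
            pooled[word].extend(tracks[track_id])
    return {word: fit_diag_gaussian(vecs) for word, vecs in pooled.items()}


def track_score(frames, model):
    """Average per-frame log-likelihood of a track under one word model."""
    return sum(log_density(x, model) for x in frames) / len(frames)


def annotate(frames, models, top_k=1):
    """Annotation: rank vocabulary words for one unlabeled track."""
    ranked = sorted(
        models, key=lambda w: track_score(frames, models[w]), reverse=True
    )
    return ranked[:top_k]


def retrieve(word, models, unlabeled):
    """Retrieval: rank unlabeled tracks for a one-word text query."""
    return sorted(
        unlabeled,
        key=lambda t: track_score(unlabeled[t], models[word]),
        reverse=True,
    )


# Toy data (hypothetical): each track is a list of 2-D feature vectors.
tracks = {"t1": [[0.0, 0.1], [0.2, 0.0]], "t2": [[5.0, 5.1], [4.8, 5.2]]}
annotations = {"t1": ["calm"], "t2": ["loud"]}
models = train_word_models(tracks, annotations)

unlabeled = {"a": [[0.1, 0.1]], "b": [[5.0, 5.0]]}
print(annotate(unlabeled["a"], models))
print(retrieve("loud", models, unlabeled))
```

The same set of word models serves both tasks: annotation ranks words for a fixed track, while retrieval ranks tracks for a fixed word. A fuller version would replace the single Gaussian with a multi-component mixture and a more scalable estimation procedure.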