Automatic audio tagging using covariate shift adaptation

15 years 1 months ago

Download sugiyama-www.cs.titech.ac.jp

Automatically annotating or tagging unlabeled audio ﬁles has several applications, such as database organization and recommender systems. We are interested in the case where the system is trained using clean high-quality audio ﬁles, but most of the ﬁles that need to be automatically tagged during the test phase are heavily compressed and noisy, perhaps because they were captured on a mobile device. In this situation we assume the audio ﬁles follow a covariate shift model in the acoustic feature space, i.e., the feature distributions are different in the training and test phases, but the conditional distribution of labels given features remains unchanged. Our method uses a specially designed audio similarity measure as input to a set of weighted logistic regressors, which attempt to alleviate the inﬂuence of covariate shift. Results on a freely available database of sound ﬁles contributed and labeled by non-expert users, demonstrate effective automatic tagging performance.

Gordon Wichern, Makoto Yamada, Harvey D. Thornburg

Real-time Traffic

Covariate Shift | ICASSP 2010 | Signal Processing | Test Phases | Unlabeled Audio ﬁles |

claim paper

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Gordon Wichern, Makoto Yamada, Harvey D. Thornburg, Masashi Sugiyama, Andreas Spanias

Sciweavers

Automatic audio tagging using covariate shift adaptation

Covariate Shift | ICASSP 2010 | Signal Processing | Test Phases | Unlabeled Audio ﬁles |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers