Uncovering the Latent Structures of Crowd Labeling

5 years 7 days ago
Uncovering the Latent Structures of Crowd Labeling
Crowdsourcing provides a new way to distribute enormous tasks to a crowd of annotators. The divergent knowledge background and personal preferences of crowd annotators lead to noisy (or even inconsistent) answers to a same question. However, diverse labels provide us information about the underlying structures of tasks and annotators. This paper proposes latent-class assumptions for learning-from-crowds models, that is, items can be separated into several latent classes and workers’ annotating behaviors may differ among different classes. We propose a nonparametric model to uncover the latent classes, and also extend the state-of-the-art minimax entropy estimator to learn latent structures. Experimental results on both synthetic data and real data collected from Amazon Mechanical Turk demonstrate our methods can disclose interesting and meaningful latent structures, and incorporating latent class structures can also bring significant improvements on ground truth label recovery for...
Tian Tian, Jun Zhu
Added 16 Apr 2016
Updated 16 Apr 2016
Type Journal
Year 2015
Authors Tian Tian, Jun Zhu
Comments (0)