Sciweavers

ICPR
2004
IEEE

Selective Sampling Based on the Variation in Label Assignments

14 years 5 months ago
Selective Sampling Based on the Variation in Label Assignments
In this paper, a new selective sampling method for the active learning framework is presented. Initially, a small training set ? and a large unlabeled set ? are given. The goal is to select, one by one, the most informative objects from ?such that, after labeling by an expert, they will guarantee the best improvement in the classifier performance. Our sampling strategy relies on measuring the variation in label assignments (of the unlabeled set) between the classifier trained on ? and the classifiers trained on ? with a single unlabeled object added with all possible labels. We compare the performance of our algorithm with two traditional procedures random sampling and uncertainty sampling. We show empirically across a range of datasets that the proposed selective sampling method decreases the number of labeled instances needed to achieve the desired error for the fixed size of ?. Experimental results on toy problems and the UCI datasets are presented.
Piotr Juszczak, Robert P. W. Duin
Added 09 Nov 2009
Updated 09 Nov 2009
Type Conference
Year 2004
Where ICPR
Authors Piotr Juszczak, Robert P. W. Duin
Comments (0)