Active Learning and the Total Cost of Annotation

13 years 1 months ago
Active Learning and the Total Cost of Annotation
Active learning (AL) promises to reduce the cost of annotating labeled datasets for trainable human language technologies. Contrary to expectations, when creating labeled training material for HPSG parse selection and later reusing it with other models, gains from AL may be negligible or even negative. This has serious implications for using AL, showing that additional cost-saving strategies may need to be adopted. We explore one such strategy: using a model during annotation to automate some of the decisions. Our best results show an 80% reduction in annotation cost compared with labeling randomly selected data with a single model.
Jason Baldridge, Miles Osborne
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2004
Authors Jason Baldridge, Miles Osborne
Comments (0)