Active Learning With Sampling by Uncertainty and Density for Data Annotations

12 years 11 months ago

Download www.nlplab.cn

To solve the knowledge bottleneck problem, active learning has been widely used for its ability to automatically select the most informative unlabeled examples for human annotation. One of the key enabling techniques of active learning is uncertainty sampling, which uses one classifier to identify unlabeled examples with the least confidence. Uncertainty sampling often presents problems when outliers are selected. To solve the outlier problem, this paper presents two techniques, sampling by uncertainty and density (SUD) and density-based re-ranking. Both techniques prefer not only the most informative example in terms of uncertainty criterion, but also the most representative example in terms of density criterion. Experimental results of active learning for word sense disambiguation and text classification tasks using six real-world evaluation data sets demonstrate the effectiveness of the proposed methods.

Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthe

Real-time Traffic

Informative Unlabeled Examples | Knowledge Bottleneck Problem | Software Engineering | TASLP 2010 | Unlabeled Examples |

claim paper

» Uncertainty sampling and transductive experimental design for active dual supervision

» Theres no Data like More Data Revisiting the Impact of Data Size on a Classification Task

» Adaptive Informative Sampling for Active Learning

» Active Learning with Adaptive Heterogeneous Ensembles

» Dual Strategy Active Learning

» Semiautomatic video annotation based on active learning with multiple complementary predic...

» Mobile Social Software with Context Awareness and Data Uncertainty for TechnologyEnhanced ...

» Memorybased active learning for French broadcast news

Post Info
More Details (n/a)

Added	21 May 2011
Updated	21 May 2011
Type	Journal
Year	2010
Where	TASLP
Authors	Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthew Y. Ma

Comments (0)

Sciweavers

Active Learning With Sampling by Uncertainty and Density for Data Annotations

Informative Unlabeled Examples | Knowledge Bottleneck Problem | Software Engineering | TASLP 2010 | Unlabeled Examples |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers