Sciweavers

LREC
2008
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
This paper discusses the problem of utilising multiply annotated data in training biomedical information extraction systems. Two corpora, annotated with entities and relations, an...
Barry Haddow, Beatrice Alex
LREC
2008
Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques
In this paper, we present a comparison between two corpora acquired by means of two different techniques. The first corpus was acquired by means of the Wizard of Oz technique. A d...
David Griol, Lluís F. Hurtado, Encarna Sega...
AAAI
2006
Estimating Search Tree Size
We propose two new online methods for estimating the size of a backtracking search tree. The first method is based on a weighted sample of the branches visited by chronological ba...
Philip Kilby, John K. Slaney, Sylvie Thiéba...
ACL
2006
Subword-Based Tagging for Confidence-Dependent Chinese Word Segmentation
We proposed a subword-based tagging for Chinese word segmentation to improve the existing character-based tagging. The subword-based tagging was implemented using the maximum entr...
Ruiqiang Zhang, Gen-ichiro Kikui, Eiichiro Sumita
EMNLP
2004
Active Learning and the Total Cost of Annotation
Active learning (AL) promises to reduce the cost of annotating labeled datasets for trainable human language technologies. Contrary to expectations, when creating labeled training...
Jason Baldridge, Miles Osborne