Sciweavers

2723 search results for "Learning lexicographic orders" (page 460 of 545)
LREC
2010
15 years 28 days ago
The DAD Parallel Corpora and their Uses
This paper deals with the uses of the annotations of third person singular neuter pronouns in the DAD parallel and comparable corpora of Danish and Italian texts and spoken data. ...
Costanza Navarretta
LREC
2010
15 years 28 days ago
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora
In this paper, we present a system to aid human annotation of semantic information in the scope of the project AC/DC, called corte-e-costura. This system leverages on the human an...
Diana Santos, Cristina Mota
LREC
2010
15 years 28 days ago
mwetoolkit: a Framework for Multiword Expression Identification
This paper presents the Multiword Expression Toolkit (mwetoolkit), an environment for type and language-independent MWE identification from corpora. The mwetoolkit provides a targ...
Carlos Ramisch, Aline Villavicencio, Christian Boi...
SDM
2010
SIAM
15 years 28 days ago
A Robust Decision Tree Algorithm for Imbalanced Data Sets
We propose a new decision tree algorithm, Class Confidence Proportion Decision Tree (CCPDT), which is robust and insensitive to class distribution and generates rules which are st...
Wei Liu, Sanjay Chawla, David A. Cieslak, Nitesh V...
EMNLP
2008
15 years 28 days ago
Cheap and Fast - But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks
Human linguistic annotation is crucial for many natural language processing tasks but can be expensive and time-consuming. We explore the use of Amazon's Mechanical Turk syst...
Rion Snow, Brendan O'Connor, Daniel Jurafsky, Andr...