Sciweavers

735 search results - page 14 / 147
» Corpora and data preparation
Sort
View
ACL
2011
14 years 1 months ago
Using Large Monolingual and Bilingual Corpora to Improve Coordination Disambiguation
Resolving coordination ambiguity is a classic hard problem. This paper looks at coordination disambiguation in complex noun phrases (NPs). Parsers trained on the Penn Treebank are...
Shane Bergsma, David Yarowsky, Kenneth Ward Church
LREC
2008
155views Education» more  LREC 2008»
14 years 11 months ago
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
This paper discusses the problem of utilising multiply annotated data in training biomedical information extraction systems. Two corpora, annotated with entities and relations, an...
Barry Haddow, Beatrice Alex
GECCO
2006
Springer
186views Optimization» more  GECCO 2006»
15 years 1 months ago
Characterizing large text corpora using a maximum variation sampling genetic algorithm
An enormous amount of information available via the Internet exists. Much of this data is in the form of text-based documents. These documents cover a variety of topics that are v...
Robert M. Patton, Thomas E. Potok
NLPRS
2001
Springer
15 years 2 months ago
Automatic Sense Tagging Using Parallel Corpora
This article reports the results of an analysis of translation equivalents in six languages from different language families, extracted from an on-line parallel corpus of George O...
Nancy Ide, Tomaz Erjavec, Dan Tufis
ACL
2003
14 years 11 months ago
Extraction and Verification of KO-OU Expressions from Large Corpora
In the Japanese language, as a predicate is placed at the end of a sentence, the content of a sentence cannot be inferred until reaching the end. However, when the content is comp...
Atsuko Kida, Eiko Yamamoto, Kyoko Kanzaki, Hitoshi...