Sciweavers

LREC
2008
Subdomain Sensitive Statistical Parsing using Raw Corpora
Modern statistical parsers are trained on large annotated corpora (treebanks). These treebanks usually consist of sentences addressing different subdomains (e.g. sports, politics,...
Barbara Plank, Khalil Sima'an
LREC
2008
Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora
This paper presents a method for compiling a large-scale bilingual corpus from a database of movie subtitles. To create the corpus, we propose an algorithm based on Gale and Churc...
Einav Itamar, Alon Itai
LREC
2008
Can we Evaluate the Quality of Generated Text?
Evaluating the output of NLG systems is notoriously difficult, and performing assessments of text quality even more so. A range of automated and subject-based approaches to the ev...
David Hardcastle, Donia Scott
LREC
2008
Automatic Rewriting of Patient Record Narratives
Patients require access to Electronic Patient Records; however, medical language is often too difficult for patients to understand. Explaining records to patients is a time consumi...
Catalina Hallett, David Hardcastle
LREC
2008
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
This paper discusses the problem of utilising multiply annotated data in training biomedical information extraction systems. Two corpora, annotated with entities and relations, an...
Barry Haddow, Beatrice Alex