Sciweavers

942 search results - page 2 / 189
» Comparing Automatic and Human Evaluation of NLG Systems
Sort
View
INLG
2010
Springer
13 years 2 months ago
Finding Common Ground: Towards a Surface Realisation Shared Task
In many areas of NLP reuse of utility tools such as parsers and POS taggers is now common, but this is still rare in NLG. The subfield of surface realisation has perhaps come clos...
Anja Belz, Mike White, Josef van Genabith, Deirdre...
ACL
2001
13 years 6 months ago
Using a Randomised Controlled Clinical Trial to Evaluate an NLG System
The STOP system, which generates personalised smoking-cessation letters, was evaluated by a randomised controlled clinical trial. We believe this is the largest and perhaps most r...
Ehud Reiter, Roma Robertson, A. Scott Lennox, Lies...
AI
2011
Springer
12 years 8 months ago
Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech
Abstract. Speech is a complex process that requires control and coordination of articulation, breathing, voicing, and prosody. Dysarthria is a manifestation of an inability to cont...
Kinfe Tadesse Mengistu, Frank Rudzicz
NAACL
2007
13 years 6 months ago
Probabilistic Generation of Weather Forecast Texts
This paper reports experiments in which pCRU — a generation framework that combines probabilistic generation methodology with a comprehensive model of the generation space — i...
Anja Belz
LREC
2010
148views Education» more  LREC 2010»
13 years 6 months ago
Mining the Correlation between Human and Automatic Evaluation at Sentence Level
Automatic evaluation metrics are fast and cost-effective measurements of the quality of a Machine Translation (MT) system. However, as humans are the end-user of MT output, human ...
Yanli Sun