NAACL
2003
Exploiting Diversity for Answering Questions
We describe initial experiments in combining the output of question answering systems using data from the 2002 TREC Question Answering task. We explore several distance-based comb...
John D. Burger, John C. Henderson
NAACL
2003
Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures
Sources of training data suitable for language modeling of conversational speech are limited. In this paper, we show how training data can be supplemented with text from the web ...
Ivan Bulyko, Mari Ostendorf, Andreas Stolcke
NAACL
2003
Towards Emotion Prediction in Spoken Tutoring Dialogues
Human tutors detect and respond to student emotional states, but current machine tutors do not. Our preliminary machine learning experiments involving transcription, emotion annot...
Diane J. Litman, Katherine Forbes, Scott Silliman
NAACL
2003
Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics
Following the recent adoption by the machine translation community of automatic evaluation using the BLEU/NIST scoring process, we conduct an in-depth study of a similar idea for ...
Chin-Yew Lin, Eduard H. Hovy
NAACL
2003
An Analysis of Clarification Dialogue for Question Answering
We examine clarification dialogue, a mechanism for refining user questions with follow-up questions, in the context of open domain Question Answering systems. We develop an algori...
Marco De Boni, Suresh Manandhar
NAACL
2003
Word Alignment with Cohesion Constraint
We present a syntax-based constraint for word alignment, known as the cohesion constraint. It requires disjoint English phrases to be mapped to non-overlapping intervals in the Fr...
Dekang Lin, Colin Cherry
NAACL
2003
Factored Language Models and Generalized Parallel Backoff
We introduce factored language models (FLMs) and generalized parallel backoff (GPB). An FLM represents words as bundles of features (e.g., morphological classes, stems, data-drive...
Jeff Bilmes, Katrin Kirchhoff
NAACL
2003
Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment
We address the text-to-text generation problem of sentence-level paraphrasing — a phenomenon distinct from and more difficult than word- or phrase-level paraphrasing. Our appro...
Regina Barzilay, Lillian Lee
NAACL
2003
Japanese Named Entity Extraction with Redundant Morphological Analysis
Named Entity (NE) extraction is an important subtask of document processing such as information extraction and question answering. A typical method used for NE extraction of Japan...
Masayuki Asahara, Yuji Matsumoto
NAACL
2003
TIPS: A Translingual Information Processing System
Searching online information is increasingly a daily activity for many people. The multilinguality of online content is also increasing (e.g. the proportion of English web users, ...
Yaser Al-Onaizan, Radu Florian, Martin Franz, Hany...