Sciweavers

NAACL
2003
13 years 6 months ago
Simpler and More General Minimization for Weighted Finite-State Automata
Previous work on minimizing weighted finite-state automata (including transducers) is limited to particular types of weights. We present efficient new minimization algorithms th...
Jason Eisner
NAACL
2003
13 years 6 months ago
References to Named Entities: a Corpus Study
References included in multi-document summaries are often problematic. In this paper, we present a corpus study performed to derive a statistical model for the syntactic realizati...
Ani Nenkova, Kathleen McKeown
NAACL
2003
13 years 6 months ago
Semantic Language Models for Topic Detection and Tracking
In this work, we present a new semantic language modeling approach to model news stories in the Topic Detection and Tracking (TDT) task. In the new approach, we build a unigram la...
Ramesh Nallapati
NAACL
2003
13 years 6 months ago
QCS: A Tool for Querying, Clustering, and Summarizing Documents
The QCS information retrieval (IR) system is presented as a tool for querying, clustering, and summarizing document sets. QCS has been developed as a modular development framework...
Daniel M. Dunlavy, John M. Conroy, Dianne P. O'Lea...
NAACL
2003
13 years 6 months ago
Category-based Pseudowords
A pseudoword is a composite composed of two or more words chosen at random; the individual occurrences of the original words within a text are replaced by their conflation. Pseu...
Preslav Nakov, Marti A. Hearst
NAACL
2003
13 years 6 months ago
A Spoken Dialogue Interface to a Geologist's Field Assistant
We will demonstrate a spoken dialogue interface to a Geologist’s Field Assistant that is being developed as part of NASA’s Mobile Agents project. The assistant consists of a r...
John Dowding, James Hieronymus
NAACL
2003
13 years 6 months ago
Active Learning for Classifying Phone Sequences from Unsupervised Phonotactic Models
This paper describes an application of active learning methods to the classification of phone strings recognized using unsupervised phonotactic models. The only training data req...
Shona Douglas
NAACL
2003
13 years 6 months ago
Adaptation Using Out-of-Domain Corpus within EBMT
In order to boost the translation quality of EBMT based on a small-sized bilingual corpus, we use an out-of-domain bilingual corpus and, in addition, the language model of an indo...
Takao Doi, Eiichiro Sumita, Hirofumi Yamamoto
NAACL
2003
13 years 6 months ago
WordFreak: An Open Tool for Linguistic Annotation
WordFreak is a natural language annotation tool that has been designed to be easy to extend to new domains and tasks. Specifically, a plug-in architecture has been developed whic...
Thomas S. Morton, Jeremy LaCivita
NAACL
2003
13 years 6 months ago
COGEX: A Logic Prover for Question Answering
Recent TREC results have demonstrated the need for deeper text understanding methods. This paper introduces the idea of automated reasoning applied to question answering and shows...
Dan I. Moldovan, Christine Clark, Sanda M. Harabag...