Sciweavers

NAACL
2007
13 years 6 months ago
Speech Summarization Without Lexical Features for Mandarin Broadcast News
We present the first known empirical study on speech summarization without lexical features for Mandarin broadcast news. We evaluate acoustic, lexical and structural features as ...
Jian Zhang, Pascale Fung
NAACL
2007
13 years 6 months ago
Entity Extraction is a Boring Solved Problem - Or is it?
This paper presents empirical results that contradict the prevailing opinion that entity extraction is a boring solved problem. In particular, we consider data sets that resemble ...
Marc Vilain, Jennifer Su, Suzi Lubar
NAACL
2007
13 years 6 months ago
Agenda-Based User Simulation for Bootstrapping a POMDP Dialogue System
This paper investigates the problem of bootstrapping a statistical dialogue manager without access to training data and proposes a new probabilistic agenda-based method for simula...
Jost Schatzmann, Blaise Thomson, Karl Weilhammer, ...
NAACL
2007
13 years 6 months ago
Advances in the CMU/Interact Arabic GALE Transcription System
This paper describes the CMU/InterACT effort in developing an Arabic Automatic Speech Recognition (ASR) system for broadcast news and conversations within the GALE 2006 evaluation...
Mohamed Noamany, Thomas Schaaf, Tanja Schultz
NAACL
2007
13 years 6 months ago
K-Best Suffix Arrays
Kenneth Ward Church, Bo Thiesson, Robert Ragno
NAACL
2007
13 years 6 months ago
Combination of Statistical Word Alignments Based on Multiple Preprocessing Schemes
We present an approach to using multiple preprocessing schemes to improve statistical word alignments. We show a relative reduction of alignment error rate of about 38%.
Jakob Elming, Nizar Habash
NAACL
2007
13 years 6 months ago
An Integrated Architecture for Speech-Input Multi-Target Machine Translation
The aim of this work is to show the ability of finite-state transducers to simultaneously translate speech into multiple languages. Our proposal deals with an extension of stocha...
Alicia Pérez, Maria-Teresa González,...
NAACL
2007
13 years 6 months ago
Comparing Wikipedia and German Wordnet by Evaluating Semantic Relatedness on Multiple Datasets
We evaluate semantic relatedness measures on different German datasets showing that their performance depends on: (i) the definition of relatedness that was underlying the constr...
Torsten Zesch, Iryna Gurevych, Max Mühlhä...
NAACL
2007
13 years 6 months ago
Tagging Icelandic Text using a Linguistic and a Statistical Tagger
We describe our linguistic rule-based tagger IceTagger, and compare its tagging accuracy to the TnT tagger, a state-of-theart statistical tagger, when tagging Icelandic, a morphol...
Hrafn Loftsson
NAACL
2007
13 years 6 months ago
A High Accuracy Method for Semi-Supervised Information Extraction
Customization to specific domains of discourse and/or user requirements is one of the greatest challenges for today’s Information Extraction (IE) systems. While demonstrably eff...
Stephen Tratz, Antonio Sanfilippo