Sciweavers

LREC
2008
131views Education» more  LREC 2008»
13 years 6 months ago
Learning Morphology with Morfette
Morfette is a modular, data-driven, probabilistic system which learns to perform joint morphological tagging and lemmatization from morphologically annotated corpora. The system i...
Grzegorz Chrupala, Georgiana Dinu, Josef van Genab...
LREC
2008
120views Education» more  LREC 2008»
13 years 6 months ago
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
We introduce the corpus of United States Congressional bills from 1947 to 1998 for use by language research communities. The U.S. Policy Agenda Legislation Corpus Volume 1 (USPALC...
Stephen Purpura, John Wilkerson, Dustin Hillard
LREC
2008
100views Education» more  LREC 2008»
13 years 6 months ago
Ensuring Semantic Interoperability on Lexical Resources
In this paper, we describe a unifying approach to tackle data heterogeneity issues for lexica and related resources. We present LEXUS, our software that implements the Lexical Mar...
Marc Kemps-Snijders, Claus Zinn, Jacquelijn Ringer...
LREC
2008
155views Education» more  LREC 2008»
13 years 6 months ago
Exploring and Enriching a Language Resource Archive via the Web
The "download first, then process paradigm" is still the predominant working method amongst the research community. The web-based paradigm, however, offers many advantag...
Marc Kemps-Snijders, Alexander Klassmann, Claus Zi...
LREC
2008
106views Education» more  LREC 2008»
13 years 6 months ago
A Corpus for Cross-Document Co-reference
This paper describes a newly created text corpus of news articles that has been annotated for cross-document co-reference. Being able to robustly resolve references to entities ac...
David Day, Janet Hitzeman, Michael L. Wick, Keith ...
LREC
2008
95views Education» more  LREC 2008»
13 years 6 months ago
Application of Resource-based Machine Translation to Real Business Scenes
As huge quantities of documents have become available, services using natural language processing technologies trained by huge corpora have emerged, such as information retrieval ...
Hitoshi Isahara, Masao Utiyama, Eiko Yamamoto, Aki...
LREC
2008
158views Education» more  LREC 2008»
13 years 6 months ago
Linguistic Description and Automatic Extraction of Definitions from German Court Decisions
This paper discusses the use of computational linguistic technology to extract definitions from a large corpus of German court decisions. We present a corpus-based survey of defin...
Stephan Walter
LREC
2008
111views Education» more  LREC 2008»
13 years 6 months ago
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
Air traffic control (ATC) is based on voice communication between pilots and controllers and uses a highly task and domain specific language. Due to this very reason, spoken langu...
Konrad Hofbauer, Stefan Petrik, Horst Hering
LREC
2008
137views Education» more  LREC 2008»
13 years 6 months ago
The TextPro Tool Suite
We present TextPro, a suite of modular Natural Language Processing (NLP) tools for analysis of Italian and English texts. The suite has been designed so as to integrate and reuse ...
Emanuele Pianta, Christian Girardi, Roberto Zanoli
LREC
2008
128views Education» more  LREC 2008»
13 years 6 months ago
Emotion Recognition from Speech: Stress Experiment
The goal of this work is to introduce an architecture to automatically detect the amount of stress in the speech signal close to real time. For this an experimental setup to recor...
Stefan Scherer, Hansjörg Hofmann, Malte Lampm...