Sciweavers

99
Voted
LREC
2010
199views Education» more  LREC 2010»
14 years 12 months ago
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure
The Internet is an ever growing source of information stored in documents of different languages. Hence, cross-lingual resources are needed for more and more NLP applications. Thi...
Lukas Michelbacher, Florian Laws, Beate Dorow, Ulr...
42
Voted
LREC
2010
142views Education» more  LREC 2010»
14 years 12 months ago
The South African Human Language Technologies Audit
Human language technologies (HLT) can play a vital role in bridging the digital divide and thus the HLT field has been recognised as a priority area by the South African governmen...
Aditi Sharma Grover, Gerhard van Huyssteen
LREC
2010
148views Education» more  LREC 2010»
14 years 12 months ago
A Morphological Processor Based on Foma for Biscayan (a Basque dialect)
We present a new morphological processor for Biscayan, a dialect of Basque, developed on the description of the morphology of standard Basque. The database for the standard morpho...
Iñaki Alegria, Garbiñe Aranbarri, Kl...
60
Voted
LREC
2010
160views Education» more  LREC 2010»
14 years 12 months ago
STeP-1: A Set of Fundamental Tools for Persian Text Processing
Many NLP applications need fundamental tools to convert the input text into appropriate form or format and extract the primary linguistic knowledge of words and sentences. These t...
Mehrnoush Shamsfard, Hoda Sadat Jafari, Mahdi Ilbe...
LREC
2010
153views Education» more  LREC 2010»
14 years 12 months ago
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level
Most of the research on the extraction of idiomatic multiword expressions (MWEs) focused on the acquisition of MWE types. In the present work we investigate whether a text instanc...
Fabienne Fritzinger, Marion Weller, Ulrich Heid
LREC
2010
115views Education» more  LREC 2010»
14 years 12 months ago
A General Methodology for Equipping Ontologies with Time
In the first part of this paper, we present a framework for enriching arbitrary upper or domain-specific ontologies with a concept of time. To do so, we need the notion of a time ...
Hans-Ulrich Krieger
63
Voted
LREC
2010
170views Education» more  LREC 2010»
14 years 12 months ago
Construction of Text Summarization Corpus for the Credibility of Information on the Web
Recently, the credibility of information on the Web has become an important issue. In addition to telling about content of source documents, indicating how to interpret the conten...
Masahiro Nakano, Hideyuki Shibuki, Rintaro Miyazak...
LREC
2010
166views Education» more  LREC 2010»
14 years 12 months ago
Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries
Automatically translating natural language into machine-readable instructions is one of major interesting and challenging tasks in Natural Language (NL) Processing. This problem c...
Alessandra Giordani, Alessandro Moschitti
69
Voted
LREC
2010
150views Education» more  LREC 2010»
14 years 12 months ago
A Dataset for Assessing Machine Translation Evaluation Metrics
We describe a dataset containing 16,000 translations produced by four machine translation systems and manually annotated for quality by professional translators. This dataset can ...
Lucia Specia, Nicola Cancedda, Marc Dymetman
LREC
2010
168views Education» more  LREC 2010»
14 years 12 months ago
Diabase: Towards a Diachronic BLARK in Support of Historical Studies
We present our ongoing work on language technology-based e-science in the humanities, social sciences and education, with a focus on text-based research in the historical sciences...
Lars Borin, Markus Forsberg, Dimitrios Kokkinakis