Sciweavers

LREC
2010
164views Education» more  LREC 2010»
13 years 6 months ago
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
After a brief overview of the elements of modern grid computing, a number of common use-cases of natural language processing tasks running on the grid are presented, notably corpu...
Jan Jona Javorsek, Tomaz Erjavec
LREC
2010
136views Education» more  LREC 2010»
13 years 6 months ago
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows
Many contemporary language technology systems are characterized by long pipelines of tools with complex dependencies. Too often, these workflows are implemented by ad hoc scripts;...
Jonathan H. Clark, Alon Lavie
LREC
2010
208views Education» more  LREC 2010»
13 years 6 months ago
A Case Study on Interoperability for Language Resources and Applications
This paper reports our experience when integrating differ resources and services into a grid environment. The use case we address implies the deployment of several NLP application...
Marta Villegas, Núria Bel, Santiago Bel, V&...
LREC
2010
176views Education» more  LREC 2010»
13 years 6 months ago
There's no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
In the paper we investigate the impact of data size on a Word Sense Disambiguation task (WSD). We question the assumption that the knowledge acquisition bottleneck, which is known...
Ines Rehbein, Josef Ruppenhofer
LREC
2010
174views Education» more  LREC 2010»
13 years 6 months ago
Enhancing Language Resources with Maps
We will look at how maps can be integrated in research resources, such as language databases and language corpora. By using maps, search results can be illustrated in a way that i...
Janne Bondi Johannessen, Kristin Hagen, Anders N&o...
LREC
2010
133views Education» more  LREC 2010»
13 years 6 months ago
Towards a Learning Approach for Abbreviation Detection and Resolution
The explosion of biomedical literature and with it the -uncontrolled- creation of abbreviations presents some special challenges for both human readers and computer applications. ...
Klaar Vanopstal, Bart Desmet, Véronique Hos...
LREC
2010
150views Education» more  LREC 2010»
13 years 6 months ago
Achieving Domain Specificity in SMT without Overt Siloing
We examine pooling data as a method for improving Statistical Machine Translation (SMT) quality for narrowly defined domains, such as data for a particular company or public entit...
William D. Lewis, Chris Wendt, David Bullock
LREC
2010
186views Education» more  LREC 2010»
13 years 6 months ago
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
This paper presents the EPAC corpus which is composed by a set of 100 hours of conversational speech manually transcribed and by the outputs of automatic tools (automatic segmenta...
Yannick Estève, Thierry Bazillon, Jean-Yves...
LREC
2010
187views Education» more  LREC 2010»
13 years 6 months ago
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents
We describe the compilation of a large corpus of French-Dutch sentence pairs from official Belgian documents which are available in the online version of the publication Belgisch ...
Tom Vanallemeersch