Sciweavers

LREC
2010
113views Education» more  LREC 2010»
13 years 6 months ago
The Design of Syntactic Annotation Levels in the National Corpus of Polish
This paper presents the procedure of the syntactic annotation of the National Corpus of Polish. Syntactic annotation consists here of shallow parsing and manual post-editing of th...
Katarzyna Glowinska, Adam Przepiórkowski
LREC
2010
177views Education» more  LREC 2010»
13 years 6 months ago
IndoWordNet
India is a multilingual country where machine translation and cross lingual search are highly relevant problems. These problems require large resources- like wordnets and lexicons...
Pushpak Bhattacharyya
LREC
2010
155views Education» more  LREC 2010»
13 years 6 months ago
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation
Manual text annotation is a resource-consuming endeavor necessary for NLP systems when they target new tasks or domains for which there are no existing annotated corpora. Distribu...
Emilia Apostolova, Sean Neilan, Gary An, Noriko To...
LREC
2010
146views Education» more  LREC 2010»
13 years 6 months ago
From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers
We present the ABLE document collection, which consists of a set of annotated volumes of the Bulletin of the British Museum (Natural History). These were developed during our ongo...
Alistair Willis, David King, David Morse, Anton Di...
LREC
2010
153views Education» more  LREC 2010»
13 years 6 months ago
Homographic Ideogram Understanding Using Contextual Dynamic Network
Conventional methods for disambiguation problems have been using statistical methods with co-occurrence of words in their contexts. It seems that human-beings assign appropriate w...
Jun Okamoto, Shun Ishizaki
LREC
2010
208views Education» more  LREC 2010»
13 years 6 months ago
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
We report about tools for the extraction of German multiword expressions (MWEs) from text corpora; we extract word pairs, but also longer MWEs of different patterns, e.g. verb-nou...
Marion Weller, Ulrich Heid
LREC
2010
144views Education» more  LREC 2010»
13 years 6 months ago
Empty Categories in a Hindi Treebank
We are in the process of creating a multi-representational and multi-layered treebank for Hindi/Urdu (Palmer et al., 2009), which has three main layers: dependency structure, pred...
Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, M...
LREC
2010
163views Education» more  LREC 2010»
13 years 6 months ago
Meaning Representation: From Continuity to Discreteness
This paper presents a geometric approach to meaning representation within the framework of continuous mathematics. Meaning representation is a central issue in Natural Language Pr...
Fabienne Venant
LREC
2010
200views Education» more  LREC 2010»
13 years 6 months ago
The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms
In this paper, we present the D-TUNA corpus, which is the first semantically annotated corpus of referring expressions in Dutch. Its primary function is to evaluate and improve th...
Ruud Koolen, Emiel Krahmer
LREC
2010
209views Education» more  LREC 2010»
13 years 6 months ago
Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment
In this paper we present an experimental toolbox for automatic tree-to-tree alignment based on local classification and alignment inference. The aligner implements a recurrent arc...
Jörg Tiedemann