Sciweavers

SEMWEB
2009
Springer

Cross-lingual Dutch to English alignment using EuroWordNet and Dutch Wikipedia

Abstract. This paper describes a system for linking the thesaurus of the Netherlands Institute for Sound and Vision to English WordNet and DBpedia. We used EuroWordNet, a multilingual wordnet, and Dutch Wikipedia as intermediaries for the two alignments. EuroWordNet covers most of the subject terms in the thesaurus, but the organization of the cross-lingual links makes selection of the most appropriate English target term almost impossible. Using page titles, redirects, disambiguation pages, and anchor text harvested from Dutch Wikipedia gives reasonable performance on subject terms and geographical terms. Many person and organization names in the thesaurus could not be located in (Dutch or English) Wikipedia.

1 Presentation of the system

This paper describes our system for the very large cross-lingual resources (VLCR) task, which asked for an alignment between the thesaurus of the Netherlands Institute for Sound and Vision and English WordNet and (English) DBpedia, a database extracte...
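
The Wikipedia-based lookup mentioned in the abstract (page titles, redirects, disambiguation pages, and anchor text) might be sketched roughly as below. This is only an illustration, not the paper's implementation: the tiny dictionaries, the function names `normalize` and `candidates`, and the order in which the four sources are consulted are all assumptions made for the example.

```python
from typing import Dict, List

# Toy stand-ins for data harvested from Dutch Wikipedia.
# Every entry here is illustrative, not taken from the paper.
TITLES = {"fiets", "amsterdam"}                      # normalized page titles
REDIRECTS: Dict[str, str] = {"rijwiel": "fiets"}     # redirect -> target page
DISAMBIGUATION: Dict[str, List[str]] = {             # disambiguation page -> listed pages
    "bank": ["bank (financieel)", "bank (meubel)"],
}
ANCHORS: Dict[str, Dict[str, int]] = {               # anchor text -> {target page: link count}
    "tweewieler": {"fiets": 12, "motorfiets": 3},
}


def normalize(term: str) -> str:
    """Lowercase and trim a thesaurus term (assumed normalization)."""
    return term.strip().lower()


def candidates(term: str) -> List[str]:
    """Return candidate Dutch Wikipedia pages for a thesaurus term.

    Sources are consulted in an assumed order: exact title, redirect,
    disambiguation page entries, then anchor text ranked by link count.
    """
    key = normalize(term)
    found: List[str] = []
    if key in TITLES:
        found.append(key)
    if key in REDIRECTS:
        found.append(REDIRECTS[key])
    found.extend(DISAMBIGUATION.get(key, []))
    by_count = sorted(ANCHORS.get(key, {}).items(), key=lambda kv: -kv[1])
    found.extend(page for page, _ in by_count)
    # Remove duplicates while keeping the first (most preferred) occurrence.
    return list(dict.fromkeys(found))
```

A term with no title, redirect, disambiguation entry, or anchor text yields an empty list, which corresponds to the case the abstract reports for many person and organization names.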
Gosse Bouma
Added 27 May 2010
Updated 27 May 2010
Type Conference
Year 2009
Where SEMWEB
Authors Gosse Bouma