I present a method of identifying cognates in the vocabularies of related languages. I show that a measure of phonetic similarity based on multivalued features performs better tha...
Lexical selection is a significant problem for wide-coverage machine translation: depending on the context, a given source language word can often be translated into different targ...
This paper proposes a method that automatically acquires the semantic attributes (SAs) of user-defined words. Applying this method to the compilation of a user dictionary targeting...
In this paper we explore an unsupervised approach to classifying video content by analyzing the corresponding subtitles. The proposed method is based on the WordNet lexical database a...
Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented corporates...