This work addresses the soundtrack indexing of multimedia documents. We present and merge two audio classification tools that we have developed. The first one, a speech music clas...
Named Entity (NE) recognition is a task in which proper nouns and numerical information are extracted from documents and are classified into categories such as person, organizatio...
We present a grand challenge to build a corpus that will include all of the world's languages, in a consistent structure that permits large-scale cross-linguistic processing,...
This paper presents the novel task of best topic word selection, that is, the selection of the topic word that best labels a given topic, as a means of enhancing the inte...
Jey Han Lau, David Newman, Sarvnaz Karimi, Timothy...
Query-oriented summarization aims to extract an informative summary from a document collection for a given query. It helps users grasp the main information rel...