Abstract. The aim of this paper is to investigate the feasibility of predicting the gender of a text document’s author using linguistic evidence. For this purpose, term- and styl...
In this paper, we describe KES, a system that integrates text categorisation and information extraction in order to extract key elements of information from particular types of doc...
We have been handling video with supplementary documents, such as cooking programs, and are working on integration of such media. Through the integration, many applications will b...
In this paper, we propose to study the characteristics for analyzing subjective content in documents. For that purpose, we present and evaluate a novel method based on abstraction...
We describe a method to extract tabular data from web pages. Rather than just analyzing the DOM tree, we also exploit visual cues in the rendered version of the document to extrac...