Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts the user from the actual content. Extraction of "use...
Information extraction is one of the most important techniques used in Text Mining. One of the main problems in building information extraction (IE) systems is that the knowledge ...
This paper presents a novel method for the classification of images that combines information extracted from the images and contextual information. The main hypothesis is that con...
This paper introduces a new architecture that aims to combine molecular biology data with information automatically extracted from scientific literature (using text mining techn...
We present an approach for automatically extracting an XML document structure from a conceptual data model that describes the content of a document. We use UML class diagrams as...