The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...
In my thesis I will address the problem of interoperation between information spaces on the web. We explain how this problem is different to traditional database integration scenar...
For execution of complex biological queries, data integration systems often use several intermediate data sources because the domain coverage of individual sources is limited. Qual...
Clustering text data streams is an important issue in the data mining community and has a number of applications such as news group filtering, text crawling, document organiza...
Background: Visualization of DNA microarray data in two- or three-dimensional spaces is an important exploratory analysis step in order to detect quality issues or to generate new ...
Christoph Bartenhagen, Hans-Ulrich Klein, Christia...