We present in this paper a system for converting PDF legacy documents into structured XML format. This conversion system first extracts the different streams contained in PDF files...
We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...
Including mathematical expressions in documents can be a tiresome and difficult process. A recognition system for handwritten mathematical expressions would greatly simplify the t...
John A. Fitzgerald, Franz Geiselbrechtinger, M. Ta...
Abstract. This paper presents a multi-agent approach to gene expression analysis and illustrates the working steps using real dataset produced from a microarray experiment. The ana...
H. C. Lam, M. Vazquez, B. Juneja, Scott C. Fahrenk...
We present a novel algorithm for structural analysis of audio to detect repetitive patterns that are suitable for content-based audio information retrieval systems, since repetiti...