Sciweavers

16 search results - page 2 / 4
» The PAGE (Page Analysis and Ground-Truth Elements) Format Fr...
Sort
View
DGO
2010
173views Education» more  DGO 2010»
13 years 6 months ago
Digital sustainable publication of legacy parliamentary proceedings
We address the problem of publishing parliamentary proceedings in a digital sustainable manner. We give an extensive requirements analysis, and based on that propose a uniform XML...
Maarten Marx, Nelleke Aders, Anne Schuth
ICDAR
2009
IEEE
13 years 3 months ago
Analysis of Book Documents' Table of Content Based on Clustering
Table of contents (TOC) recognition has attracted a great deal of attention in recent years. After reviewing the merits and drawbacks of the existing TOC recognition methods, we h...
Liangcai Gao, Zhi Tang, Xiaofan Lin, Xin Tao, Yimi...
ICDAR
2003
IEEE
13 years 10 months ago
Detection, Extraction and Representation of Tables
We are concerned with the extraction of tables from exchange format representations of very diverse composite documents. We put forward a flexible representation scheme for comple...
Jean-Yves Ramel, Michel Crucianu, Nicole Vincent, ...
KDD
2007
ACM
152views Data Mining» more  KDD 2007»
14 years 5 months ago
A framework for classification and segmentation of massive audio data streams
In recent years, the proliferation of VOIP data has created a number of applications in which it is desirable to perform quick online classification and recognition of massive voi...
Charu C. Aggarwal
BMCBI
2007
109views more  BMCBI 2007»
13 years 5 months ago
Seahawk: moving beyond HTML in Web-based bioinformatics analysis
Background: Traditional HTML interfaces for input to and output from Bioinformatics analysis on the Web are highly variable in style, content and data formats. Combining multiple ...
Paul M. K. Gordon, Christoph W. Sensen