In this paper we describe a prototypical system that is able to generate document annotations based on eye movement data. Document parts can be annotated as being read or skimmed....
Georg Buscher, Andreas Dengel, Ludger van Elst, Fl...
While earlier work provided a partial view of users’ preferences about manuals, for most users in most work contexts the important question remains open: What do users want in d...
Cheap and versatile cameras make it possible to easily and quickly capture a wide variety of documents. However, low resolution cameras present a challenge to OCR because it is vi...
Charles E. Jacobs, Patrice Y. Simard, Paul A. Viol...
Structure analysis of table form documents is an important issue because a printed document and even an electronic document do not provide logical structural information but merely...
We describe a method to extract tabular data from web pages. Rather than just analyzing the DOM tree, we also exploit visual cues in the rendered version of the document to extrac...