Document management inside an organization is a complex and broadly scoped problem. This paper approaches the technical and social issues of Intranet document management by develop...
We present SchemaScope, a system to derive Document Type Definitions and XML Schemas from corpora of sample XML documents. Tools are provided to visualize, clean, and refine exist...
In this paper, we present an automatic method for character prototyping and font type characterization in machine-printed document images at the character level. To do so, we use a ...
TextVis is a visual data mining system for document collections. Such a collection represents an application domain, and the primary goal of the system is to derive patterns that p...
David Landau, Ronen Feldman, Yonatan Aumann, Moshe...
Multi-organizational EDI message networks are complicated communication environments with various standards and technologies. The role of third-party message exchange hubs has bec...