Sciweavers

CAISE
2006
Springer
13 years 10 months ago
Supporting Customised Collaboration over Shared Document Repositories
The development of collaborative environments that not only manage information and communication, but also support the actual work processes of organisations is very important. XML...
Claudia-Lavinia Ignat, Moira C. Norrie
DOCENG
2007
ACM
13 years 10 months ago
The Mars project: PDF in XML
The Portable Document Format (PDF) is a page-oriented, graphically rich document format based on PostScript semantics. It is the file format underlying the Adobe...
Matthew R. B. Hardy
ICDAR
2007
IEEE
13 years 10 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
ICA
2007
Springer
13 years 10 months ago
Text Clustering on Latent Thematic Spaces: Variants, Strengths and Weaknesses
Deriving a thematically meaningful partition of an unlabeled document corpus is a challenging task. In this context, the use of document representations based on latent thematic ge...
Xavier Sevillano, Germán Cobo, Francesc Al...
SIGIR
2010
ACM
13 years 10 months ago
Self-taught hashing for fast similarity search
The ability of fast similarity search at large scale is of great importance to many Information Retrieval (IR) applications. A promising way to accelerate similarity search is sem...
Dell Zhang, Jun Wang, Deng Cai, Jinsong Lu
DOCENG
2007
ACM
13 years 10 months ago
Mapping paradigm for document transformation
Since the advent of XML, the ability to transform documents using transformation languages such as XSLT has become an important challenge. However, writing a transformation script...
Arnaud Blouin, Olivier Beaudoux
DOCENG
2007
ACM
13 years 10 months ago
SALT: a semantic approach for generating document representations
The structure of a document has an important influence on the perception of its content. Considering scientific publications, we can affirm that by making use of the ordinary line...
Tudor Groza, Alexander Schutz, Siegfried Handschuh
DOCENG
2007
ACM
13 years 10 months ago
Logical document conversion: combining functional and formal knowledge
We present in this paper a method for document layout analysis based on identifying the function of document elements (what they do). This approach is orthogonal and complementary...
Hervé Déjean, Jean-Luc Meunier
DOCENG
2007
ACM
13 years 10 months ago
Genre driven multimedia document production by means of incremental transformation
Genre, like layout, is an important factor in effective communication, and automated tools which assist in genre compliance are thus of considerable value. Genres are reusable met...
Marc Nanard, Jocelyne Nanard, Peter R. King, Ludov...
DOCENG
2007
ACM
13 years 10 months ago
Speculative document evaluation
Optimisation of real world Variable Data printing (VDP) documents is a difficult problem because the interdependencies between layout functions may drastically reduce the number o...
Alexander J. Macdonald, David F. Brailsford, Steve...