Sciweavers

DAS
2008
Springer
13 years 6 months ago
An End-to-End Administrative Document Analysis System
This paper presents an end-to-end administrative document analysis system. This system uses case-based reasoning in order to process documents from known and unknown classes. For ...
Hatem Hamza, Yolande Belaïd, Abdel Belaï...
VLDB
1995
ACM
89views Database» more  VLDB 1995»
13 years 8 months ago
Document Management as a Database Problem
abstract of invited paper Document management has many aspects, among them acquisition, storage, retrieval, presentation and processing of documents (work flow). These aspects will...
Rudolf Bayer
PARA
2000
Springer
13 years 8 months ago
Parallel and Distributed Document Overlap Detection on the Web
Proliferation of digital libraries plus availability of electronic documents from the Internet have created new challenges for computer science researchers and professionals. Docum...
Krisztián Monostori, Arkady B. Zaslavsky, H...
CIKM
2006
Springer
13 years 8 months ago
Multi-evidence, multi-criteria, lazy associative document classification
We present a novel approach for classifying documents that combines different pieces of evidence (e.g., textual features of documents, links, and citations) transparently, through...
Adriano Veloso, Wagner Meira Jr., Marco Cristo, Ma...
APWEB
2006
Springer
13 years 8 months ago
Sample Sizes for Query Probing in Uncooperative Distributed Information Retrieval
The goal of distributed information retrieval is to support effective searching over multiple document collections. For efficiency, queries should be routed to only those collectio...
Milad Shokouhi, Falk Scholer, Justin Zobel
AIMSA
2006
Springer
13 years 8 months ago
A Proposal for Annotation, Semantic Similarity and Classification of Textual Documents
Abstract. In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the docume...
Emmanuel Nauer, Amedeo Napoli
DOCENG
2007
ACM
13 years 8 months ago
A model for mapping between printed and digital document instances
The first steps towards bridging the paper-digital divide have been achieved with the development of a range of technologies that allow printed documents to be linked to digital c...
Nadir Weibel, Moira C. Norrie, Beat Signer
CICLING
2010
Springer
13 years 8 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
SIGIR
1993
ACM
13 years 9 months ago
A Model of Information Retrieval Based on a Terminological Logic
According to the logical model of Information Retrieval (IR), the task of IR can be described as the extraction, from a given document base, of those documents d that, given a que...
Carlo Meghini, Fabrizio Sebastiani, Umberto Stracc...
PKDD
1998
Springer
113views Data Mining» more  PKDD 1998»
13 years 9 months ago
Text Mining at the Term Level
Knowledge Discovery in Databases (KDD) focuses on the computerized exploration of large amounts of data and on the discovery of interesting patterns within them. While most work on...
Ronen Feldman, Moshe Fresko, Yakkov Kinar, Yehuda ...