Sciweavers

3090 search results - page 86 / 618
» Document Processing with LinkIT
Sort
View
ICDAR
2003
IEEE
15 years 3 months ago
A Bilingual OCR for Hindi-Telugu Documents and its Applications
This paper describes the character recognition process from printed documents containing Hindi and Telugu text. Hindi and Telugu are among the most popular languages in India. The...
C. V. Jawahar, M. N. S. S. K. Pavan Kumar, S. S. R...
CICLING
2001
Springer
15 years 2 months ago
Chi-Square Classifier for Document Categorization
The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply...
Mikhail Alexandrov, Alexander F. Gelbukh, George L...
COLING
2002
14 years 9 months ago
An XML-based Document Suite
We report about the current state of development of a document suite and its applications. This collection of tools for the flexible and robust processing of documents in German i...
Dietmar Rösner, Manuela Kunze
SAC
2011
ACM
14 years 21 days ago
Towards discovering criminal communities from textual data
In many criminal cases, forensically collected data contain valuable information about a suspect’s social networks. An investigator often has to manually extract information fro...
Rabeah Al-Zaidy, Benjamin C. M. Fung, Amr M. Youss...
SIGIR
2011
ACM
14 years 20 days ago
Active learning to maximize accuracy vs. effort in interactive information retrieval
We consider an interactive information retrieval task in which the user is interested in finding several to many relevant documents with minimal effort. Given an initial documen...
Aibo Tian, Matthew Lease