Sciweavers

241 search results - page 5 / 49
» Detecting Co-Derivative Documents in Large Text Collections
Sort
View
90
Voted
ICMCS
2006
IEEE
189views Multimedia» more  ICMCS 2006»
15 years 3 months ago
Multiscale Edge-Based Text Extraction from Complex Images
Text that appears in images contains important and useful information. Detection and extraction of text in images have been used in many applications. In this paper, we propose a ...
Xiaoqing Liu, Jagath Samarabandu
ICDAR
2009
IEEE
14 years 7 months ago
Clutter Noise Removal in Binary Document Images
The paper presents a clutter detection and removal algorithm for complex document images. The distance transform based approach is independent of clutter's position, size, sh...
Mudit Agrawal, David S. Doermann
CORR
2006
Springer
132views Education» more  CORR 2006»
14 years 9 months ago
Navigating multilingual news collections using automatically extracted information
We are presenting a text analysis tool set that allows analysts in various fields to sieve through large collections of multilingual news items quickly and to find information that...
Ralf Steinberger, Bruno Pouliquen, Camelia Ignat
87
Voted
CIKM
2000
Springer
15 years 2 months ago
Scalable association-based text classification
Naïve Bayes (NB) classifier has long been considered a core methodology in text classification mainly due to its simplicity and computational efficiency. There is an increasing n...
Dimitris Meretakis, Dimitris Fragoudis, Hongjun Lu...
ECIR
2007
Springer
14 years 11 months ago
Entropy-Based Authorship Search in Large Document Collections
The purpose of authorship search is to identify documents written by a particular author or in a particular style in large document collections. Standard search engines match docum...
Ying Zhao, Justin Zobel