Sciweavers

57 search results - page 2 / 12
» Content Characterization Using Word Shape Tokens
Sort
View
ICIP
2000
IEEE
14 years 6 months ago
A Hierarchical Characterization Scheme for Image Retrieval
* In this paper, we present a study of hierarchically characterizing image content from coarse level to fine level conducted using a series of shape features as a case in point. I...
Liu Wenyin, Tao Wang, HongJiang Zhang
ICPR
2008
IEEE
14 years 6 months ago
Generic scale-space process for handwriting documents analysis
This paper presents a generic architecture for handwriting documents analysis. It covers all analysis steps from the content description of the document (layout analysis, handwrit...
Guillaume Joutel, Hubert Emptoz, Véronique ...
SIGIR
2003
ACM
13 years 10 months ago
Single n-gram stemming
Stemming can improve retrieval accuracy, but stemmers are language-specific. Character n-gram tokenization achieves many of the benefits of stemming in a language independent way,...
James Mayfield, Paul McNamee
PR
2008
146views more  PR 2008»
13 years 5 months ago
Retrieval of machine-printed Latin documents through Word Shape Coding
This paper reports a document retrieval technique that retrieves machine-printed Latin-based document images through word shape coding. Adopting the idea of image annotation, a wo...
Shijian Lu, Chew Lim Tan
CIARP
2006
Springer
13 years 9 months ago
Authorship Attribution Using Word Sequences
Authorship attribution is the task of identifying the author of a given text. The main concern of this task is to define an appropriate characterization of documents that captures ...
Rosa María Coyotl-Morales, Luis Villase&nti...