Sciweavers

637 search results - page 4 / 128
» Training and documentation
ICDAR
2005
IEEE
15 years 3 months ago
Text Degradations and OCR Training
Printing and scanning of text documents introduces degradations to the characters which can be modeled. Interestingly, certain combinations of the parameters that govern the degra...
Elisa H. Barney Smith, Tim L. Andersen
ICDAR
2003
IEEE
15 years 2 months ago
Generation of Synthetic Training Data for an HMM-based Handwriting Recognition System
A perturbation model for generating synthetic textlines from existing cursively handwritten lines of text produced by human writers is presented. Our purpose is to improve the per...
Tamás Varga, Horst Bunke
TREC
2007
14 years 10 months ago
The Robert Gordon University at the Opinion Retrieval Task of the 2007 TREC Blog Track
Abstract. The Robert Gordon University (RGU) participated in the Opinion Retrieval Task of the TREC 2007 Blog Track. At the core of the system we developed is a set of training doc...
Rahman Mukras, Nirmalie Wiratunga, Robert Lothian
TAL
2010
Springer
14 years 8 months ago
Summarization as Feature Selection for Document Categorization on Small Datasets
Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...
Emmanuel Anguiano-Hernández, Luis Villaseñ...
ECIR
2003
Springer
14 years 11 months ago
Hierarchical Classification of HTML Documents with WebClassII
This paper describes a new method for the classification of an HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...
Michelangelo Ceci, Donato Malerba