Search Sciweavers | Sciweavers

195

ICDAR
2005
IEEE

97views Document Analysis» more ICDAR 2005»

16 years 1 months ago

Printing and scanning of text documents introduces degradations to the characters which can be modeled. Interestingly, certain combinations of the parameters that govern the degra...

Elisa H. Barney Smith, Tim L. Andersen

claim paper

Read More »

179

click to vote

ICDAR
2003
IEEE

121views Document Analysis» more ICDAR 2003»

Generation of Synthetic Training Data for an HMM-based Handwriting Recognition System

16 years 22 days ago

Download www.iam.unibe.ch

A perturbation model for generating synthetic textlines from existing cursively handwritten lines of text produced by human writers is presented. Our purpose is to improve the per...

Tamás Varga, Horst Bunke

claim paper

Read More »

195

click to vote

TREC
2007

106views Information Technology» more TREC 2007»

The Robert Gordon University at the Opinion Retrieval Task of the 2007 TREC Blog Track

15 years 8 months ago

Download trec.nist.gov

Abstract. The Robert Gordon University (RGU) participated in the Opinion Retrieval Task of the Trec 2007 Blog Track. At the core of the system we developed is a set of training doc...

Rahman Mukras, Nirmalie Wiratunga, Robert Lothian

claim paper

Read More »

228

click to vote

TAL
2010
Springer

127views Natural Language Processing» more TAL 2010»

Summarization as Feature Selection for Document Categorization on Small Datasets

15 years 5 months ago

Download users.dsic.upv.es

Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...

Emmanuel Anguiano-Hernández, Luis Villase&n...

claim paper

Read More »

205

click to vote

ECIR
2003
Springer

108views Information Technology» more ECIR 2003»

Hierarchical Classification of HTML Documents with WebClassII

15 years 8 months ago

Download www.di.uniba.it

This paper describes a new method for the classification of a HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...

Michelangelo Ceci, Donato Malerba

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers