Search Sciweavers | Sciweavers

167

ECIR
2008
Springer

103views Information Technology» more ECIR 2008»

Semi-supervised Document Classification with a Mislabeling Error Model

15 years 6 months ago

Abstract. This paper investigates a new extension of the Probabilistic Latent Semantic Analysis (PLSA) model [6] for text classification where the training set is partially labeled...

Anastasia Krithara, Massih-Reza Amini, Jean-Michel...

claim paper

Read More »

194

click to vote

ICDAR
2003
IEEE

113views Document Analysis» more ICDAR 2003»

Word Segmentation of Handwritten Dates in Historical Documents by Combining Semantic A-Priori-Knowledge with Local Features

15 years 10 months ago

Download www.cse.salford.ac.uk

The recognition of script in historical documents requires suitable techniques in order to identify single words. Segmentation of lines and words is a challenging task because lin...

Markus Feldbach, Klaus D. Tönnies

claim paper

Read More »

137

click to vote

WIDM
2003
ACM

99views Internet Technology» more WIDM 2003»

Clustering documents in a web directory

15 years 10 months ago

Download sra.itc.it

Hierarchical categorization of documents is a task receiving growing interest due to the widespread proliferation of topic hierarchies for text documents. The worst problem of hie...

Giordano Adami, Paolo Avesani, Diego Sona

claim paper

Read More »

117

click to vote

ICDAR
2005
IEEE

122views Document Analysis» more ICDAR 2005»

Language Identification of Character Images Using Machine Learning Techniques

15 years 11 months ago

Download datf.iis.sinica.edu.tw

In this paper, we propose a new approach for identifying the language type of character images. We do this by classifying individual character images to determine the language bou...

Ying-Ho Liu, Fu Chang, Chin-Chin Lin

claim paper

Read More »

240

click to vote

SAC
2010
ACM

187views Applied Computing» more SAC 2010»

Enhancing document structure analysis using visual analytics

16 years 8 days ago

Download infovis.uni-konstanz.de

During the last decade national archives, libraries, museums and companies started to make their records, books and ﬁles electronically available. In order to allow eﬃcient ac...

Andreas Stoffel, David Spretke, Henrik Kinnemann, ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers