Sciweavers

» Document Classification Using Multiword Features
IJCAI
2003
14 years 11 months ago
Learning to Classify Texts Using Positive and Unlabeled Data
In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a ...
Xiaoli Li, Bing Liu
CICLING
2010
Springer
14 years 4 months ago
An Empirical Study on the Feature's Type Effect on the Automatic Classification of Arabic Documents
The Arabic language is a highly flexional and morphologically very rich language. It presents serious challenges to the automatic classification of documents, one of which is deter...
Saeed Raheel, Joseph Dichy
ICDAR
2009
IEEE
15 years 4 months ago
A Novel Feature Extraction and Classification Methodology for the Recognition of Historical Documents
In this paper, we present a methodology for off-line character recognition that mainly focuses on handling the difficult cases of historical fonts and styles. The proposed methodo...
Georgios Vamvakas, Basilios Gatos, Stavros J. Pera...
ICMLA
2009
14 years 7 months ago
Knowledge Transfer for Feature Generation in Document Classification
One important problem in machine learning is how to extract knowledge from prior experience, then transfer and apply this knowledge in new learning tasks. To address this problem, ...
Jian Zhang, Shobhit S. Shakya
ICDAR
1999
IEEE
15 years 1 month ago
Document Image Layout Comparison and Classification
This paper describes features and methods for document image comparison and classification at the spatial layout level. The methods are useful for visual similarity based document...
Jianying Hu, Ramanujan S. Kashi, Gordon T. Wilfong