Sciweavers

180 search results - page 7 / 36
» Iterated Document Content Classification
Sort
View
CIKM
2010
Springer
14 years 7 months ago
Fast dimension reduction for document classification based on imprecise spectrum analysis
This paper proposes an algorithm called Imprecise Spectrum Analysis (ISA) to carry out fast dimension reduction for document classification. ISA is designed based on the one-sided...
Hu Guan, Bin Xiao, Jingyu Zhou, Minyi Guo, Tao Yan...
ML
2000
ACM
124views Machine Learning» more  ML 2000»
14 years 9 months ago
Text Classification from Labeled and Unlabeled Documents using EM
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. ...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
ICPR
2002
IEEE
15 years 2 months ago
Progress in Document Reconstruction
We combine information from a language model and character image pattern matching to iteratively reduce ambiguity in document images. Combining word shape information and lists of...
A. Lawrence Spitz
BIRD
2007
Springer
168views Bioinformatics» more  BIRD 2007»
15 years 1 months ago
Ontology-Based MEDLINE Document Classification
Abstract. An increasing and overwhelming amount of biomedical information is available in the research literature mainly in the form of free-text. Biologists need tools that automa...
Fabrice Camous, Stephen Blott, Alan F. Smeaton
ICDAR
2011
IEEE
13 years 9 months ago
Identification of Indic Scripts on Torn-Documents
—Questioned Document Examination processes often encompass analysis of torn documents. To aid a forensic expert, automatic classification of content type in torn documents might ...
Sukalpa Chanda, Katrin Franke, Umapada Pal