Sciweavers

264 search results - page 14 / 53
» Clustering Documents with Active Learning Using Wikipedia
Sort
View
EMNLP
2008
15 years 1 months ago
An Analysis of Active Learning Strategies for Sequence Labeling Tasks
Active learning is well-suited to many problems in natural language processing, where unlabeled data may be abundant but annotation is slow and expensive. This paper aims to shed ...
Burr Settles, Mark Craven
ICDAR
2003
IEEE
15 years 5 months ago
Unsupervised Feature Selection Using Multi-Objective Genetic Algorithms for Handwritten Word Recognition
In this paper a methodology for feature selection in unsupervised learning is proposed. It makes use of a multiobjective genetic algorithm where the minimization of the number of ...
Marisa E. Morita, Robert Sabourin, Flávio B...
111
Voted
MLDM
2005
Springer
15 years 5 months ago
CorePhrase: Keyphrase Extraction for Document Clustering
Abstract. The ability to discover the topic of a large set of text documents using relevant keyphrases is usually regarded as a very tedious task if done by hand. Automatic keyphra...
Khaled M. Hammouda, Diego N. Matute, Mohamed S. Ka...
HIS
2003
15 years 1 months ago
Evolving Better Stoplists for Document Clustering and Web Intelligence
: Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
Mark P. Sinka, David Corne
91
Voted
ICIP
2001
IEEE
16 years 1 months ago
Image data mining from financial documents based on wavelet features
In this paper, we present a framework for clustering and classifying cheque images according to their payee-line content. The features used in the clustering and classificationpro...
Ossama El Badawy, Mahmoud R. El-Sakka, Khaled Hass...