Sciweavers

523 search results - page 15 / 105
» Metric Learning for Text Documents
Sort
View
ICDM
2008
IEEE
147views Data Mining» more  ICDM 2008»
15 years 5 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
SSPR
2010
Springer
14 years 9 months ago
Impact of Visual Information on Text and Content Based Image Retrieval
Abstract. Nowadays, multimedia documents composed of text and images are increasingly used, thanks to the Internet and the increasing capacity of data storage. It is more and more ...
Christophe Moulin, Christine Largeron, Mathias G&e...
SIGIR
2010
ACM
15 years 2 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
IJCAI
2003
15 years 21 days ago
Information Extraction from Tree Documents by Learning Subtree Delimiters
Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...
Boris Chidlovskii
AND
2009
14 years 9 months ago
A comprehensive evaluation methodology for noisy historical document recognition techniques
In this paper, we propose a new comprehensive methodology in order to evaluate the performance of noisy historical document recognition techniques. We aim to evaluate not only the...
Nikolaos Stamatopoulos, Georgios Louloudis, Basili...