Sciweavers

170 search results - page 21 / 34
» Text Retrieval from Document Images based on N-Gram Algorith...
SIGIR
2010
ACM
15 years 3 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
CBMS
2006
IEEE
15 years 5 months ago
Biomedical Ontology MeSH Improves Document Clustering Qualify on MEDLINE Articles: A Comparison Study
Document clustering has been used for better document retrieval, document browsing, and text mining. In this paper, we investigate if biomedical ontology MeSH improves the cluster...
Illhoi Yoo, Xiaohua Hu
JCDL
2006
ACM
15 years 5 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
MM
2004
ACM
15 years 5 months ago
Multi-model similarity propagation and its application for web image retrieval
In this paper, we propose an iterative similarity propagation approach to explore the inter-relationships between Web images and their textual annotations for image retrieval. By ...
Xin-Jing Wang, Wei-Ying Ma, Gui-Rong Xue, Xing Li
RIAO
2004
15 years 1 month ago
Multilingual document clusters discovery
Cross Language Information Retrieval community has brought up search engines over multilingual corpora, and multilingual text categorization systems. In this paper, we focus on th...
Benoît Mathieu, Romaric Besançon, Chr...