Sciweavers

3152 search results - page 108 / 631
» Retrieval of Partial Documents
Sort
View
ICDAR
2003
IEEE
15 years 3 months ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres
SAC
2008
ACM
14 years 9 months ago
Exploring social annotations for web document classification
Social annotation via so-called collaborative tagging describes the process by which many users add metadata in the form of unstructured keywords to shared content. In this paper,...
Michael G. Noll, Christoph Meinel
SIGIR
2003
ACM
15 years 3 months ago
Document clustering based on non-negative matrix factorization
In this paper, we propose a novel document clustering method based on the non-negative factorization of the termdocument matrix of the given document corpus. In the latent semanti...
Wei Xu, Xin Liu, Yihong Gong
DMIN
2006
150views Data Mining» more  DMIN 2006»
14 years 11 months ago
Effect of Document Representation on the Performance of Medical Document Classification
Text classification in the medical domain is a real world problem with wide applicability. This paper investigates extensively the effect of text representation approaches on the p...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
SIGIR
2009
ACM
15 years 4 months ago
The ESA retrieval model revisited
Among the retrieval models that have been proposed in the last years, the ESA model of Gabrilovich and Markovitch received much attention. The authors report on a significant imp...
Maik Anderka, Benno Stein