Sciweavers

121 search results - page 17 / 25
» Pagerank based clustering of hypertext document collections
Sort
View
AIIA
2009
Springer
15 years 4 months ago
Mathematical Symbol Indexing
This paper addresses the indexing and retrieval of mathematical symbols from digitized documents. The proposed approach exploits Shape Contexts (SC) to describe the shape of mathe...
Simone Marinai, Beatrice Miotti, Giovanni Soda
SDM
2008
SIAM
140views Data Mining» more  SDM 2008»
14 years 11 months ago
Creating a Cluster Hierarchy under Constraints of a Partially Known Hierarchy
Although clustering under constraints is a current research topic, a hierarchical setting, in which a hierarchy of clusters is the goal, is usually not considered. This paper trie...
Korinna Bade, Andreas Nürnberger
SAC
2002
ACM
14 years 9 months ago
Benefits of document maps for text access in knowledge management: a comparative study
Analyzing, structuring and organizing documented knowledge is an important aspect of knowledge management. In order to ease the access to text collections, in literature so-called...
Andreas Becks, Christian Seeling, Ralf Minkenberg
JCDL
2003
ACM
160views Education» more  JCDL 2003»
15 years 2 months ago
Automatic Document Metadata Extraction Using Support Vector Machines
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadat...
Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zh...
WWW
2008
ACM
15 years 10 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev