Sciweavers

121 search results - page 9 / 25
» Pagerank based clustering of hypertext document collections
HIS
2003
14 years 11 months ago
Evolving Better Stoplists for Document Clustering and Web Intelligence
Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
Mark P. Sinka, David Corne
ICPR
2008
IEEE
15 years 10 months ago
Clustering of short commercial documents for the web
Document clustering techniques have been applied in several areas, with the web as one of the most recent and influential. Both general-purpose and text-oriented techniques exist and...
Elisabetta Binaghi, Ignazio Gallo, Moreno Carullo,...
INEX
2005
Springer
15 years 3 months ago
A Flexible Structured-Based Representation for XML Document Mining
This paper reports on the INRIA group’s approach to XML mining while participating in the INEX XML Mining track 2005. We use a flexible representation of XML documents that allo...
Anne-Marie Vercoustre, Mounir Fegas, Saba Gul, Yve...
DAS
2006
Springer
15 years 1 months ago
Efficient Word Retrieval by Means of SOM Clustering and PCA
We propose an approach for efficient word retrieval from printed documents belonging to Digital Libraries. The approach combines word image clustering (based on Self Orga...
Simone Marinai, Stefano Faini, Emanuele Marino, Gi...
ECML
2007
Springer
15 years 1 months ago
User Oriented Hierarchical Information Organization and Retrieval
In order to organize huge document collections, labeled hierarchical structures are used frequently. Users are most efficient in navigating such hierarchies if they refl...
Korinna Bade, Marcel Hermkes, Andreas Nürnber...