Sciweavers

8795 search results - page 129 / 1759
» Measuring Generality of Documents
KDD
2008
ACM
Data Mining
16 years 4 months ago
Probabilistic latent semantic visualization: topic model for visualizing documents
We propose a visualization method based on a topic model for discrete data such as documents. Unlike conventional visualization methods based on pairwise distances such as multi-d...
Tomoharu Iwata, Takeshi Yamada, Naonori Ueda
WSDM
2010
ACM
Data Mining
16 years 1 months ago
Leveraging Temporal Dynamics of Document Content in Relevance Ranking
Many web documents are dynamic, with content changing in varying amounts at varying frequencies. However, current document search algorithms have a static view of the document con...
Jonathan L. Elsas, Susan T. Dumais
DOCENG
2009
ACM
15 years 10 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
DIAL
2006
IEEE
Image Analysis
15 years 10 months ago
Automatic Content-based Indexing of Digital Documents through Intelligent Processing Techniques
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly, and the need for flexible, sophisticated document manipulation tools is growi...
Floriana Esposito, Stefano Ferilli, Teresa Maria A...
DOCENG
2006
ACM
15 years 10 months ago
Describing multistructured XML documents by means of delay nodes
Multistructured documents are documents whose structure is composed of a set of concurrent hierarchical structures. In this paper, we propose a new model of multistructured docume...
Jacques Le Maitre