Sciweavers

469 search results - page 12 / 94
» On Compressing the Textual Web
Sort
View
SIGIR
2008
ACM
14 years 9 months ago
A large time-aware web graph
We describe the techniques developed to gather and distribute in a highly compressed, yet accessible, form a series of twelve snapshot of the .uk web domain. Ad hoc compression
Paolo Boldi, Massimo Santini, Sebastiano Vigna
CMMR
2004
Springer
110views Music» more  CMMR 2004»
15 years 3 months ago
A Self-Organizing Map Based Knowledge Discovery for Music Recommendation Systems
Abstract. In this paper, we present an approach for musical artist recommendation based on Self-Organizing Maps (SOMs) of artist reviews from Amazon web site. The Amazon reviews fo...
Shankar Vembu, Stephan Baumann
ICIP
2000
IEEE
15 years 11 months ago
Optimizing Block-Threshold Segmentation for MRC Compression
Compound document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites, etc. We focus ...
Ricardo L. de Queiroz, Zhigang Fan, Trac D. Tran
HICSS
2008
IEEE
105views Biometrics» more  HICSS 2008»
15 years 4 months ago
Using Visual Features for Fine-Grained Genre Classification of Web Pages
The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...
Ryan Levering, Michal Cutler, Lei Yu
WWW
2004
ACM
15 years 10 months ago
The webgraph framework I: compression techniques
Studying Web graphs is often difficult due to their large size. Recently, several proposals have been published about various techniques that allow to store a Web graph in memory ...
Paolo Boldi, Sebastiano Vigna