Sciweavers

6103 search results - page 702 / 1221
» Multimedia Retrieval Algorithmics
Sort
View
ICML
2005
IEEE
16 years 5 months ago
Hierarchical Dirichlet model for document classification
The proliferation of text documents on the web as well as within institutions necessitates their convenient organization to enable efficient retrieval of information. Although tex...
Sriharsha Veeramachaneni, Diego Sona, Paolo Avesan...
135
Voted
WWW
2008
ACM
16 years 5 months ago
Generating hypotheses from the web
Hypothesis generation is a crucial initial step for making scientific discoveries. This paper addresses the problem of automatically discovering interesting hypotheses from the we...
Wei Jin, Rohini K. Srihari, Abhishek Singh
WWW
2006
ACM
16 years 5 months ago
Visually guided bottom-up table detection and segmentation in web documents
In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. O...
Bernhard Krüpl, Marcus Herzog
WWW
2006
ACM
16 years 5 months ago
Using symbolic objects to cluster web documents
Web Clustering is useful for several activities in the WWW, from automatically building web directories to improve retrieval performance. Nevertheless, due to the huge size of the...
Esteban Meneses, Oldemar Rodríguez-Rojas
151
Voted
SPIRE
2009
Springer
15 years 11 months ago
A Linear-Time Burrows-Wheeler Transform Using Induced Sorting
To compute Burrows-Wheeler Transform (BWT), one usually builds a suffix array (SA) first, and then obtains BWT using SA, which requires much redundant working space. In previous ...
Daisuke Okanohara, Kunihiko Sadakane