Sciweavers

81 search results - page 11 / 17
» Human Performance on Clustering Web Pages: A Preliminary Stu...
Sort
View
ISAAC
2009
Springer
175views Algorithms» more  ISAAC 2009»
15 years 6 months ago
Worst-Case and Smoothed Analysis of k-Means Clustering with Bregman Divergences
The k-means algorithm is the method of choice for clustering large-scale data sets and it performs exceedingly well in practice. Most of the theoretical work is restricted to the c...
Bodo Manthey, Heiko Röglin
WISE
2002
Springer
15 years 4 months ago
Cluster-Based Delta Compression of a Collection of Files
Delta compression techniques are commonly used to succinctly represent an updated version of a file with respect to an earlier one. In this paper, we study the use of delta compr...
Zan Ouyang, Nasir D. Memon, Torsten Suel, Dimitre ...
TREC
2003
15 years 1 months ago
Overview of the TREC 2003 Web Track
The TREC 2003 web track consisted of both a non-interactive stream and an interactive stream. Both streams worked with the .GOV test collection. The non-interactive stream continu...
Nick Craswell, David Hawking, Ross Wilkinson, Ming...
90
Voted
WWW
2007
ACM
16 years 13 days ago
U-REST: an unsupervised record extraction system
In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...
Yuan Kui Shen, David R. Karger
WWW
2004
ACM
16 years 13 days ago
Smartback: supporting users in back navigation
This paper presents the design and user evaluation of SmartBack, a feature that complements the standard Back button by enabling users to jump directly to key pages in their navig...
Natasa Milic-Frayling, Rachel Jones, Kerry Rodden,...