Sciweavers

1759 search results - page 55 / 352
» Distributed Paging
Sort
View
WWW
2009
ACM
16 years 2 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
WWW
2007
ACM
16 years 2 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
95
Voted
DAGSTUHL
2003
15 years 3 months ago
SHOE: A Blueprint for the Semantic Web
The term Semantic Web was coined by Tim Berners-Lee to describe his proposal for \a web of meaning," as opposed to the \web of links" that currently exists on the Intern...
Jeff Heflin, James A. Hendler, Sean Luke
OSDI
1994
ACM
15 years 3 months ago
Opportunistic Log: Efficient Installation Reads in a Reliable Storage Server
In a distributed storage system, client caches managed on the basis of small granularity objects can provide better memory utilization then page-based caches. However, object serv...
James O'Toole, Liuba Shrira
JSS
2008
116views more  JSS 2008»
15 years 2 months ago
Characterization of the evolution of a news Web site
The Web has become a ubiquitous tool for distributing knowledge and information and for conducting businesses. To exploit the huge potential of the Web as a global information rep...
Mariacarla Calzarossa, Daniele Tessera