Sciweavers

75 search results - page 2 / 15
» Capturing page freshness for web search
LAWEB
2003
IEEE
13 years 10 months ago
Cooperation Schemes between a Web Server and a Web Search Engine
Search engines provide search results based on a large repository of pages downloaded by a web crawler from several servers. To provide best results, this repository must be kept ...
Carlos Castillo
IADIS
2003
13 years 6 months ago
A Scalable Distributed Search Engine for Fresh Information Retrieval
We have developed a distributed search engine, Cooperative Search Engine (CSE), to retrieve fresh information. In CSE, a local search engine located in each web server makes an ind...
Nobuyoshi Sato, Minoru Uehara, Yoshifumi Sakai
SIGMOD
2010
ACM
13 years 5 months ago
Optimizing content freshness of relations extracted from the web using keyword search
An increasing number of applications operate on data obtained from the Web. These applications typically maintain local copies of the web data to avoid network latency in data acc...
Mohan Yang, Haixun Wang, Lipyeow Lim, Min Wang
WWW
2008
ACM
14 years 5 months ago
Microscale evolution of web pages
We track a large set of "rapidly" changing web pages and examine the assumption that the arrival of content changes follows a Poisson process on a microscale. We demonst...
Carrie Grimes
WWW
2009
ACM
14 years 5 months ago
Sitemaps: above and beyond the crawl of duty
Comprehensive coverage of the public web is crucial to web search engines. Search engines use crawlers to retrieve pages and then discover new ones by extracting the pages' o...
Uri Schonfeld, Narayanan Shivakumar