Sciweavers

2141 search results - page 381 / 429
» Classifying web sites
Sort
View
WWW
2008
ACM
15 years 10 months ago
IRLbot: scaling to 6 billion pages and beyond
This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...
Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmit...
WWW
2005
ACM
15 years 10 months ago
Fully automatic wrapper generation for search engines
When a query is submitted to a search engine, the search engine returns a dynamically generated result page containing the result records, each of which usually consists of a link...
Hongkun Zhao, Weiyi Meng, Zonghuan Wu, Vijay Ragha...
CHI
2009
ACM
15 years 10 months ago
The doctor as the second opinion and the internet as the first
People who use the Internet for health information often obtain their first opinion that way, and then, if they go to a doctor, the doctors advice is relegated to the second opini...
Lisa Neal Gualtieri
CHI
2009
ACM
15 years 10 months ago
StoryTags: once upon a time, there was a photo
Nuno Tom?s Daniel Gon?alves With the growing volume of digital information users must deal with, management and retrieval tasks have become increasingly problematic. A popular way ...
Nuno Tomás, Tiago João Vieira Guerre...
SIGMOD
2008
ACM
206views Database» more  SIGMOD 2008»
15 years 10 months ago
Ad-hoc aggregations of ranked lists in the presence of hierarchies
A variety of web sites and web based services produce textual lists at varying time granularities ranked according to several criteria. For example, Google Trends produces lists o...
Nilesh Bansal, Sudipto Guha, Nick Koudas