Sciweavers

611 search results - page 50 / 123
» Random web crawls
Sort
View
ECIR
2008
Springer
14 years 11 months ago
The Importance of Link Evidence in Wikipedia
Wikipedia is one of the most popular information sources on the Web. The free encyclopedia is densely linked. The link structure in Wikipedia differs from the Web at large: interna...
Jaap Kamps, Marijn Koolen
PODS
2006
ACM
127views Database» more  PODS 2006»
15 years 10 months ago
Evolution of page popularity under random web graph models
The link structure of the Web can be viewed as a massive graph. The preferential attachment model and its variants are well-known random graph models that help explain the evoluti...
Rajeev Motwani, Ying Xu 0002
CN
1999
73views more  CN 1999»
14 years 9 months ago
Measuring Index Quality Using Random Walks on the Web
Recent researchhas studied howto measurethe size of a searchengine, in terms of the number of pages indexed. In this paper, we consider a di erent measure for search engines, name...
Monika Rauch Henzinger, Allan Heydon, Michael Mitz...
ICML
2007
IEEE
15 years 10 months ago
Dynamic hierarchical Markov random fields and their application to web data extraction
Hierarchical models have been extensively studied in various domains. However, existing models assume fixed model structures or incorporate structural uncertainty generatively. In...
Jun Zhu, Zaiqing Nie, Bo Zhang, Ji-Rong Wen
CIKM
2009
Springer
15 years 4 months ago
Vetting the links of the web
Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...
Na Dai, Brian D. Davison