Sciweavers

106 search results - page 12 / 22
Retrieving Web Pages Using Content, Links, URLs and Anchors
WWW
2008
ACM
16 years 9 days ago
IRLbot: scaling to 6 billion pages and beyond
This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...
Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmit...
CIKM
2010
Springer
14 years 10 months ago
Entity ranking using Wikipedia as a pivot
In this paper we investigate the task of Entity Ranking on the Web. Searchers looking for entities are arguably better served by presenting a ranked list of entities directly, rat...
Rianne Kaptein, Pavel Serdyukov, Arjen P. de Vries...
AAAI
2006
15 years 1 month ago
Bookmark Hierarchies and Collaborative Recommendation
GiveALink.org is a social bookmarking site where users may donate and view their personal bookmark files online securely. The bookmarks are analyzed to build a new generation of i...
Benjamin Markines, Lubomira Stoilova, Filippo Menc...
CORR
2007
Springer
14 years 11 months ago
Portlet Wrappers using JavaScript
In this paper we extend the classical portal (with static portlets) design with HTML DOM Web clipping on the client browser using dynamic JavaScript portlets: the portal server su...
Paul Fodor
ICTIR
2009
Springer
15 years 6 months ago
PageRank: Splitting Homogeneous Singular Linear Systems of Index One
Abstract. The PageRank algorithm is used today within web information retrieval to provide a content-neutral ranking metric over web pages. It employs power method iterations to so...
Douglas V. de Jager, Jeremy T. Bradley