Sciweavers

421 search results - page 55 / 85
» Page Quality: In Search of an Unbiased Web Ranking
Sort
View
WWW
2009
ACM
15 years 11 months ago
Graph based crawler seed selection
This paper identifies and explores the problem of seed selection in a web-scale crawler. We argue that seed selection is not a trivial but very important problem. Selecting proper...
Shuyi Zheng, Pavel Dmitriev, C. Lee Giles
CORR
2008
Springer
133views Education» more  CORR 2008»
14 years 11 months ago
Faceted Ranking of Egos in Collaborative Tagging Systems
Multimedia uploaded content is tagged and recommended by users of collaborative systems, resulting in informal classifications also known as folksonomies. Faceted web ranking has ...
José Ignacio Orlicki, Pablo Ignacio Fierens...
WWW
2005
ACM
15 years 4 months ago
An information extraction engine for web discussion forums
In this poster, we present an information extraction engine for web-based forums. The engine analyzes the HTML files crawled from web forums, deduces the wrapper (template) of the...
Hanny Yulius Limanto, Nguyen Ngoc Giang, Vo Tan Tr...
WWW
2001
ACM
15 years 11 months ago
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina
93
Voted
ICDE
2008
IEEE
152views Database» more  ICDE 2008»
15 years 5 months ago
Automated generation of object summaries from relational databases: A novel keyword searching paradigm
— This paper introduces a novel keyword searching paradigm in Relational Databases (DBs), where the result of a search is a ranked set of Object Summaries (OSs). An OS summarizes...
Georgios John Fakas