Sciweavers

106 search results - page 3 / 22
» Retrieving Web Pages Using Content, Links, URLs and Anchors
Sort
View
SIGIR
2009
ACM
14 years 22 days ago
Building enriched document representations using aggregated anchor text
It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....
Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...
SIGIR
2000
ACM
13 years 10 months ago
Topical locality in the Web
Most web pages are linked to others with related content. This idea, combined with another that says that text in, and possibly around, HTML anchors describe the pages to which th...
Brian D. Davison
TREC
2001
13 years 7 months ago
Yonsei/ETRI at TREC-10: Utilizing Web Document Properties
As our first TREC participation, four runs were submitted for the ad hoc task and two runs for the home page finding task in the web track. For the ad hoc task we experimented on ...
Dong-Yul Ra, Eui-Kyu Park, Joong-Sik Jang
WWW
2006
ACM
14 years 7 months ago
Geographically focused collaborative crawling
A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...
Weizheng Gao, Hyun Chul Lee, Yingbo Miao
WWW
2006
ACM
14 years 5 days ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar