Search Sciweavers | Sciweavers

106 search results - page 3 / 22

» Retrieving Web Pages Using Content, Links, URLs and Anchors

170

click to vote

SIGIR
2009
ACM

153views Information Technology» more SIGIR 2009»

Building enriched document representations using aggregated anchor text

15 years 12 months ago

Download ciir.cs.umass.edu

It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....

Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...

claim paper

Read More »

140

click to vote

SIGIR
2000
ACM

81views Information Technology» more SIGIR 2000»

Topical locality in the Web

15 years 9 months ago

Download dspc11.cs.ccu.edu.tw

Most web pages are linked to others with related content. This idea, combined with another that says that text in, and possibly around, HTML anchors describe the pages to which th...

Brian D. Davison

claim paper

Read More »

153

click to vote

TREC
2001

123views Information Technology» more TREC 2001»

Yonsei/ETRI at TREC-10: Utilizing Web Document Properties

15 years 6 months ago

Download trec.nist.gov

As our first TREC participation, four runs were submitted for the ad hoc task and two runs for the home page finding task in the web track. For the ad hoc task we experimented on ...

Dong-Yul Ra, Eui-Kyu Park, Joong-Sik Jang

claim paper

Read More »

144

click to vote

WWW
2006
ACM

138views Internet Technology» more WWW 2006»

Geographically focused collaborative crawling

16 years 6 months ago

Download www2006.org

A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...

Weizheng Gao, Hyun Chul Lee, Yingbo Miao

claim paper

Read More »

137

click to vote

WWW
2006
ACM

139views Internet Technology» more WWW 2006»

Do not crawl in the DUST: different URLs with similar text

15 years 11 months ago

Download www2007.org

We consider the problem of dust: Diﬀerent URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...

Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar

claim paper

Read More »

« Prev « First page 3 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers