Search Sciweavers | Sciweavers

694 search results - page 81 / 139

» Web page ranking using link attributes

WWW
2006
ACM

139 views, Internet Technology

Do not crawl in the DUST: different URLs with similar text

15 years 6 months ago

Download www2007.org

We consider the problem of DUST: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...

Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar

SIGMOD
2012
ACM

230 views, Database

Pay-as-you-go data integration for linked data: opportunities, challenges and architectures

13 years 3 months ago

Download www.cs.man.ac.uk

Linked Data (LD) provides principles for publishing data that underpin the development of an emerging web of data. LD follows the web in providing low barriers to entry: publisher...

Norman W. Paton, Klitos Christodoulou, Alvaro A. A...

LREC
2008

159 views, Education

Corpus Exploitation from Wikipedia for Ontology Construction

15 years 2 months ago

Download www.lrec-conf.org

Ontology construction usually requires a domain-specific corpus for building corresponding concept hierarchy. The domain corpus must have a good coverage of domain knowledge. Wiki...

Gaoying Cui, Qin Lu, Wenjie Li, Yi-Rong Chen

WWW
2008
ACM

91 views, Internet Technology

IRLbot: scaling to 6 billion pages and beyond

16 years 1 month ago

Download irl.cs.tamu.edu

This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with ...

Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmit...

SIGIR
2008
ACM

131 views, Information Technology

Pagerank based clustering of hypertext document collections

15 years 7 days ago

Download www-sop.inria.fr

Clustering hypertext document collections is an important task in Information Retrieval. Most clustering methods are based on document content and do not take into account the hype...

Konstantin Avrachenkov, Vladimir Dobrynin, Danil N...
