Search Sciweavers | Sciweavers

89

Voted

WWW
2003
ACM

130views Internet Technology» more WWW 2003»

DOM-based content extraction of HTML documents

15 years 11 months ago

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

claim paper

Read More »

85

click to vote

KDD
2009
ACM

185views Data Mining» more KDD 2009»

On compressing social networks

15 years 10 months ago

Download www.eecs.harvard.edu

Motivated by structural properties of the Web graph that support efficient data structures for in memory adjacency queries, we study the extent to which a large network can be com...

Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, ...

claim paper

Read More »

88

Voted

SIGIR
2009
ACM

153views Information Technology» more SIGIR 2009»

Building enriched document representations using aggregated anchor text

15 years 4 months ago

Download ciir.cs.umass.edu

It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....

Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...

claim paper

Read More »

75

Voted

VLDB
2004
ACM

113views Database» more VLDB 2004»

Accurate and Efficient Crawling for Relevant Websites

15 years 3 months ago

Download www.vldb.org

Focused web crawlers have recently emerged as an alternative to the well-established web search engines. While the well-known focused crawlers retrieve relevant webpages, there ar...

Martin Ester, Hans-Peter Kriegel, Matthias Schuber...

claim paper

Read More »

83

Voted

WECWIS
1999
IEEE

111views ECommerce» more WECWIS 1999»

A Quantitative Analysis of the User Behavior of a Large E-Broker

15 years 2 months ago

Download homepages.dcc.ufmg.br

The Internet and the World Wide Web provide a global virtual marketplace. However, there is little information about the behavior of e-commerce users worldwide. The goal of the pa...

Virgilio Almeida, Wagner Meira Jr., Victor F. Ribe...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers