Sciweavers

4895 search results - page 624 / 979
» Web object retrieval
Sort
View
SOSP
2003
ACM
16 years 3 months ago
Preserving peer replicas by rate-limited sampled voting
The LOCKSS project has developed and deployed in a worldwide test a peer-to-peer system for preserving access to journals and other archival information published on the Web. It c...
Petros Maniatis, David S. H. Rosenthal, Mema Rouss...
WWW
2010
ACM
16 years 1 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
ADAPTIVE
2007
Springer
16 years 10 days ago
Adaptive Focused Crawling
The large amount of available information on the Web makes it hard for users to locate resources about particular topics of interest. Traditional search tools, e.g., search engines...
Alessandro Micarelli, Fabio Gasparetti
159
Voted
AIRWEB
2007
Springer
16 years 10 days ago
Using Spam Farm to Boost PageRank
Nowadays web spamming has emerged to take the economic advantage of high search rankings and threatened the accuracy and fairness of those rankings. Understanding spamming techniq...
Ye Du, Yaoyun Shi, Xin Zhao
ENTER
2007
Springer
16 years 9 days ago
Annotating Accommodation Advertisements Using CERNO
There has been great interest in applying Semantic Web technologies to the tourism sector ever since Tim Berners-Lee introduced his vision. Unfortunately, there is a major obstacl...
Nadzeya Kiyavitskaya, Nicola Zeni, Luisa Mich, Jam...