Search Sciweavers | Sciweavers

9 search results - page 2 / 2

» A Query-Dependent Duplicate Detection Approach for Large Sca...

click to vote

WWW
2010
ACM

224views Internet Technology» more WWW 2010»

Large-scale bot detection for search engines

14 years 25 days ago

Download www.hwkang.com

In this paper, we propose a semi-supervised learning approach for classifying program (bot) generated web search traﬃc from that of genuine human users. The work is motivated by...

Hongwen Kang, Kuansan Wang, David Soukal, Fritz Be...

claim paper

Read More »

click to vote

CPM
2000
Springer

177views Combinatorics» more CPM 2000»

Identifying and Filtering Near-Duplicate Documents

13 years 10 months ago

Download www.cs.princeton.edu

Abstract. The mathematical concept of document resemblance captures well the informal notion of syntactic similarity. The resemblance can be estimated using a ﬁxed size “sketch...

Andrei Z. Broder

claim paper

Read More »

click to vote

PAKDD
2009
ACM

120views Data Mining» more PAKDD 2009»

Detecting Link Hijacking by Web Spammers.

14 years 3 months ago

Download www.tkl.iis.u-tokyo.ac.jp

Abstract. Since current search engines employ link-based ranking algorithms as an important tool to decide a ranking of sites, Web spammers are making a signiﬁcant eﬀort to man...

Masaru Kitsuregawa, Masashi Toyoda, Young-joo Chun...

claim paper

Read More »

click to vote

ICDE
2009
IEEE

251views Database» more ICDE 2009»

Contextual Ranking of Keywords Using Click Data

14 years 7 months ago

Download cis.poly.edu

The problem of automatically extracting the most interesting and relevant keyword phrases in a document has been studied extensively as it is crucial for a number of applications. ...

Utku Irmak, Vadim von Brzeski, Reiner Kraft

claim paper

Read More »

« Prev « First page 2 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers