Sciweavers

18 search results - page 1 / 4
» A Fast Community Based Algorithm for Generating Web Crawler ...
Sort
View
WEBIST
2008
13 years 6 months ago
A Fast Community Based Algorithm for Generating Web Crawler Seeds Set
Shervin Daneshpajouh, Mojtaba Mohammadi Nasiri, Mo...
AIRWEB
2007
Springer
13 years 11 months ago
Extracting Link Spam using Biased Random Walks from Spam Seed Sets
Link spam deliberately manipulates hyperlinks between web pages in order to unduly boost the search engine ranking of one or more target pages. Link based ranking algorithms such ...
Baoning Wu, Kumar Chellapilla
SDM
2007
SIAM
137views Data Mining» more  SDM 2007»
13 years 6 months ago
Are approximation algorithms for consensus clustering worthwhile?
Consensus clustering has emerged as one of the principal clustering problems in the data mining community. In recent years the theoretical computer science community has generated...
Michael Bertolacci, Anthony Wirth
WWW
2006
ACM
14 years 5 months ago
Detecting semantic cloaking on the web
By supplying different versions of a web page to search engines and to browsers, a content provider attempts to cloak the real content from the view of the search engine. Semantic...
Baoning Wu, Brian D. Davison
IM
2007
13 years 4 months ago
Cluster Generation and Labeling for Web Snippets: A Fast, Accurate Hierarchical Solution
This paper describes Armil, a meta-search engine that groups the web snippets returned by auxiliary search engines into disjoint labeled clusters. The cluster labels generated by A...
Filippo Geraci, Marco Pellegrini, Marco Maggini, F...