Search Sciweavers | Sciweavers

226 search results - page 33 / 46

» Web Page Clustering Using Heuristic Search in the Web Graph

KDD
2007
ACM

Mining templates from search result records of search engines

16 years 8 months ago

Download www.cs.binghamton.edu

Metasearch engines, comparison-shopping, and Deep Web crawling applications need to extract search result records enwrapped in result pages returned from search engines in response ...

Hongkun Zhao, Weiyi Meng, Clement T. Yu

CIKM
2011
Springer

Probabilistic near-duplicate detection using simhash

14 years 7 months ago

Download irl.cs.tamu.edu

This paper offers a novel look at using a dimensionality-reduction technique called simhash [8] to detect similar document pairs in large-scale collections. We show that this algo...

Sadhan Sood, Dmitri Loguinov

CIKM
2009
Springer

A general Markov framework for page importance computation

16 years 2 months ago

Download research.microsoft.com

We propose a General Markov Framework for computing page importance. Under the framework, a Markov Skeleton Process is used to model the random walk conducted by the web surfer on...

Bin Gao, Tie-Yan Liu, Zhiming Ma, Taifeng Wang, Ha...

CIKM
2008
Springer

A random walk on the red carpet: rating movies with user reviews and PageRank

15 years 9 months ago

Download dblab.cs.nccu.edu.tw

Although PageRank has been designed to estimate the popularity of Web pages, it is a general algorithm that can be applied to the analysis of graphs other than one of hypert...

Derry Tanti Wijaya, Stéphane Bressan

WSDM
2009
ACM

Less is more: sampling the neighborhood graph makes SALSA better and faster

16 years 2 months ago

Download wsdm2009.org

In this paper, we attempt to improve the effectiveness and the efficiency of query-dependent link-based ranking algorithms such as HITS, MAX and SALSA. All these ranking algorith...

Marc Najork, Sreenivas Gollapudi, Rina Panigrahy
