Search Sciweavers | Sciweavers

311 search results - page 51 / 63

» Cleaning Web Pages for Effective Web Content Mining

WSDM
2010
ACM

204 views, Data Mining

Learning URL patterns for webpage de-duplication

15 years 6 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...


KDD
2007
ACM

155 views, Data Mining

Mining templates from search result records of search engines

16 years 2 days ago

Download www.cs.binghamton.edu

Metasearch engine, Comparison-shopping and Deep Web crawling applications need to extract search result records enwrapped in result pages returned from search engines in response ...

Hongkun Zhao, Weiyi Meng, Clement T. Yu


SIGSOFT
2008
ACM

114 views, Software Engineering

Doloto: code splitting for network-bound web 2.0 applications

16 years 13 days ago

Download research.microsoft.com

Modern Web 2.0 applications, such as GMail, Live Maps, Facebook and many others, use a combination of Dynamic HTML, JavaScript and other Web browser technologies commonly referred...

V. Benjamin Livshits, Emre Kiciman


ICDE
2004
IEEE

151 views, Database

Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks

16 years 1 months ago

Download cis.poly.edu

We study the problem of maintaining large replicated collections of files or documents in a distributed environment with limited bandwidth. This problem arises in a number of impo...

Torsten Suel, Patrick Noel, Dimitre Trendafilov


KDD
2006
ACM

198 views, Data Mining

Event detection from evolution of click-through data

16 years 2 days ago

Download research.microsoft.com

Previous efforts on event detection from the web have focused primarily on web content and structure data ignoring the rich collection of web log data. In this paper, we propose t...

Qiankun Zhao, Tie-Yan Liu, Sourav S. Bhowmick, Wei...


