Sciweavers

6651 search results - page 161 / 1331
» Translating Web Data
Sort
View
WWW
2007
ACM
16 years 4 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
102
Voted
KDD
2007
ACM
376views Data Mining» more  KDD 2007»
16 years 4 months ago
Truth discovery with multiple conflicting information providers on the web
The world-wide web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the web. Moreover, d...
Xiaoxin Yin, Jiawei Han, Philip S. Yu
SDM
2008
SIAM
135views Data Mining» more  SDM 2008»
15 years 5 months ago
A Spamicity Approach to Web Spam Detection
Web spam, which refers to any deliberate actions bringing to selected web pages an unjustifiable favorable relevance or importance, is one of the major obstacles for high quality ...
Bin Zhou 0002, Jian Pei, ZhaoHui Tang
135
Voted
WEBNET
2000
15 years 4 months ago
Mining the Most Interesting Web Access Associations
: Web access patterns can provide valuable information for website designers in making website-based communication more efficient. To extract interesting or useful web access patte...
Li Shen, Ling Cheng, James Ford, Fillia Makedon, V...
WWW
2008
ACM
16 years 4 months ago
Linked data on the web (LDOW2008)
The Web is increasingly understood as a global information space consisting not just of linked documents, but also of Linked Data. More than just a vision, the resulting Web of Da...
Christian Bizer, Tom Heath, Kingsley Idehen, Tim B...