Sciweavers

311 search results - page 22 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
WWW
2011
ACM
14 years 4 months ago
Prophiler: a fast filter for the large-scale detection of malicious web pages
Malicious web pages that host drive-by-download exploits have become a popular means for compromising hosts on the Internet and, subsequently, for creating large-scale botnets. In...
Davide Canali, Marco Cova, Giovanni Vigna, Christo...
AAAI
2007
14 years 11 months ago
Mining Web Query Hierarchies from Clickthrough Data
In this paper, we propose to mine query hierarchies from clickthrough data, which is within the larger area of automatic acquisition of knowledge from the Web. When a user submits...
Dou Shen, Min Qin, Weizhu Chen, Qiang Yang, Zheng ...
SIGMOD
2008
ACM
167views Database» more  SIGMOD 2008»
15 years 9 months ago
DiMaC: a system for cleaning disguised missing data
In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...
Ming Hua, Jian Pei

Publication
212views
15 years 2 months ago
Browser independent content based image resizing for liquid web layouts
A typical problem for webdesigners is to realize pages that can be potentially accessed from a number of display devices with different screen sizes and resolutions. Liquid layouts...
Gallea Roberto, Ardizzone Edoardo, Pirrone Roberto
CIB
2002
100views more  CIB 2002»
14 years 9 months ago
Web-log Mining for Quantitative Temporal-Event Prediction
The web log data embed much of web users' browsing behavior. From the web logs, one can discover patterns that predict the users' future requests based on their current b...
Qiang Yang, Hui Wang, Wei Zhang