Sciweavers

» Web page language identification based on URLs
CIKM
2009
Springer
15 years 7 months ago
Vetting the links of the web
Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...
Na Dai, Brian D. Davison
WWW
2010
ACM
15 years 18 days ago
A characterization of online browsing behavior
In this paper, we undertake a large-scale study of online user behavior based on search and toolbar logs. We propose a new CCS taxonomy of pageviews consisting of Content (news, p...
Ravi Kumar, Andrew Tomkins
AIRWEB
2007
Springer
15 years 6 months ago
A Taxonomy of JavaScript Redirection Spam
Redirection spam presents a web page with false content to a crawler for indexing, but automatically redirects the browser to a different web page. Redirection is usually immediat...
Kumar Chellapilla, Alexey Maykov
IC
2004
15 years 1 month ago
IskaWeb: A Web-Based Information System for the Classification of Industrial Wastes
Industrial wastes must be classified at least two times on the way from the owner of the waste to the waste disposal facility in order to ensure that waste handling is in conformi...
J. O. Dada, Hans-Dieter Kochs, Jörg Petersen
JCIT
2010
14 years 7 months ago
Linguistic Information Processing Based on Aggregation Operator over the Internet
Much information over the Internet is expressed by natural languages. The management of linguistic information involves an operation of comparison and aggregation. In this paper, ...
Li Yan, Yi Qin, Zheng Pei