Sciweavers

311 search results - page 45 / 63
» Cleaning Web Pages for Effective Web Content Mining
GIR
2007
ACM
15 years 1 month ago
Geo-tagging for imprecise regions of different sizes
Extracting geographical information from various web sources is likely to be important for a variety of applications. One such use for this information is to enable the study of v...
Robert Pasley, Paul Clough, Mark Sanderson
CIKM
2006
Springer
15 years 1 month ago
Mining blog stories using community-based and temporal clustering
In recent years, weblogs, or blogs for short, have become an important form of online content. The personal nature of blogs, online interactions between bloggers, and the temporal...
Arun Qamra, Belle L. Tseng, Edward Y. Chang
WWW
2009
ACM
15 years 10 months ago
Large scale multi-label classification via metalabeler
The explosion of online content has made the management of such content non-trivial. Web-related tasks such as web page categorization, news filtering, query categorization, tag r...
Lei Tang, Suju Rajan, Vijay K. Narayanan
WWW
2004
ACM
15 years 10 months ago
Continuous web: a new image-based hypermedia and scape-oriented browsing
Conventionally, Web pages have been recognized as documents described by HTML. Image data, such as photographs, logos, maps, illustrations, and decorated text, have been treated a...
Hiroya Tanaka, Katsumi Tanaka
SIGMETRICS
2002
ACM
14 years 9 months ago
Inferring client response time at the web server
As businesses continue to grow their World Wide Web presence, it is becoming increasingly vital for them to have quantitative measures of the client perceived response times of th...
David P. Olshefski, Jason Nieh, Dakshi Agrawal