Sciweavers

51 search results - page 10 / 11
» Exploiting Web Log Mining for Web Cache Enhancement
Sort
View
ICDE
2004
IEEE
151views Database» more  ICDE 2004»
15 years 11 months ago
Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks
We study the problem of maintaining large replicated collections of files or documents in a distributed environment with limited bandwidth. This problem arises in a number of impo...
Torsten Suel, Patrick Noel, Dimitre Trendafilov
JCDL
2004
ACM
114views Education» more  JCDL 2004»
15 years 3 months ago
Translating unknown cross-lingual queries in digital libraries using a web-based approach
Users’ cross-lingual queries to a digital library system might be short and not included in a common translation dictionary (unknown terms). In this paper, we investigate the fe...
Jenq-Haur Wang, Jei-Wen Teng, Pu-Jen Cheng, Wen-Hs...
91
Voted
SIGCOMM
2006
ACM
15 years 3 months ago
Drafting behind Akamai (travelocity-based detouring)
To enhance web browsing experiences, content distribution networks (CDNs) move web content “closer” to clients by caching copies of web objects on thousands of servers worldwi...
Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovi...
WSDM
2010
ACM
315views Data Mining» more  WSDM 2010»
15 years 7 months ago
SBotMiner: Large Scale Search Bot Detection
In this paper, we study search bot traffic from search engine query logs at a large scale. Although bots that generate search traffic aggressively can be easily detected, a large ...
Fang Yu, Yinglian Xie, Qifa Ke
AUSDM
2008
Springer
243views Data Mining» more  AUSDM 2008»
14 years 11 months ago
Structure-Based Document Model with Discrete Wavelet Transforms and Its Application to Document Classification
Term signal is an existing text representation that depicts a term as a vector of frequencies of occurrences in a number of user-defined partitions of a document. Although term si...
Supphachai Thaicharoen, Tom Altman, Krzysztof J. C...