Sciweavers

311 search results - page 48 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
CIKM
2008
Springer
14 years 11 months ago
Book search: indexing the valuable parts
With massive book digitization efforts underway, there is a need for developing effective book retrieval strategies. This paper explores the relative contribution of different par...
Walid Magdy, Kareem Darwish
ICDE
2003
IEEE
208views Database» more  ICDE 2003»
15 years 2 months ago
DBProxy: A dynamic data cache for Web applications
The majority of web pages served today are generated dynamically, usually by an application server querying a back-end database. To enhance the scalability of dynamic content serv...
Khalil Amiri, Sanghyun Park, Renu Tewari, Sriram P...
KDD
2007
ACM
193views Data Mining» more  KDD 2007»
15 years 9 months ago
Joint optimization of wrapper generation and template detection
Many websites have large collections of pages generated dynamically from an underlying structured source like a database. The data of a category are typically encoded into similar...
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu
SIGIR
2005
ACM
15 years 3 months ago
A study of relevance propagation for web search
Different from traditional information retrieval, both content and structure are critical to the success of Web information retrieval. In recent years, many relevance propagation ...
Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, Zheng Chen, W...
TISSEC
2008
100views more  TISSEC 2008»
14 years 9 months ago
Message Dropping Attacks in Overlay Networks: Attack Detection and Attacker Identification
Overlay multicast networks are used by service providers to distribute contents such as web pages, streaming multimedia data, or security updates to a large number of users. Howeve...
Liang Xie, Sencun Zhu