Sciweavers

33 search results - page 7 / 7
» Lossless compression for large scale cluster logs
WSDM
2010
ACM
14 years 3 days ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
WWW
2010
ACM
14 years 7 days ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
ICPP
1998
IEEE
13 years 9 months ago
MPEG-4 Based Interactive Video using Parallel Processing
MPEG-4, which is currently being developed by MPEG (Moving Pictures Experts Group), is poised to become a standard for supporting current and emerging interactive multimedia applic...
Yong He, Ishfaq Ahmad, Ming L. Liou