Sciweavers

WWW 2010, ACM
14 years 23 days ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs cause serious problems across the whole pipeline of a search engine, from crawling and indexing to result serving. URL normalization transforms duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...