Sciweavers

471 search results - page 41 / 95
» On the Relative Dominance of Paging Algorithms
Sort
View
IDEAL
2005
Springer
15 years 3 months ago
Differential Priors for Elastic Nets
The elastic net and related algorithms, such as generative topographic mapping, are key methods for discretized dimension-reduction problems. At their heart are priors that specify...
Miguel Á. Carreira-Perpiñán, ...
WWW
2006
ACM
15 years 10 months ago
Effective web-scale crawling through website analysis
The web crawler space is often delimited into two general areas: full-web crawling and focused crawling. We present netSifter, a crawler system which integrates features from thes...
Iván Gonzlez, Adam Marcus 0002, Daniel N. M...
74
Voted
AIRWEB
2006
Springer
15 years 1 months ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
DAS
2006
Springer
15 years 1 months ago
Ground Truth for Layout Analysis Performance Evaluation
Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has b...
Apostolos Antonacopoulos, Dimosthenis Karatzas, Da...
WWW
2009
ACM
15 years 10 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth