Sciweavers

55 search results - page 1 / 11
» Web page sectioning using regex-based template
Sort
View
WWW
2008
ACM
14 years 4 months ago
Web page sectioning using regex-based template
This work aims to provide a novel, site-specific web page segmentation and section importance detection algorithm, which leverages structural, content, and visual information. The...
Rupesh R. Mehta, Amit Madaan
ICDM
2007
IEEE
476views Data Mining» more  ICDM 2007»
13 years 10 months ago
FiVaTech: Page-Level Web Data Extraction from Template Pages
In this paper, we proposed a new approach, called FiVaTech for the problem of Web data extraction. FiVaTech is a page-level data extraction system which deduces the data schema an...
Mohammed Kayed, Chia-Hui Chang, Khaled F. Shaalan,...
CIKM
2006
Springer
13 years 7 months ago
A fast and robust method for web page template detection and removal
The widespread use of templates on the Web is considered harmful for two main reasons. Not only do they compromise the relevance judgment of many web IR and web mining methods suc...
Karane Vieira, Altigran Soares da Silva, Nick Pint...
WWW
2005
ACM
14 years 4 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
SAC
2002
ACM
13 years 3 months ago
Dynamically generating web application fragments from page templates
Web-based applications are typically required to be highly customizable and configurable. New application requirements have to be introduced rapidly, often without stopping the ru...
Uwe Zdun