Sciweavers

19 search results - page 3 / 4
» Incremental web page template detection
Sort
View
DKE
2006
122views more  DKE 2006»
14 years 9 months ago
Sampling, information extraction and summarisation of Hidden Web databases
Hidden Web databases maintain a collection of specialised documents, which are dynamically generated in response to users' queries. The majority of these documents are genera...
Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...
JCDL
2011
ACM
301views Education» more  JCDL 2011»
14 years 15 days ago
Archiving the web using page changes patterns: a case study
A pattern is a model or a template used to summarize and describe the behavior (or the trend) of a data having generally some recurrent events. Patterns have received a considerab...
Myriam Ben Saad, Stéphane Gançarski
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
15 years 7 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
ICWE
2004
Springer
15 years 3 months ago
Accelerating Dynamic Web Content Delivery Using Keyword-Based Fragment Detection
The recent trend in the Internet traffic is increasing in requests for dynamic and personalized content. To efficiently serve this trend, several serverside and cache-side fragme...
Daniel Brodie, Amrish Gupta, Weisong Shi
VLDB
2007
ACM
134views Database» more  VLDB 2007»
15 years 3 months ago
Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach
Structured community portals extract and integrate information from raw Web pages to present a unified view of entities and relationships in the community. In this paper we argue...
Pedro DeRose, Warren Shen, Fei Chen 0002, AnHai Do...