Sciweavers

244 search results - page 19 / 49
» From HTML documents to web tables and rules
DL
2000
Springer
15 years 4 months ago
Re-engineering structures from Web documents
To realise a wide range of applications (including digital libraries) on the Web, a more structured way of accessing the Web is required and such requirement can be facilitated by...
Chuang-Hue Moh, Ee-Peng Lim, Wee Keong Ng
CIS
2004
Springer
15 years 5 months ago
A Method of Acquiring Ontology Information from Web Documents
Abstract. Ontology plays an important role on the Semantic Web. In this paper, we propose a method, AOIWD, of acquiring ontology information from Web documents. The AOIWD method em...
Lixin Han, Guihai Chen, Li Xie
SIGMOD
2009
ACM
15 years 6 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
INTERNET
2007
14 years 11 months ago
Analysis of Caching and Replication Strategies for Web Applications
Replication and caching mechanisms are often employed to enhance the performance of Web applications. In this article, we present a qualitative and quantitative analysis of state-...
Swaminathan Sivasubramanian, Guillaume Pierre, Maa...
WWW
2006
ACM
16 years 14 days ago
Robust web content extraction
We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...
Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...