Search Sciweavers | Sciweavers

244 search results - page 19 / 49

» From HTML documents to web tables and rules

169

click to vote

DL
2000
Springer

156views Digital Library» more DL 2000»

Re-engineering structures from Web documents

15 years 10 months ago

Download ir.iit.edu

To realise a wide range of applications (including digital libraries) on the Web, a more structured way of accessing the Web is required and such requirement can be facilitated by...

Chuang-Hue Moh, Ee-Peng Lim, Wee Keong Ng

claim paper

Read More »

142

click to vote

CIS
2004
Springer

101views Applied Computing» more CIS 2004»

A Method of Acquiring Ontology Information from Web Documents

15 years 11 months ago

Download cs.nju.edu.cn

Abstract. Ontology plays an important role on the Semantic Web. In this paper, we propose a method, AOIWD, of acquiring ontology information from Web documents. The AOIWD method em...

Lixin Han, Guihai Chen, Li Xie

claim paper

Read More »

193

click to vote

SIGMOD
2009
ACM

140views Database» more SIGMOD 2009»

Robust web extraction: an approach based on a probabilistic tree-edit model

16 years 22 days ago

Download www-rcf.usc.edu

On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to eﬀectively extract information of interest. Of course, the scripts and thus ...

Nilesh N. Dalvi, Philip Bohannon, Fei Sha

claim paper

Read More »

183

click to vote

INTERNET
2007

182views more INTERNET 2007»

Analysis of Caching and Replication Strategies for Web Applications

15 years 5 months ago

Download www.globule.org

Replication and caching mechanisms are often employed to enhance the performance of Web applications. In this article, we present a qualitative and quantitative analysis of state-...

Swaminathan Sivasubramanian, Guillaume Pierre, Maa...

claim paper

Read More »

113

click to vote

WWW
2006
ACM

69views Internet Technology» more WWW 2006»

Robust web content extraction

16 years 6 months ago

Download www2006.org

We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...

Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...

claim paper

Read More »

« Prev « First page 19 / 49 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers