Search Sciweavers | Sciweavers

263 search results - page 1 / 53

» Re-engineering structures from Web documents

WWW
2011
ACM

316 views, Internet Technology

Identifying primary content from web pages and its application to web search ranking

14 years 10 months ago

Download www.www2011india.com

Web pages are usually highly structured documents. In some documents, content with different functionality is laid out in blocks, some merely supporting the main discourse. In ot...

Srinivas Vadrevu, Emre Velipasaoglu

DL
2000
Springer

156 views, Digital Library

Re-engineering structures from Web documents

15 years 8 months ago

Download ir.iit.edu

To realise a wide range of applications (including digital libraries) on the Web, a more structured way of accessing the Web is required and such requirement can be facilitated by...

Chuang-Hue Moh, Ee-Peng Lim, Wee Keong Ng

PVLDB
2010

135 views

SXPath - Extending XPath towards Spatial Querying on Web Documents

15 years 2 months ago

Download www.vldb.org

Querying data from presentation formats like HTML, for purposes such as information extraction, requires the consideration of tree structures as well as the consideration of spati...

Ermelinda Oro, Massimo Ruffolo, Steffen Staab

DOCENG
2010
ACM

220 views, Document Analysis

From templates to schemas: bridging the gap between free editing and safe data processing

15 years 2 months ago

Download hal.inria.fr

In this paper we present tools that provide an easy way to edit XML content directly on the web, with the usual benefit of valid XML content. These tools make it possible to crea...

Vincent Quint, Cécile Roisin, Stépha...

FLAIRS
2001

131 views, Artificial Intelligence

Syntactic Folding and its Application to the Information Extraction from Web Pages

15 years 5 months ago

Download www.aaai.org

The paper deals with investigations concerning potential structures of documents that will be subject to automated information extraction. The focus is on folding principles and the...

Jörg Herrmann
