Search Sciweavers | Sciweavers

127 search results - page 17 / 26

» Rule-Based Structural Analysis of Web Pages

CIKM
2005
Springer

134 views, Information Technology

Versatile structural disambiguation for semantic-aware applications

15 years 11 months ago

Download www.isgroup.unimo.it

In this paper, we propose a versatile disambiguation approach which can be used to make explicit the meaning of structure based information such as XML schemas, XML document struc...

Federica Mandreoli, Riccardo Martoglia, Enrico Ron...

ICDAR
2009
IEEE

148 views, Document Analysis

User-Guided Wrapping of PDF Documents Using Graph Matching Techniques

16 years 23 days ago

Download www.cvc.uab.es

There are a number of established products on the market for wrapping—semi-automatic navigation and extraction of data—from web pages. These solutions make use of the inherent...

Tamir Hassan

WWW
2009
ACM

189 views, Internet Technology

Extracting data records from the web using tag path clustering

15 years 10 months ago

Download www2009.org

Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...

Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...

TREC
2004

127 views, Information Technology

Language Models for Searching in Web Corpora

15 years 7 months ago

Download trec.nist.gov

We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...

Jaap Kamps, Gilad Mishne, Maarten de Rijke

CSUR
1999

159 views

Hubs, authorities, and communities

15 years 5 months ago

Download www.csee.umbc.edu

The Web can be naturally modeled as a directed graph, consisting of a set of abstract nodes (the pages) joined by directional edges (the hyperlinks). Hyperlinks encode a considerab...

Jon M. Kleinberg
