Search Sciweavers | Sciweavers

152 search results - page 1 / 31

WEBDB
2010
Springer

Redundancy-Driven Web Data Extraction and Integration

13 years 9 months ago

Download www.dia.uniroma3.it

A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although s...

Paolo Papotti, Valter Crescenzi, Paolo Merialdo, M...

LREC
2010

Entity Mention Detection using a Combination of Redundancy-Driven Classifiers

13 years 6 months ago

Download www.lrec-conf.org

We present an experimental framework for Entity Mention Detection in which two different classifiers are combined to exploit Data Redundancy attained through the annotation of a l...

Silvana Marianela Bernaola Biggio, Manuela Speranz...

WWW
2011
ACM

HyLiEn: a hybrid approach to general list extraction on the web

12 years 11 months ago

Download www.cs.uiuc.edu

We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...

Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...

PVLDB
2010

ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data

13 years 2 months ago

Download www.comp.nus.edu.sg

We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTM...

Talel Abdessalem, Bogdan Cautis, Nora Derouiche

JMLR
2008

Dynamic Hierarchical Markov Random Fields for Integrated Web Data Extraction

13 years 4 months ago

Download jmlr.csail.mit.edu

Existing template-independent web data extraction approaches adopt highly ineffective decoupled strategies--attempting to do data record detection and attribute labeling in two se...

Jun Zhu, Zaiqing Nie, Bo Zhang, Ji-Rong Wen
