Search Sciweavers | Sciweavers

36 search results - page 1 / 8

» Web-scale knowledge extraction from semi-structured tables

click to vote

IJSI
2008

115views more IJSI 2008»

Towards Knowledge Acquisition from Semi-Structured Content

13 years 4 months ago

Download www.ijsi.org

Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

claim paper

Read More »

click to vote

WEBI
2004
Springer

91views Internet Technology» more WEBI 2004»

Semi-Structured Complex List Extraction

13 years 10 months ago

Download www2.cs.uregina.ca

The semi-structured information available in HTML and similar documents provide valuable information that can be used for information extraction applications. This information tog...

Anders Arpteg

claim paper

Read More »

click to vote

KDD
1998
ACM

159views Data Mining» more KDD 1998»

A Robust System Architecture for Mining Semi-Structured Data

13 years 8 months ago

Download www.aaai.org

The value of extracting knowledge from semi-structured data is readily apparent with the explosion of the WWW and the advent of digital libraries. This paper proposes a versatile ...

Lisa Singh, Bin Chen, Rebecca Haight, Peter Scheue...

claim paper

Read More »

click to vote

WSDM
2012
ACM

252views Data Mining» more WSDM 2012»

WebSets: extracting sets of entities from the web using unsupervised information extraction

12 years 3 days ago

Download www.cs.cmu.edu

We describe a open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...

Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...

claim paper

Read More »

click to vote

WWW
2010
ACM

193views Internet Technology» more WWW 2010»

Web-scale knowledge extraction from semi-structured tables

13 years 9 months ago

Download www.patrickpantel.com

A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...

Eric Crestan, Patrick Pantel

claim paper

Read More »

« Prev « First page 1 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers