Sciweavers

70 search results - page 2 / 14
» Incorporating site-level knowledge to extract structured dat...
Sort
View
PVLDB
2010
114views more  PVLDB 2010»
13 years 3 months ago
ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data
We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTM...
Talel Abdessalem, Bogdan Cautis, Nora Derouiche
JAIR
2010
160views more  JAIR 2010»
13 years 3 months ago
Constructing Reference Sets from Unstructured, Ungrammatical Text
Vast amounts of text on the Web are unstructured and ungrammatical, such as classified ads, auction listings, forum postings, etc. We call such text “posts.” Despite their in...
Matthew Michelson, Craig A. Knoblock
SIGMOD
2008
ACM
159views Database» more  SIGMOD 2008»
14 years 5 months ago
Web-scale extraction of structured data
A long-standing goal of Web research has been to construct a unified Web knowledge base. Information extraction techniques have shown good results on Web inputs, but even most dom...
Michael J. Cafarella, Jayant Madhavan, Alon Y. Hal...
WWW
2010
ACM
13 years 9 months ago
Web-scale knowledge extraction from semi-structured tables
A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...
Eric Crestan, Patrick Pantel
BTW
2005
Springer
125views Database» more  BTW 2005»
13 years 10 months ago
Web Data Extraction for Business Intelligence: The Lixto Approach
: Knowledge about market developments and competitor activities on the market becomes more and more a critical success factor for enterprises. The World Wide Web provides public do...
Georg Gottlob