Sciweavers

2677 search results - page 328 / 536
» Extracting Structured Data from Web Pages
Sort
View
ACL
2010
14 years 11 months ago
Learning 5000 Relational Extractors
Many researchers are trying to use information extraction (IE) to create large-scale knowledge bases from natural language text on the Web. However, the primary approach (supervis...
Raphael Hoffmann, Congle Zhang, Daniel S. Weld
WEBDB
2000
Springer
120views Database» more  WEBDB 2000»
15 years 5 months ago
Quilt: An XML Query Language for Heterogeneous Data Sources
The World Wide Web promises to transform human society by making virtually all types of information instantly available everywhere. Two prerequisites for this promise to be realiz...
Donald D. Chamberlin, Jonathan Robie, Daniela Flor...
CAISE
2005
Springer
15 years 7 months ago
Integrating Unnormalised Semi-structured Data Sources
From Proc. CAiSE05 LNCS 3520, Pages 460-474 c Springer-Verlag 2005 Semi-structured data sources, such as XML, HTML or CSV files, present special problems when performing data int...
Sasivimol Kittivoravitkul, Peter McBrien
WWW
2006
ACM
16 years 2 months ago
Compressing and searching XML data via two zips
XML is fast becoming the standard format to store, exchange and publish over the web, and is getting embedded in applications. Two challenges in handling XML are its size (the XML...
Paolo Ferragina, Fabrizio Luccio, Giovanni Manzini...
LREC
2008
110views Education» more  LREC 2008»
15 years 2 months ago
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence
Acquiring knowledge from the Web to build domain ontologies has become a common practice in the Ontological Engineering field. The vast amount of freely available information allo...
David Manzano-Macho, Asunción Gómez-...