Sciweavers

8479 search results - page 73 / 1696
» Data Extraction from Web Data Sources
Sort
View
101
Voted
ILP
2007
Springer
15 years 4 months ago
Using ILP to Construct Features for Information Extraction from Semi-structured Text
Machine-generated documents containing semi-structured text are rapidly forming the bulk of data being stored in an organisation. Given a feature-based representation of such data,...
Ganesh Ramakrishnan, Sachindra Joshi, Sreeram Bala...
WWW
2006
ACM
15 years 11 months ago
Semantic web integration of cultural heritage sources
In this paper, we describe research into the use of ontologies to integrate access to cultural heritage and photographic archives. The use of the CIDOC CRM and CRM Core ontologies...
Patrick A. S. Sinclair, Paul H. Lewis, Kirk Martin...
73
Voted
IAT
2006
IEEE
15 years 4 months ago
Semantic Labeling of Data by Using the Web
The Web consists of a large amount of unstructured information that hardly can be elaborated by automatic agents. In recent years, a considerable number of techniques for informat...
Leonardo Rigutini, Ernesto Di Iorio, Marco Ernande...
82
Voted
SIGKDD
2010
111views more  SIGKDD 2010»
14 years 5 months ago
Unexpected results in automatic list extraction on the web
The discovery and extraction of general lists on the Web continues to be an important problem facing the Web mining community. There have been numerous studies that claim to autom...
Tim Weninger, Fabio Fumarola, Rick Barber, Jiawei ...
104
Voted
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
15 years 2 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant