Sciweavers

640 search results - page 17 / 128
» Wrapper Generation for Web Accessible Data Sources
DEBU
2000
95 views
14 years 9 months ago
Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach
A critical problem in developing information agents for the Web is accessing data that is formatted for human use. We have developed a set of tools for extracting data from web si...
Craig A. Knoblock, Kristina Lerman, Steven Minton,...
DEXA
2009
Springer
151 views, Database
15 years 4 months ago
Automatic Extraction of Ontologies Wrapping Relational Data Sources
Describing relational data sources (i.e. databases) by means of ontologies constitutes the foundation of most of the semantic based approaches to data access and integration. In sp...
Lina Lubyte, Sergio Tessaris
VLDB
2005
ACM
141 views, Database
15 years 3 months ago
Automatic Data Fusion with HumMer
Heterogeneous and dirty data is abundant. It is stored under different, often opaque schemata, it represents identical real-world objects multiple times, causing duplicates, and ...
Alexander Bilke, Jens Bleiholder, Christoph Bö...
DEXAW
2004
IEEE
104 views, Database
15 years 1 months ago
Multilingual and Multimedia Information Retrieval from Web Documents
Web documents present new challenges to conventional Information Retrieval (IR) technologies. This paper describes how these challenges are faced in FameIR, a multilingual multime...
Marta Gatius, Manuel Bertrán, Horacio Rodr...
IJCAI
2003
14 years 11 months ago
Source Update Capture in Information Agents
In this paper we present strategies for successfully capturing updates at Web sources. Web-based information agents provide integrated access to autonomous Web sources that can ge...
Naveen Ashish, Deepak Kulkarni, Yao Wang