Sciweavers

» Deep web data extraction
JAIR
2008
13 years 6 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock
ICDE
2006
IEEE
14 years 7 months ago
WebIQ: Learning from the Web to Match Deep-Web Query Interfaces
Integrating Deep Web sources requires highly accurate semantic matches between the attributes of the source query interfaces. These matches are usually established by comparing th...
Wensheng Wu, AnHai Doan, Clement T. Yu
VLDB
2007
ACM
14 years 6 months ago
Context-Aware Wrapping: Synchronized Data Extraction
The deep Web presents a pressing need for integrating large numbers of dynamically evolving data sources. To be more automatic yet accurate in building an integration system, we o...
Shui-Lung Chuang, Kevin Chen-Chuan Chang, ChengXia...
SAC
2010
ACM
13 years 4 months ago
Host-IP clustering technique for deep web characterization
A huge portion of today's Web consists of web pages filled with information from myriads of online databases. This part of the Web, known as the deep Web, is to date relatively ...
Denis Shestakov, Tapio Salakoski
WWW
2005
ACM
14 years 6 months ago
Exploiting the deep web with DynaBot: matching, probing, and ranking
We present the design of Dynabot, a guided Deep Web discovery system. Dynabot's modular architecture supports focused crawling of the Deep Web with an emphasis on matching, p...
Daniel Rocco, James Caverlee, Ling Liu, Terence Cr...