Sciweavers

» Jedi: Extracting and Synthesizing Information from the Web
HT
2009
ACM
14 years 7 months ago
Retrieving broken web links using an approach based on contextual information
In this short note we present a recommendation system for automatic retrieval of broken Web links using an approach based on contextual information. We extract information from th...
Juan Martinez-Romo, Lourdes Araujo
ADC
2006
Springer
15 years 3 months ago
A two-phase rule generation and optimization approach for wrapper generation
Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...
Yanan Hao, Yanchun Zhang
AAAI
2008
15 years 2 days ago
Automatic Extraction of Data Points and Text Blocks from 2-Dimensional Plots in Digital Documents
Two-dimensional (2-D) plots in digital documents on the web are an important source of information that is largely under-utilized. In this paper, we outline how data and text can ...
Saurabh Kataria, William Brouwer, Prasenjit Mitra,...
WWW
2004
ACM
15 years 10 months ago
Automatic extraction of web search interfaces for interface schema integration
This paper provides an overview of a technique for extracting information from the Web search interfaces of e-commerce search engines that is useful for supporting automatic searc...
Hai He, Weiyi Meng, Clement T. Yu, Zonghuan Wu
ICDM
2006
IEEE
15 years 3 months ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table, a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang