Sciweavers

» Extracting Route Directions from Web Pages
ADC
2006
Springer
A two-phase rule generation and optimization approach for wrapper generation
Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...
Yanan Hao, Yanchun Zhang
ICMCS
2005
IEEE
Semantic Knowledge Building for Image Database by Analyzing Web Page Contents
In this paper, we present a method of semantic knowledge building for image database by extracting semantic meanings from Web page contents. The novelty of our method is that it i...
Yung-Kwang Lai, Song Liu, Liang-Tien Chia, Syin Ch...
WIDM
2003
ACM
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites
The advent of e-commerce has created a trend that brought thousands of catalogs online. Most of these websites are “taxonomy-directed”. A Web site is said to be “taxonomydire...
Hasan Davulcu, S. Koduri, Saravanakumar Nagarajan
KDD
2002
ACM
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
WWW
2007
ACM
U-REST: an unsupervised record extraction system
In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...
Yuan Kui Shen, David R. Karger