Sciweavers

85 search results - page 4 / 17
» Extracting unstructured data from template generated web doc...
Sort
View
83
Voted
DKE
2007
132views more  DKE 2007»
14 years 9 months ago
Automated ontology construction for unstructured text documents
Ontology is playing an increasingly important role in knowledge management and the Semantic Web. This study presents a novel episode-based ontology construction mechanism to extra...
Chang-Shing Lee, Yuan-Fang Kao, Yau-Hwang Kuo, Mei...
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
15 years 3 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
87
Voted
KDD
2007
ACM
155views Data Mining» more  KDD 2007»
15 years 10 months ago
Mining templates from search result records of search engines
Metasearch engine, Comparison-shopping and Deep Web crawling applications need to extract search result records enwrapped in result pages returned from search engines in response ...
Hongkun Zhao, Weiyi Meng, Clement T. Yu
WEBDB
1999
Springer
131views Database» more  WEBDB 1999»
15 years 1 months ago
Adapter Generation for Extracting and Querying Data from Web
Accessing and integrating data from heterogeneous sources has become a significant challenge. So-called adapters provide the functionality for translating SQL queries into querie...
Kai-Uwe Sattler, Michael Höding
CIKM
2007
Springer
15 years 3 months ago
The role of documents vs. queries in extracting class attributes from text
Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources o...
Marius Pasca, Benjamin Van Durme, Nikesh Garera