Sciweavers

2677 search results - page 158 / 536
» Extracting Structured Data from Web Pages
FTDB
2008
15 years 3 months ago
Information Extraction
The automatic extraction of information from unstructured sources has opened up new avenues for querying, organizing, and analyzing data by drawing upon the clean semantics of str...
Sunita Sarawagi
CIKM
2011
Springer
14 years 3 months ago
Integrating and querying web databases and documents
There exist many interrelated information sources on the Internet that can be categorized into structured (database) and semistructured (documents). A key challenge is to integrat...
Carlos Garcia-Alvarado, Carlos Ordonez
CIKM
2009
Springer
15 years 10 months ago
Easiest-first search: towards comprehension-based web search
Although Web search engines have become information gateways to the Internet, for queries containing technical terms, search results often contain pages that are difficult to be ...
Makoto Nakatani, Adam Jatowt, Katsumi Tanaka
ICDAR
2003
IEEE
15 years 9 months ago
Document Transformation System from Papers to XML Data Based on Pivot XML Document Method
This paper proposes a new method for document transformation using OCR to generate various XML documents from printed documents. The proposed method adopts a hierarchical transfor...
Yasuto Ishitani
IAJIT
2010
15 years 2 months ago
Deriving Conceptual Schema from Domain Ontology: A Web Application Reverse Engineering Approach
The heterogeneous and dynamic nature of components making up a web application, the lack of effective programming mechanisms for implementing basic software engineering principle...
Sidi Mohamed Benslimane, Mimoun Malki, Djelloul Bo...