A Personal Web Information/Knowledge Retrieval System

11 years 1 months ago
A Personal Web Information/Knowledge Retrieval System
The Web is the richest source of information and knowledge. Unfortunately the current structure of Web pages makes it difficult for users to retrieve the information or knowledge in a systematic way. In this paper, using the tree approach, we propose a personal Web information/knowledge retrieval system for the extraction of structured parts from Web pages. First we get the layout pattern and paths of extraction parts of a typical Web page in target sites. Then we use the recorded layout pattern and paths to extract the structured parts from the rest of Web pages in target sites. We show the usefulness of our approach using the results of extracting structured parts of notable Web pages.
Hao Han, Takehiro Tokuda
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2007
Where EJC
Authors Hao Han, Takehiro Tokuda
Comments (0)