WEBI
2004
Springer

Semi-Structured Complex List Extraction

Download www2.cs.uregina.ca

The semi-structured information available in HTML and similar documents is valuable for information extraction applications. Combined with technical information about how to retrieve pages, it can be used to automatically extract pieces of information and various types of lists. The goal is to put as much intelligence as possible into the system so that as little knowledge and work as possible is required of the users, i.e. a user-driven extraction system. The advantage of a user-driven system is that its service is available not only to experts but also to ordinary users, and thereby to a wide audience. A problem with some lists in documents is that the structure differs between the elements of the list, which makes it more difficult to take advantage of the semi-structured information. The agent-oriented system described in this paper allows a user without expert skills to train an ex...

Anders Arpteg

Complex Lists | Information Extraction Applications | Internet Technology | User-driven Extraction System | WEBI 2004

Related Content

» Topdown Extraction of SemiStructured Data

» Combining the web content and usage mining to understand the visitor behavior in a web sit...

» Genes2Networks connecting lists of gene symbols using mammalian protein interactions datab...

» Improving the power for detecting overlapping genes from multiple DNA microarrayderived ge...

» Optimizing complex extraction programs over evolving text data

» Identifying overrepresented concepts in gene lists from literature a statistical approach ...

» A twophase rule generation and optimization approach for wrapper generation

» An Intelligent Process Monitoring System in Complex Manufacturing Environment

» Using Formal Tools to Study Complex Circuits Behaviour

Post Info

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	WEBI
Authors	Anders Arpteg
