An Intelligent Text Extraction and Navigation System

9 years 11 months ago
An Intelligent Text Extraction and Navigation System
We present sppc, a high-performance system for intelligent text extraction and navigation from German free text documents. sppc consists of a set of domainindependent shallow core components which are realized by means of cascaded weighted finite state machines and generic dynamic tries. All extracted information is represented uniformly in one data structure (called the text chart) in a highly compact and linked form in order to support indexing and navigation through the set of solutions. German text processing includes (among others) compound processing, high performance named entity recognition and chunk parsing based on a divide-and-conquer strategy. sppc has a good performance (4380 words per second on standard PC environments) and high linguistic coverage.
Jakub Piskorski, Günter Neumann
Added 01 Nov 2010
Updated 01 Nov 2010
Type Conference
Year 2000
Where RIAO
Authors Jakub Piskorski, Günter Neumann
Comments (0)