Sciweavers

584 search results - page 19 / 117
» Pattern Matching In The Textract Information Extraction Syst...
Sort
View
WISE
2005
Springer
15 years 3 months ago
NET - A System for Extracting Web Data from Flat and Nested Data Records
This paper studies automatic extraction of structured data from Web pages. Each of such pages may contain several groups of structured data records. Existing automatic methods stil...
Bing Liu, Yanhong Zhai
WWW
2006
ACM
15 years 10 months ago
GoGetIt!: a tool for generating structure-driven web crawlers
We present GoGetIt!, a tool for generating structure-driven crawlers that requires a minimum effort from the users. The tool takes as input a sample page and an entry point to a W...
Altigran Soares da Silva, Edleno Silva de Moura, J...
ICPR
2010
IEEE
14 years 7 months ago
Learning Image Anchor Templates for Document Classification and Data Extraction
Image anchor templates are used in document image analysis for document classification, data localization, and other tasks. Current tools allow human operators to mark out small s...
Prateek Sarkar
SSPR
2004
Springer
15 years 3 months ago
Tracking the Evolution of a Tennis Match Using Hidden Markov Models
The creation of a cognitive perception systems capable of inferring higher-level semantic information from low-level feature and event information for a given type of multimedia co...
Ilias Kolonias, William J. Christmas, Josef Kittle...
160
Voted
ICDE
2008
IEEE
171views Database» more  ICDE 2008»
15 years 11 months ago
Usage-Based Schema Matching
Existing techniques for schema matching are classified as either schema-based, instance-based, or a combination of both. In this paper, we define a new class of techniques, called ...
Hazem Elmeleegy, Mourad Ouzzani, Ahmed K. Elmagarm...