Sciweavers

Template-Based Information Mining from HTML Documents
RULEML
2004
Springer
13 years 11 months ago
Rule Learning for Feature Values Extraction from HTML Product Information Sheets
The Web is now a huge information repository with a rich semantic structure that, however, is primarily addressed to human understanding rather than automated processing by a compu...
Costin Badica, Amelia Badica
XSYM
2005
Springer
13 years 11 months ago
Logic Wrappers and XSLT Transformations for Tuples Extraction from HTML
Recently it was shown that existing general-purpose inductive logic programming systems are useful for learning wrappers (known as L-wrappers) to extract data from HTML d...
Costin Badica, Amelia Badica
AWIC
2005
Springer
13 years 11 months ago
Tuples Extraction from HTML Using Logic Wrappers and Inductive Logic Programming
This paper presents an approach for applying inductive logic programming to information extraction from HTML documents structured as unranked ordered trees. We consider information...
Costin Badica, Amelia Badica, Elvira Popescu
SYNASC
2006
IEEE
13 years 11 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
TREC
2000
13 years 7 months ago
Information Space Based on HTML Structure
The main goal for the Information Space system for TREC9 was early precision. To facilitate this, an emphasis was placed on seeking matches from only the TITLE, H1, H2 and H3 tags...
Gregory B. Newby