Sciweavers

2714 search results - page 94 / 543
» Machine Learning for Information Extraction
Sort
View
158
Voted
AINA
2009
IEEE
16 years 29 days ago
Learning to Extract Content from News Webpages
We consider the problem of content extraction from online news webpages. To explore to what extent the syntactic markup and the visual structure of a webpage facilitate the extrac...
Alex Spengler, Patrick Gallinari
158
Voted
KDD
2004
ACM
163views Data Mining» more  KDD 2004»
16 years 6 months ago
Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods
We consider the problem of improving named entity recognition (NER) systems by using external dictionaries--more specifically, the problem of extending state-of-the-art NER system...
William W. Cohen, Sunita Sarawagi
ICPR
2010
IEEE
15 years 4 months ago
Learning Image Anchor Templates for Document Classification and Data Extraction
Image anchor templates are used in document image analysis for document classification, data localization, and other tasks. Current tools allow human operators to mark out small s...
Prateek Sarkar
175
Voted
ICML
2000
IEEE
16 years 7 months ago
Less is More: Active Learning with Support Vector Machines
We describe a simple active learning heuristic which greatly enhances the generalization behavior of support vector machines (SVMs) on several practical document classification ta...
Greg Schohn, David Cohn
155
Voted
ISNN
2007
Springer
16 years 8 days ago
Online Dynamic Value System for Machine Learning
A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...
Haibo He, Janusz A. Starzyk