Sciweavers

293 search results - page 1 / 59
» Bootstrapping Information Extraction from Field Books
Sort
View
EMNLP
2007
13 years 6 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder
AAAI
2007
13 years 7 months ago
Turning Lectures into Comic Books Using Linguistically Salient Gestures
Creating video recordings of events such as lectures or meetings is increasingly inexpensive and easy. However, reviewing the content of such video may be time-consuming and dif...
Jacob Eisenstein, Regina Barzilay, Randall Davis
IJCAI
2003
13 years 6 months ago
Integrating Information to Bootstrap Information Extraction from Web Sites
In this paper we propose a methodology to learn to extract domain-specific information from large repositories (e.g. the Web) with minimum user intervention. Learning is seeded b...
Fabio Ciravegna, Alexiei Dingli, David Guthrie, Yo...
EMNLP
2009
13 years 2 months ago
Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment
Traditionally, machine learning approaches for information extraction require human annotated data that can be costly and time-consuming to produce. However, in many cases, there ...
Kedar Bellare, Andrew McCallum
IJDAR
2011
114views more  IJDAR 2011»
12 years 11 months ago
Setting up a competition framework for the evaluation of structure extraction from OCR-ed books
Abstract. This paper describes the setup of the Book Structure Extraction competition run at ICDAR 2009. The goal of the competition was to evaluate and compare automatic technique...
Antoine Doucet, Gabriella Kazai, Bodin Dresevic, A...