Sciweavers

JCDL
2003
ACM
160views Education» more  JCDL 2003»
13 years 10 months ago
Automatic Document Metadata Extraction Using Support Vector Machines
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadat...
Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zh...
GISCIENCE
2004
Springer
159views GIS» more  GISCIENCE 2004»
13 years 10 months ago
The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing
Abstract. The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components inc...
Christopher B. Jones, Alia I. Abdelmoty, David Fin...
ICADL
2007
Springer
112views Education» more  ICADL 2007»
13 years 11 months ago
Automated Template-Based Metadata Extraction Architecture
This paper describes our efforts to develop a toolset and process for automated metadata extraction from large, diverse, and evolving document collections. A number of federal agen...
Paul Flynn, Li Zhou, Kurt Maly, Steven J. Zeil, Mo...
QSIC
2007
IEEE
13 years 11 months ago
A Scriptable, Statistical Oracle for a Metadata Extraction System
An oracle is described for dynamic validation of an application (metadata extraction from scanned documents) where a moderate failure rate is acceptable provided that instances of...
Kurt Maly, Steven J. Zeil, Mohammad Zubair, Ashraf...
ICDAR
2009
IEEE
13 years 11 months ago
Metadata Extraction from PDF Papers for Digital Library Ingest
In this paper we analyze our recent research on the use of document analysis techniques for metadata extraction from PDF papers. We describe a package that is designed to extract ...
Simone Marinai