Sciweavers

12 search results - page 2 / 3
» Intelligent Content Based Title and Author Name Extraction f...
Sort
View
KAIS
2006
102views more  KAIS 2006»
13 years 5 months ago
Visual information extraction
Typographic and visual information is an integral part of textual documents. Most information extraction systems ignore most of this visual information, processing the text as a l...
Yonatan Aumann, Ronen Feldman, Yair Liberzon, Biny...
SEMCO
2007
IEEE
13 years 11 months ago
Intelligent Parsing of Scanned Volumes for Web Based Archives
The proliferation of digital libraries and the large amount of existing documents raise important issues in efficient handling of documents. Printed texts in documents need to be...
Xiaonan Lu, James Ze Wang, C. Lee Giles
CIKM
2005
Springer
13 years 10 months ago
Learning to summarise XML documents using content and structure
Documents formatted in eXtensible Markup Language (XML) are becoming increasingly available in collections of various document types. In this paper, we present an approach for the...
Massih-Reza Amini, Anastasios Tombros, Nicolas Usu...
ICDE
2010
IEEE
251views Database» more  ICDE 2010»
14 years 5 months ago
Viewing a World of Annotations through AnnoVIP
The proliferation of electronic content has notably lead to the apparition of large corpora of interrelated structured documents (such as HTML and XML Web pages) and semantic annot...
Konstantinos Karanasos, Spyros Zoupanos
BTW
2009
Springer
145views Database» more  BTW 2009»
13 years 12 months ago
Retrieving Metadata for Your Local Scholarly Papers
: We present a novel approach to retrieve metadata to scholarly papers stored locally as PDF files. A fingerprint is produced from the PDF fulltext to query an online metadata repo...
David Aumüller