Functional-Based Table Category Identification in Digital Library

12 years 7 months ago
Functional-Based Table Category Identification in Digital Library
– Better understanding the document logical components is crucial to many applications, e.g., document classification or data integration. As the development of digital libraries, more people realize the importance of the scientific tables, which contain valuable information concisely. Although tons of previous table works focus on table data extraction, few concrete works on understanding and utilizing the extracted table data exist. Based on a large-scaled quantitative study on scientific papers, we believe that identifying the original purpose of the table authors can improve the table data comprehension and facilitate the table data reusability. In this paper, scientific document tables are classified into three topical categories: background, system/method, and experimental, and two functional categories: commentary and comparison. We apply machine learning based methods to implement the table classification task. Our results demonstrate that the proposed features are effective ...
Seongchan Kim, Ying Liu
Added 24 Dec 2011
Updated 24 Dec 2011
Type Journal
Year 2011
Authors Seongchan Kim, Ying Liu
Comments (0)