Sciweavers

ICDIM
2009
IEEE

Text classification based on limited bibliographic metadata

13 years 2 months ago
Text classification based on limited bibliographic metadata
In this paper, we introduce a method for categorizing digital items according to their topic, only relying on the document's metadata, such as author name and title information. The proposed approach is based on a set of lexical resources constructed for our purposes (e.g., journal titles, conference names) and on a traditional machine-learning classifier that assigns one category to each document based on identified core features. The system is evaluated on a real-world data set and the influence of different feature combinations and settings is studied. Although the available information is limited, the results show that the approach is capable to efficiently classify data items representing documents.
Kerstin Denecke, Thomas Risse, Thomas Baehr
Added 18 Feb 2011
Updated 18 Feb 2011
Type Journal
Year 2009
Where ICDIM
Authors Kerstin Denecke, Thomas Risse, Thomas Baehr
Comments (0)