Sciweavers

CASCON
2006

Exploring a new space of features for document classification: figure clustering

Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement each other. Typically, only the text content forms the basis for features that are used in document classification. In this paper, we explore the use of information from figure images to assist in this task. We explore image clustering as a basis for constructing visual words for representing documents. Once such visual words are formed, the standard bag-of-words representation along with commonly used classifiers, such as the na
Nawei Chen, Hagit Shatkay, Dorothea Blostein
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where CASCON
Authors Nawei Chen, Hagit Shatkay, Dorothea Blostein
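
The idea in the abstract (cluster figure images, treat each cluster as a "visual word", then represent a document as a bag of those words) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the listing does not say which image features or clustering algorithm the authors use, so the 2-D feature vectors are made up, plain k-means stands in for the clustering step, and the names `kmeans`, `nearest`, and `visual_word_counts` are hypothetical.

```python
from collections import Counter
import math


def nearest(point, centroids):
    """Return the index of the centroid closest to `point` (Euclidean)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))


def kmeans(points, k, iters=20):
    """Tiny k-means (assumed stand-in for the paper's clustering step).

    Centroids start at the first k points, so the result is deterministic.
    """
    centroids = [list(p) for p in points[:k]]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[i] = [sum(axis) / len(members) for axis in zip(*members)]
    return centroids


def visual_word_counts(figures, centroids):
    """Bag-of-visual-words vector: how many figures fall in each cluster."""
    counts = Counter(nearest(f, centroids) for f in figures)
    return [counts.get(i, 0) for i in range(len(centroids))]


# Hypothetical data: each document is a list of figure feature vectors.
docs = {
    "doc_a": [(0.0, 0.1), (5.0, 5.1), (0.1, 0.0)],
    "doc_b": [(5.1, 5.0), (4.9, 5.0)],
}

# Build the visual vocabulary from all figures, then encode each document.
all_figures = [f for figures in docs.values() for f in figures]
centroids = kmeans(all_figures, k=2)
histograms = {name: visual_word_counts(figs, centroids) for name, figs in docs.items()}
```

Each histogram in `histograms` has one entry per visual word and could be concatenated with a text bag-of-words vector before being passed to a standard classifier.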