The performance of document clustering systems depends on employing optimal text representations, which are not only difficult to determine beforehand but may also vary from one ...
In this paper, we introduce a method for categorizing digital items according to their topic, relying only on the document's metadata, such as author name and title informati...
Text classification in the medical domain is a real-world problem with wide applicability. This paper extensively investigates the effect of text representation approaches on the p...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Combining advantages of shape and appearance features, we propose a novel model that integrates these two complementary features into a common framework for object categorization ...
Hong Pan, Yaping Zhu, Liang-Zheng Xia, Truong Q. N...