Sciweavers

IPM
2002

A feature mining based approach for the classification of text documents into disjoint classes

13 years 4 months ago
A feature mining based approach for the classification of text documents into disjoint classes
This paper proposes a new approach for classifying text documents into two disjoint classes. The new approach is based on extracting patterns, in the form of two logical expressions, which are defined on various features (indexing terms) of the documents. The pattern extraction is aimed at providing descriptions (in the form of two logical expressions) of the two classes of positive and negative examples. This is achieved by means of a data mining approach, called One Clause At a Time (OCAT), which is based on mathematical logic. The application of a logic-based approach to text document classification is critical when one wishes to be able to justify why a particular document has been assigned to one class versus the other class. This situation occurs, for instance, in declassifying documents that have been previously considered important to national security and thus are currently being kept as secret. Some computational experiments have investigated the effectiveness of the OCAT-ba...
Salvador Nieto Sánchez, Evangelos Triantaph
Added 22 Dec 2010
Updated 22 Dec 2010
Type Journal
Year 2002
Where IPM
Authors Salvador Nieto Sánchez, Evangelos Triantaphyllou, Donald H. Kraft
Comments (0)