In this paper we study the effectiveness of using a phrase-based representation in e-mail classification, and the effect this approach has on a number of machine learning algorithm...
In this note, we present results concerning the theory and practice of determining for a given document which of several categories it best fits. We describe a mathematical model ...
One of the central challenges in sentiment-based text categorization is that not every portion of a document is equally informative for inferring the overall sentiment of the docum...
We address here the need to assist users in rapidly accessing the most important or strategic information in the text corpus by identifying sentences carrying specific information...
We present a methodology for using a top-level ontology to create a domain ontology from existing scientific texts by (1) identifying informal definitions of domain-s...