A central problem in information retrieval is the automated classification of text documents. While many existing methods achieve good levels of performance, they generally require...
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing web page classification. This approach is magnitudes faster than typical web page classific...
The use of the computing with words paradigm for the automatic text documents categorization problem is discussed. This specific problem of information retrieval (IR) becomes more...
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...
Music consists of both local and long-term temporal information. However, for a genre classification task, most of the text categorization based approaches only capture local temp...