In this paper we study the problem of classifying chemical compound datasets. We present a sub-structure-based classification algorithm that decouples the sub-structure discovery...
Mukund Deshpande, Michihiro Kuramochi, George Kary...
High dimensional data visualization is critical to data analysts since it gives a direct view of original data. We present a method to visualize large amount of high dimensional d...
We experimentally evaluate randomization-based approaches to creating an ensemble of decision-tree classifiers. Unlike methods related to boosting, all of the eight approaches co...
Lawrence O. Hall, Kevin W. Bowyer, Robert E. Banfi...
We note that a set of statistically “unusual” proteinprofile pairs in experimentally determined database of protein-protein interactions can typify protein-protein interaction...
Byung-Hoon Park, George Ostrouchov, Gong-Xin Yu, A...
There is a ever-growing need to add structure in the form of semantic markup to the huge amounts of unstructured text data now available. We present the technique of shallow seman...
Sameer Pradhan, Kadri Hacioglu, Wayne Ward, James ...