The diagnosis of cancer type based on microarray data offers hope that cancer classification can be highly accurate, allowing clinicians to choose the most appropriate forms of treatmen...
Myungsook Klassen, Matt Cummings, Griselda Saldana
ABSTRACT. In the framework of the LegDoc project at Xerox Research Centre Europe, we are developing components for the semantic annotation of semi-structured documents. While certa...
We have recently converted a dependency treebank, consisting of ancient Greek and Latin texts, from one annotation scheme to another that was independently designed. This paper ma...
We study dimensionality reduction, or feature selection, in the text document categorization problem. We focus on the first step in building text categorization systems, that is the cho...
Incomplete data is a common drawback that machine learning techniques need to deal with when solving real-life classification tasks. One of the most popular procedures for solving ...