The main problems in text classification are the lack of labeled data and the cost of labeling unlabeled data. We address these problems by exploring co-training - an algo...
We consider the problem of deriving class-size independent generalization bounds for some regularized discriminative multi-category classification methods. In particular, we obtai...
We propose the framework of mutual information kernels for learning covariance kernels, as used in support vector machines and Gaussian process classifiers, from unlabeled task da...
In this report, we describe our question-answering system SAIQA-e (System for Advanced Interactive Question Answering in English), which ran the main task of TREC-10's QA-trac...
Motivation: Protein subcellular localization is crucial for genome annotation, protein function prediction, and drug discovery. However, since determining subcellular localization...
Emily Chia-Yu Su, Hua-Sheng Chiu, Allan Lo, Jenn-K...