In this paper we propose a new criterion, based on Minimum Description Length (MDL), to estimate an optimal number of clusters. This criterion, called Kernel MDL (KMDL), is particu...
Ivan O. Kyrgyzov, Olexiy O. Kyrgyzov, Henri Ma&ici...
Combining machine learning models is a means of improving overall accuracy.Various algorithms have been proposed to create aggregate models from other models, and two popular examp...
In this paper, we investigate the topic of gender identification for short length, multi-genre, content-free e-mails. We introduce for the first time (to our knowledge), psycholing...
Na Cheng, Xiaoling Chen, R. Chandramouli, K. P. Su...
Many machine learning algorithms can be formulated in the framework of statistical independence such as the Hilbert Schmidt Independence Criterion. In this paper, we extend this c...
Xinhua Zhang, Le Song, Arthur Gretton, Alex J. Smo...
- The rapid growth in the amount of molecular genetic data being collected will, in many cases, require the development of new analytic methods for the analysis of that data. In th...