We initiate the study of sparse recovery problems under the Earth-Mover Distance (EMD). Specifically, we design a distribution over m × n matrices A such that for any x, given A...
Abstract. In this paper we propose a clustering algorithm called sCluster for analysis of gene expression data based on pattern-similarity. The algorithm captures the tight cluster...
Xiangsheng Chen, Jiuyong Li, Grant Daggard, Xiaodi...
Previous methods usually conduct the keyphrase extraction task for single documents separately without interactions for each document, under the assumption that the documents are ...
Correlation clustering aims at grouping the data set into correlation clusters such that the objects in the same cluster exhibit a certain density and are all associated to a comm...
Numerous domains ranging from distributed data acquisition to knowledge reuse need to solve the cluster ensemble problem of combining multiple clusterings into a single unified cl...