Sciweavers

ICDM
2009
IEEE
172views Data Mining» more  ICDM 2009»
13 years 11 months ago
Sparse Least-Squares Methods in the Parallel Machine Learning (PML) Framework
—We describe parallel methods for solving large-scale, high-dimensional, sparse least-squares problems that arise in machine learning applications such as document classificatio...
Ramesh Natarajan, Vikas Sindhwani, Shirish Tatikon...
ICDM
2009
IEEE
113views Data Mining» more  ICDM 2009»
13 years 11 months ago
Spatiotemporal Relational Random Forests
Abstract—We introduce and validate Spatiotemporal Relational Random Forests, which are random forests created with spatiotemporal relational probability trees. We build on the do...
Timothy A. Supinie, Amy McGovern, John Williams, J...
ICDM
2009
IEEE
163views Data Mining» more  ICDM 2009»
13 years 11 months ago
Kernel Conditional Quantile Estimation via Reduction Revisited
Quantile regression refers to the process of estimating the quantiles of a conditional distribution and has many important applications within econometrics and data mining, among ...
Novi Quadrianto, Kristian Kersting, Mark D. Reid, ...
ICDM
2009
IEEE
168views Data Mining» more  ICDM 2009»
13 years 11 months ago
Bi-relational Network Analysis Using a Fast Random Walk with Restart
—Identification of nodes relevant to a given node in a relational network is a basic problem in network analysis with great practical importance. Most existing network analysis ...
Jing Xia, Doina Caragea, William H. Hsu
ICDM
2009
IEEE
113views Data Mining» more  ICDM 2009»
13 years 11 months ago
Connecting Sparsely Distributed Similar Bloggers
—The nature of the Blogosphere determines that the majority of bloggers are only connected with a small number of fellow bloggers, and similar bloggers can be largely disconnecte...
Nitin Agarwal, Huan Liu, Shankara B. Subramanya, J...
ICDM
2009
IEEE
155views Data Mining» more  ICDM 2009»
13 years 11 months ago
A Contrast Pattern Based Clustering Quality Index for Categorical Data
Since clustering is unsupervised and highly explorative, clustering validation (i.e. assessing the quality of clustering solutions) has been an important and long standing researc...
Qingbao Liu, Guozhu Dong
ICDM
2009
IEEE
125views Data Mining» more  ICDM 2009»
13 years 11 months ago
A Fully Automated Method for Discovering Community Structures in High Dimensional Data
—Identifying modules, or natural communities, in large complex networks is fundamental in many fields, including social sciences, biological sciences and engineering. Recently s...
Jianhua Ruan
ICDM
2009
IEEE
121views Data Mining» more  ICDM 2009»
13 years 11 months ago
Finding Time Series Motifs in Disk-Resident Data
—Time series motifs are sets of very similar subsequences of a long time series. They are of interest in their own right, and are also used as inputs in several higher-level data...
Abdullah Mueen, Eamonn J. Keogh, Nima Bigdely Sham...
ICDM
2009
IEEE
160views Data Mining» more  ICDM 2009»
13 years 11 months ago
Fast Online Training of Ramp Loss Support Vector Machines
—A fast online algorithm OnlineSVMR for training Ramp-Loss Support Vector Machines (SVMR s) is proposed. It finds the optimal SVMR for t+1 training examples using SVMR built on t...
Zhuang Wang, Slobodan Vucetic
ICDM
2009
IEEE
117views Data Mining» more  ICDM 2009»
13 years 11 months ago
Clustering with Multiple Graphs
—In graph-based learning models, entities are often represented as vertices in an undirected graph with weighted edges describing the relationships between entities. In many real...
Wei Tang, Zhengdong Lu, Inderjit S. Dhillon