Sciweavers

SDM
2012
SIAM
261views Data Mining» more  SDM 2012»
11 years 7 months ago
Combining Active Learning and Dynamic Dimensionality Reduction
To date, many active learning techniques have been developed for acquiring labels when training data is limited. However, an important aspect of the problem has often been neglect...
Mustafa Bilgic
SDM
2012
SIAM
252views Data Mining» more  SDM 2012»
11 years 7 months ago
Learning from Heterogeneous Sources via Gradient Boosting Consensus
Multiple data sources containing different types of features may be available for a given task. For instance, users’ profiles can be used to build recommendation systems. In a...
Xiaoxiao Shi, Jean-François Paiement, David...
SDM
2012
SIAM
235views Data Mining» more  SDM 2012»
11 years 7 months ago
Sampling Strategies to Evaluate the Performance of Unknown Predictors
The focus of this paper is on how to select a small sample of examples for labeling that can help us to evaluate many different classification models unknown at the time of sampl...
Hamed Valizadegan, Saeed Amizadeh, Milos Hauskrech...
SDM
2012
SIAM
238views Data Mining» more  SDM 2012»
11 years 7 months ago
Evaluating Event Credibility on Twitter
Though Twitter acts as a realtime news source with people acting as sensors and sending event updates from all over the world, rumors spread via Twitter have been noted to cause c...
Manish Gupta, Peixiang Zhao, Jiawei Han
SDM
2012
SIAM
285views Data Mining» more  SDM 2012»
11 years 7 months ago
A Novel Approximation to Dynamic Time Warping allows Anytime Clustering of Massive Time Series Datasets
Given the ubiquity of time series data, the data mining community has spent significant time investigating the best time series similarity measure to use for various tasks and dom...
Qiang Zhu 0002, Gustavo E. A. P. A. Batista, Thana...
SDM
2012
SIAM
273views Data Mining» more  SDM 2012»
11 years 7 months ago
A Framework for the Evaluation and Management of Network Centrality
Network-analysis literature is rich in node-centrality measures that quantify the centrality of a node as a function of the (shortest) paths of the network that go through it. Exi...
Vatche Ishakian, Dóra Erdös, Evimaria ...
SDM
2012
SIAM
234views Data Mining» more  SDM 2012»
11 years 7 months ago
On Evaluation of Outlier Rankings and Outlier Scores
Outlier detection research is currently focusing on the development of new methods and on improving the computation time for these methods. Evaluation however is rather heuristic,...
Erich Schubert, Remigius Wojdanowski, Arthur Zimek...
SDM
2012
SIAM
281views Data Mining» more  SDM 2012»
11 years 7 months ago
Contextual Collaborative Filtering via Hierarchical Matrix Factorization
Matrix factorization (MF) has been demonstrated to be one of the most competitive techniques for collaborative filtering. However, state-of-the-art MFs do not consider contextual...
ErHeng Zhong, Wei Fan, Qiang Yang
SDM
2012
SIAM
282views Data Mining» more  SDM 2012»
11 years 7 months ago
Citation Prediction in Heterogeneous Bibliographic Networks
To reveal information hiding in link space of bibliographical networks, link analysis has been studied from different perspectives in recent years. In this paper, we address a no...
Xiao Yu, Quanquan Gu, Mianwei Zhou, Jiawei Han
SDM
2012
SIAM
220views Data Mining» more  SDM 2012»
11 years 7 months ago
Transfer Topic Modeling with Ease and Scalability
Jeon-Hyung Kang, Jun Ma, Yang Liu