Sciweavers

PAKDD
2015
ACM
13views Data Mining» more  PAKDD 2015»
8 years 11 days ago
What Is New in Our City? A Framework for Event Extraction Using Social Media Posts
Post streams from public social media platforms such as Instagram and Twitter have become precious but noisy data sources to discover what is happening around us. In this paper, we...
Chaolun Xia, Jun Hu, Yan Zhu, Mor Naaman
PAKDD
2015
ACM
21views Data Mining» more  PAKDD 2015»
8 years 11 days ago
Internal Clustering Evaluation of Data Streams
Abstract. Clustering validation is a crucial part of choosing a clustering algorithm which performs best for an input data. Internal clustering validation is efficient and realisti...
Marwan Hassani, Thomas Seidl 0001
PAKDD
2015
ACM
10views Data Mining» more  PAKDD 2015»
8 years 11 days ago
Coupling Multiple Views of Relations for Recommendation
Learning user/item relation is a key issue in recommender system, and existing methods mostly measure the user/item relation from one particular aspect, e.g., historical ratings, e...
Bin Fu, Guandong Xu, Longbing Cao, Zhihai Wang, Zh...
PAKDD
2015
ACM
10views Data Mining» more  PAKDD 2015»
8 years 11 days ago
Rank Matrix Factorisation
We introduce the problem of rank matrix factorisation (RMF). That is, we consider the decomposition of a rank matrix, in which each row is a (partial or complete) ranking of all co...
Thanh Le Van, Matthijs van Leeuwen, Siegfried Nijs...
PAKDD
2015
ACM
21views Data Mining» more  PAKDD 2015»
8 years 11 days ago
Scalable Outlying-Inlying Aspects Discovery via Feature Ranking
In outlying aspects mining, given a query object, we aim to answer the question as to what features make the query most outlying. The most recent works tackle this problem using tw...
Nguyen Xuan Vinh, Jeffrey Chan, James Bailey, Chri...
PAKDD
2015
ACM
14views Data Mining» more  PAKDD 2015»
8 years 11 days ago
Uncovering the Latent Structures of Crowd Labeling
Crowdsourcing provides a new way to distribute enormous tasks to a crowd of annotators. The divergent knowledge background and personal preferences of crowd annotators lead to nois...
Tian Tian, Jun Zhu
PAKDD
2015
ACM
8 years 11 days ago
Learning of Performance Measures from Crowd-Sourced Data with Application to Ranking of Investments
Abstract. Interestingness measures stand as proxy for “real human interest,” but their effectiveness is rarely studied empirically due to the difficulty of obtaining ground-tr...
Greg Harris, Anand V. Panangadan, Viktor K. Prasan...
PAKDD
2015
ACM
8 years 11 days ago
Identifying Hesitant and Interested Customers for Targeted Social Marketing
Abstract. Social networks provide unparalleled opportunities for marketing products or services. Along this line, tremendous efforts have been devoted to the research of targeted ...
Guowei Ma, Qi Liu, Le Wu, Enhong Chen
PAKDD
2015
ACM
12views Data Mining» more  PAKDD 2015»
8 years 11 days ago
Leveraging the Common Cause of Errors for Constraint-Based Data Cleansing
This study describes a statistically motivated approach to constraint-based data cleansing that derives the cause of errors from a distribution of conflicting tuples. In real-worl...
Ayako Hoshino, Hiroki Nakayama, Chihiro Ito, Kyota...
PAKDD
2015
ACM
8 years 11 days ago
Model Selection of Symbolic Regression to Improve the Accuracy of PM2.5 Concentration Prediction
As one of the main components of haze, topics with respect to PM2.5 are coming into people’s sight recently in China. In this paper, we try to predict PM2.5 concentrations in Da...
Guangfei Yang, Jian Huang