Sciweavers

SDM
2009
SIAM
Adaptive Concept Drift Detection.
An established method to detect concept drift in data streams is to perform statistical hypothesis testing on the multivariate data in the stream. Statistical decision theory off...
Anton Dries, Ulrich Rückert
SDM
2009
SIAM
Graph Generation with Prescribed Feature Constraints.
In this paper, we study the problem of how to generate synthetic graphs matching various properties of a real social network with two applications, privacy preserving social netwo...
Xiaowei Ying, Xintao Wu
SDM
2009
SIAM
Optimal Distance Bounds on Time-Series Data.
Most data mining operations include an integral search component at their core. For example, the performance of similarity search or classification based on Nearest Neighbors is ...
Michail Vlachos, Philip S. Yu, Suleyman S. Kozat
SDM
2009
SIAM
Travel-Time Prediction Using Gaussian Process Regression: A Trajectory-Based Approach.
This paper is concerned with the task of travel-time prediction for an arbitrary origin-destination pair on a map. Unlike most of the existing studies, which focus only on a parti...
Sei Kato, Tsuyoshi Idé
SDM
2009
SIAM
Spatially Cost-Sensitive Active Learning.
In active learning, one attempts to maximize classifier performance for a given number of labeled training points by allowing the active learning algorithm to choose which points...
Alexander Liu, Goo Jun, Joydeep Ghosh
SDM
2009
SIAM
Exploiting Semantic Constraints for Estimating Supersenses with CRFs.
The annotation of words and phrases by ontology concepts is extremely helpful for semantic interpretation. However, many ontologies, e.g. WordNet, are too fine-grained and even hu...
Gerhard Paaß, Frank Reichartz
SDM
2009
SIAM
Detection and Characterization of Anomalies in Multivariate Time Series.
Anomaly detection in multivariate time series is an important data mining task with applications to ecosystem modeling, network traffic monitoring, medical diagnosis, and other d...
Christopher Potter, Haibin Cheng, Pang-Ning Tan, S...
SDM
2009
SIAM
Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases.
As the amount of textual information grows explosively in various kinds of business systems, it becomes more and more desirable to analyze both structured data records and unstruc...
ChengXiang Zhai, Duo Zhang, Jiawei Han
SDM
2009
SIAM
Hybrid Clustering of Text Mining and Bibliometrics Applied to Journal Sets.
To obtain correlated and complementary information contained in text mining and bibliometrics, hybrid clustering to incorporate textual content and citation information has become...
Bart De Moor, Frizo A. L. Janssens, Shi Yu, Wolfga...
SDM
2009
SIAM
Topic Evolution in a Stream of Documents.
Document collections evolve over time: new topics emerge and old ones decline. At the same time, the terminology evolves as well. Much literature is devoted to topic evolution in ...
Alexander Hinneburg, André Gohr, Myra Spili...