Sciweavers

PKDD
2007
Springer
87views Data Mining» more  PKDD 2007»
13 years 10 months ago
Visual Exploration of Genomic Data
Abstract. In this study, we present methods for comparative visualization of DNA sequences in two dimensions. First, we illustrate a transformation of gene sequences into numerical...
Michail Vlachos, Bahar Taneri, Eamonn J. Keogh, Ph...
PKDD
2007
Springer
86views Data Mining» more  PKDD 2007»
13 years 10 months ago
An Effective Approach to Enhance Centroid Classifier for Text Categorization
Centroid Classifier has been shown to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by i...
Songbo Tan, Xueqi Cheng
PKDD
2007
Springer
109views Data Mining» more  PKDD 2007»
13 years 10 months ago
Matching Partitions over Time to Reliably Capture Local Clusters in Noisy Domains
Abstract. When seeking for small clusters it is very intricate to distinguish between incidental agglomeration of noisy points and true local patterns. We present the PAMALOC algor...
Frank Höppner, Mirko Böttcher
PKDD
2007
Springer
130views Data Mining» more  PKDD 2007»
13 years 10 months ago
Bridged Refinement for Transfer Learning
Dikan Xing, Wenyuan Dai, Gui-Rong Xue, Yong Yu
PKDD
2007
Springer
114views Data Mining» more  PKDD 2007»
13 years 10 months ago
Robust Visual Mining of Data with Error Information
Abstract. Recent results on robust density-based clustering have indicated that the uncertainty associated with the actual measurements can be exploited to locate objects that are ...
Jianyong Sun, Ata Kabán, Somak Raychaudhury
PKDD
2007
Springer
214views Data Mining» more  PKDD 2007»
13 years 10 months ago
Multi-party, Privacy-Preserving Distributed Data Mining Using a Game Theoretic Framework
Abstract. Analysis of privacy-sensitive data in a multi-party environment often assumes that the parties are well-behaved and they abide by the protocols. Parties compute whatever ...
Hillol Kargupta, Kamalika Das, Kun Liu
PKDD
2007
Springer
91views Data Mining» more  PKDD 2007»
13 years 10 months ago
Domain Adaptation of Conditional Probability Models Via Feature Subsetting
The goal in domain adaptation is to train a model using labeled data sampled from a domain different from the target domain on which the model will be deployed. We exploit unlabel...
Sandeepkumar Satpal, Sunita Sarawagi
PKDD
2007
Springer
107views Data Mining» more  PKDD 2007»
13 years 10 months ago
An Empirical Comparison of Exact Nearest Neighbour Algorithms
Nearest neighbour search (NNS) is an old problem that is of practical importance in a number of fields. It involves finding, for a given point q, called the query, one or more po...
Ashraf M. Kibriya, Eibe Frank
PKDD
2007
Springer
120views Data Mining» more  PKDD 2007»
13 years 10 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih