Sciweavers

ISVC
2005
Springer
13 years 10 months ago
Tool for Storm Analysis Using Multiple Data Sets
This note describes a web-based tool for storm analysis using multiple data sets developed for use in research of thunderstorms and forecasting applications. The tool was developed...
Robert M. Rabin, Tom Whittaker
IDA
2005
Springer
13 years 10 months ago
From Local Pattern Mining to Relevant Bi-cluster Characterization
Clustering or bi-clustering techniques have proved quite useful in many application domains. A weakness of these techniques remains the poor support for grouping characterizat...
Ruggero G. Pensa, Jean-François Boulicaut
EUROPAR
2005
Springer
13 years 10 months ago
Modeling Machine Availability in Enterprise and Wide-Area Distributed Computing Environments
In this paper, we consider the problem of modeling machine availability in enterprise-area and wide-area distributed computing settings. Using availability data gathered from three...
Daniel Nurmi, John Brevik, Richard Wolski
DILS
2005
Springer
13 years 10 months ago
Cluster Based Integration of Heterogeneous Biological Databases Using the AutoMed Toolkit
This paper presents an extensible architecture that can be used to support the integration of heterogeneous biological data sets. In our architecture, a clustering approach has bee...
Michael Maibaum, Lucas Zamboulis, Galia Rimon, Chr...
CIKM
2005
ACM
13 years 10 months ago
Towards estimating the number of distinct value combinations for a set of attributes
Accurately and efficiently estimating the number of distinct values for some attribute(s) or sets of attributes in a data set is of critical importance to many database operation...
Xiaohui Yu, Calisto Zuzarte, Kenneth C. Sevcik
APPT
2005
Springer
13 years 10 months ago
Principal Component Analysis for Distributed Data Sets with Updating
Identifying the patterns of large data sets is a key requirement in data mining. A powerful technique for this purpose is the principal component analysis (PCA). PCA-based clusteri...
Zheng-Jian Bai, Raymond H. Chan, Franklin T. Luk
AIME
2005
Springer
13 years 10 months ago
Towards Information Visualization and Clustering Techniques for MRI Data Sets
The paper deals with the integrated use of Information Visualization techniques and clustering algorithms to analyze Magnetic Resonance Imaging (MRI) data sets. The paper...
Umberto Castellani, Carlo Combi, Pasquina Marzola,...
AIIA
2005
Springer
13 years 10 months ago
Towards Fault-Tolerant Formal Concept Analysis
Given Boolean data sets which record properties of objects, Formal Concept Analysis is a well-known approach for knowledge discovery. Recent application domains, e.g., for very lar...
Ruggero G. Pensa, Jean-François Boulicaut
AI
2005
Springer
13 years 10 months ago
Comparing Dimension Reduction Techniques for Document Clustering
In this research, a systematic study is conducted of four dimension reduction techniques for the text clustering problem, using five benchmark data sets. Of the four methods -- Ind...
Bin Tang, Michael A. Shepherd, Malcolm I. Heywood,...
SIGIR
2005
ACM
13 years 10 months ago
Multi-label informed latent semantic indexing
Latent semantic indexing (LSI) is a well-known unsupervised approach for dimensionality reduction in information retrieval. However if the output information (i.e. category labels...
Kai Yu, Shipeng Yu, Volker Tresp