Sciweavers

IJCNN
2006
IEEE
13 years 10 months ago
Prototype based outlier detection
— Outliers refer to “minority” data that are different from most other data. They usually disturb data mining process. But, sometimes they provide valuable information. Thus,...
Seungtaek Kim, Sungzoon Cho
VLDB
2007
ACM
129views Database» more  VLDB 2007»
13 years 11 months ago
Anytime Measures for Top-k Algorithms
Top-k queries on large multi-attribute data sets are fundamental operations in information retrieval and ranking applications. In this paper, we initiate research on the anytime b...
Benjamin Arai, Gautam Das, Dimitrios Gunopulos, Ni...
VLDB
2007
ACM
116views Database» more  VLDB 2007»
13 years 11 months ago
K-Anonymization as Spatial Indexing: Toward Scalable and Incremental Anonymization
In this paper we observe that k-anonymizing a data set is strikingly similar to building a spatial index over the data set, so similar in fact that classical spatial indexing tech...
Tochukwu Iwuchukwu, Jeffrey F. Naughton
VLDB
2007
ACM
114views Database» more  VLDB 2007»
13 years 11 months ago
IndeGS: Index Supported Graphics Data Server for CFD Data Postprocessing
Virtual reality techniques particularly in the field of CFD (computational fluid dynamics) are of growing importance due to their ability to offer comfortable means to interact...
Christoph Brochhaus, Thomas Seidl
PKDD
2007
Springer
109views Data Mining» more  PKDD 2007»
13 years 11 months ago
Matching Partitions over Time to Reliably Capture Local Clusters in Noisy Domains
Abstract. When seeking for small clusters it is very intricate to distinguish between incidental agglomeration of noisy points and true local patterns. We present the PAMALOC algor...
Frank Höppner, Mirko Böttcher
MICCAI
2007
Springer
13 years 11 months ago
Robust Autonomous Model Learning from 2D and 3D Data Sets
In this paper we propose a weakly supervised learning algorithm for appearance models based on the minimum description length (MDL) principle. From a set of training images or volu...
Georg Langs, Rene Donner, Philipp Peloschek, Horst...
HIPEAC
2007
Springer
13 years 11 months ago
MiDataSets: Creating the Conditions for a More Realistic Evaluation of Iterative Optimization
Abstract. Iterative optimization has become a popular technique to obtain improvements over the default settings in a compiler for performance-critical applications, such as embedd...
Grigori Fursin, John Cavazos, Michael F. P. O'Boyl...
GRID
2007
Springer
13 years 11 months ago
Data placement for scientific applications in distributed environments
— Scientific applications often perform complex computational analyses that consume and produce large data sets. We are concerned with data placement policies that distribute dat...
Ann L. Chervenak, Ewa Deelman, Miron Livny, Mei-Hu...
SMC
2007
IEEE
110views Control Systems» more  SMC 2007»
13 years 11 months ago
A validity index based on cluster symmetry
— An important consideration in clustering is the determination of the correct number of clusters and the appropriate partitioning of a given data set. In this paper, a newly dev...
Sriparna Saha, Sanghamitra Bandyopadhyay
IEEEARES
2007
IEEE
13 years 11 months ago
Using Privacy Process Patterns for Incorporating Privacy Requirements into the System Design Process
In the online world every person has to hold a number of different data sets so as to be able to have access to various e-services and take part in specific economical and social ...
Christos Kalloniatis, Evangelia Kavakli, Stefanos ...