Preliminary approach on synthetic data sets generation based on class separability measure

Classifier performance is usually evaluated on real-world problems, most of which come from public repositories. However, the inherent properties of these data, and how they affect classifier behavior, are unknown. In addition, the high cost or difficulty of experiments hinders data collection, leading to complex data sets characterized by few instances, missing values, and imprecise data. Generating synthetic data sets addresses both issues: it lets us build problems at low cost and with predefined characteristics, which is useful for testing system limitations in a controlled framework. This paper proposes generating synthetic data sets based on data complexity. We rely on the length of the class boundary to build the data sets, obtaining a preliminary set of benchmarks to assess classifier accuracy. The study can be developed further to identify regions of competence for classifiers.
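The abstract does not define how the length of the class boundary is measured. As a minimal sketch, the code below assumes the common formulation from the data complexity literature: build a minimum spanning tree over all points and report the fraction of points that touch an edge joining two different classes. The function names and the two-Gaussian generator are hypothetical and only illustrate how a generation parameter can move the measure; they are not the authors' procedure.

```python
import math
import random


def mst_edges(points):
    """Return the edges (i, j) of a Euclidean minimum spanning tree (Prim's algorithm)."""
    n = len(points)
    if n < 2:
        return []
    in_tree = [False] * n
    best_dist = [math.inf] * n   # cheapest known distance from the tree to each point
    best_from = [-1] * n         # tree vertex that realises that distance
    best_dist[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the closest point that is not yet in the tree.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_dist[i])
        in_tree[u] = True
        if best_from[u] >= 0:
            edges.append((best_from[u], u))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_dist[v]:
                    best_dist[v] = d
                    best_from[v] = u
    return edges


def boundary_fraction(points, labels):
    """Fraction of points incident to an MST edge that joins two different classes."""
    on_boundary = set()
    for i, j in mst_edges(points):
        if labels[i] != labels[j]:
            on_boundary.update((i, j))
    return len(on_boundary) / len(points)


def two_gaussians(n_per_class, gap, seed=0):
    """Hypothetical generator: two 2-D Gaussian blobs whose centres are `gap` apart."""
    rng = random.Random(seed)
    points, labels = [], []
    for label, cx in ((0, 0.0), (1, gap)):
        for _ in range(n_per_class):
            points.append((rng.gauss(cx, 1.0), rng.gauss(0.0, 1.0)))
            labels.append(label)
    return points, labels


if __name__ == "__main__":
    # Smaller gaps overlap the classes more and should raise the measure.
    for gap in (0.5, 2.0, 8.0):
        pts, lbs = two_gaussians(50, gap)
        print(gap, boundary_fraction(pts, lbs))
```

A generator driven by this kind of measure would adjust a parameter such as `gap` until the measured boundary fraction reaches a target value, which yields data sets of graded difficulty.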

Núria Macià, Ester Bernadó-Ma

Complex Data Sets | Computer Vision | ICPR 2008 | Imprecise Data | Synthetic Data Sets |

» Clustering-Based Construction of Hidden Markov Models for Generative Kernels

» A Tree-Based Approach to the Discovery of Diagnostic Biomarkers for Ovarian Cancer

» A New Class Based Associative Classification Algorithm

» Decision-theoretic approaches in fuzzy rule generation for diagnosis and fault detection pr...

» Deformable Template and Distribution Mixture-Based Data Modeling for the Endocardial Contou...

» Applying both positive and negative selection to supervised learning for anomaly detection

» Learning informative point classes for the acquisition of object model maps

» Basic Association Rules

Post Info

Added	05 Nov 2009
Updated	05 Nov 2009
Type	Conference
Year	2008
Where	ICPR
Authors	Núria Macià, Ester Bernadó-Mansilla, Albert Orriols-Puig
