A Geographic Information System allows to create and manage spatial data. Having many public users who create and edit objects in geographic maps, the question of data quality aris...
Implementing a Total Data Quality Management (TDQM) program is not a trivial undertaking. Two key steps are (1) to clearly define what an organization means by data quality and (2...
We present a comprehensive suite of experimentation on the subject of learning from imbalanced data. When classes are imbalanced, many learning algorithms can suffer from the pers...
Jason Van Hulse, Taghi M. Khoshgoftaar, Amri Napol...
We present a parallel version of BIRCH with the objective of enhancing the scalability without compromising on the quality of clustering. The incoming data is distributed in a cyc...
This paper presents the results of a software usability study, involving both subjective and objective evaluation. It compares a popular XML data transformation language (XSLT) an...