Sciweavers

2958 search results - page 305 / 592
» Using Anonymized Data for Classification
GRC
2010
IEEE
15 years 2 months ago
A Comparative Study of Threshold-Based Feature Selection Techniques
Given high-dimensional software measurement data, researchers and practitioners often use feature (metric) selection techniques to improve the performance of software quality clas...
Huanjing Wang, Taghi M. Khoshgoftaar, Jason Van Hu...
IJDMB
2007
15 years 4 months ago
Transductive learning with EM algorithm to classify proteins based on phylogenetic profiles
Phylogenetic profiles of proteins, strings of ones and zeros encoding respectively the presence and absence of proteins in a group of genomes, have recently been used to id...
Roger A. Craig, Li Liao
FAST
2004
15 years 5 months ago
Tracefs: A File System to Trace Them All
File system traces have been used for years to analyze user behavior and system software behavior, leading to advances in file system and storage technologies. Existing traces, ho...
Akshat Aranya, Charles P. Wright, Erez Zadok
KDD
2003
ACM
16 years 4 months ago
Accurate decision trees for mining high-speed data streams
In this paper we study the problem of constructing accurate decision tree models from data streams. Data streams are incremental tasks that require incremental, online, and any-ti...
João Gama, Pedro Medas, Ricardo Rocha
IDA
2008
Springer
15 years 4 months ago
Symbolic methodology for numeric data mining
Currently, statistical and artificial neural network methods dominate in data mining applications. Alternative relational (symbolic) data mining methods have shown their effectivene...
Boris Kovalerchuk, Evgenii Vityaev