Sciweavers

JMLR
2010
161views more  JMLR 2010»
12 years 11 months ago
Feature Selection for Text Classification Based on Gini Coefficient of Inequality
A number of feature selection mechanisms have been explored in text categorization, among which mutual information, information gain and chi-square are considered most effective. ...
Sanasam Ranbir Singh, Hema A. Murthy, Timothy A. G...
JMLR
2010
153views more  JMLR 2010»
12 years 11 months ago
Feature Extraction for Outlier Detection in High-Dimensional Spaces
This work addresses the problem of feature extraction for boosting the performance of outlier detectors in high-dimensional spaces. Recent years have observed the prominence of mu...
Nguyen Hoang Vu, Vivekanand Gopalkrishnan
JMLR
2010
120views more  JMLR 2010»
12 years 11 months ago
Effective Wrapper-Filter hybridization through GRASP Schemata
Of all of the challenges which face the selection of relevant features for predictive data mining or pattern recognition modeling, the adaptation of computational intelligence tec...
Mohamed Amir Esseghir
JMLR
2010
165views more  JMLR 2010»
12 years 11 months ago
Feature Selection: An Ever Evolving Frontier in Data Mining
The rapid advance of computer technologies in data processing, collection, and storage has provided unparalleled opportunities to expand capabilities in production, services, comm...
Huan Liu, Hiroshi Motoda, Rudy Setiono, Zheng Zhao
JMLR
2010
104views more  JMLR 2010»
12 years 11 months ago
Increasing Feature Selection Accuracy for L1 Regularized Linear Models
L1 (also referred to as the 1-norm or Lasso) penalty based formulations have been shown to be effective in problem domains when noisy features are present. However, the L1 penalty...
Abhishek Jaiantilal, Gregory Z. Grudic
JMLR
2010
116views more  JMLR 2010»
12 years 11 months ago
Feature Selection, Association Rules Network and Theory Building
As the size and dimensionality of data sets increase, the task of feature selection has become increasingly important. In this paper we demonstrate how association rules can be us...
Sanjay Chawla
JMLR
2010
230views more  JMLR 2010»
12 years 11 months ago
Learning Dissimilarities for Categorical Symbols
In this paper we learn a dissimilarity measure for categorical data, for effective classification of the data points. Each categorical feature (with values taken from a finite set...
Jierui Xie, Boleslaw K. Szymanski, Mohammed J. Zak...
JMLR
2010
136views more  JMLR 2010»
12 years 11 months ago
Evaluation Method for Feature Rankings and their Aggregations for Biomarker Discovery
In this paper we investigate the problem of evaluating ranked lists of biomarkers, which are typically an output of the analysis of high-throughput data. This can be a list of pro...
Ivica Slavkov, Bernard Zenko, Saso Dzeroski
JMLR
2010
136views more  JMLR 2010»
12 years 11 months ago
Predicting the functions of proteins in Protein-Protein Interaction networks from global information
In this work we present a novel approach to predict the function of proteins in protein-protein interaction (PPI) networks. We classify existing approaches into inductive and tran...
Hossein Rahmani, Hendrik Blockeel, Andreas Bender
JMLR
2010
121views more  JMLR 2010»
12 years 11 months ago
A comparison of AUC estimators in small-sample studies
Reliable estimation of the classification performance of learned predictive models is difficult, when working in the small sample setting. When dealing with biological data it is ...
Antti Airola, Tapio Pahikkala, Willem Waegeman, Be...