Sciweavers

INTERSPEECH
2010
12 years 11 months ago
Improved language recognition using mixture components statistics
One successful approach to language recognition is to focus on the most discriminative high-level features of languages, such as phones and words. In this paper, we applied a simi...
Abualsoud Hanani, Michael J. Carey, Martin J....
TAL
2010
Springer
13 years 3 months ago
Summarization as Feature Selection for Document Categorization on Small Datasets
Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...
Emmanuel Anguiano-Hernández, Luis Villaseñ...
CIARP
2010
Springer
13 years 3 months ago
Improving the Dynamic Hierarchical Compact Clustering Algorithm by Using Feature Selection
Abstract. Feature selection has improved the performance of text clustering. In this paper, a local feature selection technique is incorporated in the dynamic hierarchical compact ...
Reynaldo Gil-García, Aurora Pons-Porrata
EVOW
2006
Springer
13 years 8 months ago
Robust SVM-Based Biomarker Selection with Noisy Mass Spectrometric Proteomic Data
Abstract. Computational analysis of mass spectrometric (MS) proteomic data from sera is of potential relevance for diagnosis, prognosis, choice of therapy, and study of disease act...
Elena Marchiori, Connie R. Jimenez, Mikkel West-Ni...
DEXAW
2007
IEEE
13 years 8 months ago
Dimensionality Reduction in a P2P System
Peers and data objects in the Hybrid Overlay Network (HON) are organized in an n-dimensional feature space. As the dimensionality increases, peers and data objects become sparse and ...
Mouna Kacimi, Kokou Yétongnon
WEBI
2005
Springer
13 years 10 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires extracting regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini
PAKDD
2005
ACM
13 years 10 months ago
Increasing Classification Accuracy by Combining Adaptive Sampling and Convex Pseudo-Data
The availability of microarray data has enabled several studies on the application of aggregated classifiers for molecular classification. We present a combination of classifier ag...
Chia Huey Ooi, Madhu Chetty
IWANN
2005
Springer
13 years 10 months ago
Heuristic Search over a Ranking for Feature Selection
In this work, we suggest a new feature selection technique that lets us use the wrapper approach for finding a well-suited feature set for distinguishing experiment classes in hig...
Roberto Ruiz, José Cristóbal Riquelm...
ISMDA
2005
Springer
13 years 10 months ago
Relevance, Redundancy and Differential Prioritization in Feature Selection for Multiclass Gene Expression Data
The large number of genes in microarray data makes feature selection techniques more crucial than ever. From various ranking-based filter procedures to classifier-based wrapper tec...
Chia Huey Ooi, Madhu Chetty, Shyh Wei Teng
ICDM
2006
IEEE
13 years 11 months ago
Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features
Several spatio-temporal data collected in many applications, such as fMRI data in medical applications, can be represented as a Multivariate Time Series (MTS) matrix with m rows (...
Hyunjin Yoon, Cyrus Shahabi