Sciweavers

SGAI
2004
Springer

Overfitting in Wrapper-Based Feature Subset Selection: The Harder You Try the Worse it Gets

13 years 9 months ago
Overfitting in Wrapper-Based Feature Subset Selection: The Harder You Try the Worse it Gets
In Wrapper based feature selection, the more states that are visited during the search phase of the algorithm the greater the likelihood of finding a feature subset that has a high internal accuracy while generalizing poorly. When this occurs, we say that the algorithm has overfitted to the training data. We outline a set of experiments to show this and we introduce a modified genetic algorithm to address this overfitting problem by stopping the search before overfitting occurs. This new algorithm called GAWES (Genetic Algorithm With Early Stopping) reduces the level of overfitting and yields feature subsets that have a better generalization accuracy.
John Loughrey, Padraig Cunningham
Added 02 Jul 2010
Updated 02 Jul 2010
Type Conference
Year 2004
Where SGAI
Authors John Loughrey, Padraig Cunningham
Comments (0)