[ 1 ] [ 2 ] [ 3 ] [ 4 ] [ 5 ] [ 6 ] [ 7 ] [ 8 ] [ 9 ] [ 10 ] [ 11 ] [ 12 ] [ 13 ]

Journal of Information Science and Engineering, Vol. 38 No. 4, pp. 791-803

Univariate and Multivariate Filter Feature Selection for Heart Disease Classification

1Software Project Management Research Team, ENSIAS
Mohammed V University in Rabat
Rabat, 10100 Morocco

Moulay ISMAIL University
Meknes, 50050 Morocco

3MSDA, Mohammed VI Polytechnic University
Benguerir, 43150 Morocco
E-mail: {houda_benhar; ali.idri}@um5.ac.ma; hosni.mohamed1@gmail.com

Feature selection (FS) is a data preprocessing task that can be applied before the clas-sification phase, and aims at improving the performance and interpretability of classifiers by finding only a few highly informative features. The present study aims at evaluating and comparing the performances of six univariate and two multivariate filter FS techniques for heart disease classification. The FS techniques were evaluated with two white-box and two black-box classification techniques using five heart disease datasets. Furthermore, this study deals with the setting of the hyperparameters’ values of the four classifiers. This study evaluates 600 variants of classifiers. Results show that white-box classification tech-niques such as K-Nearest Neighbors and Decision Trees can be very competitive with black-box ones when hyperparameters’ optimization and feature selection were applied.

Keywords: classification, filters, feature selection, data preprocessing, heart disease

  Retrieve PDF document (JISE_202204_08.pdf)