Cardiovascular Disease Prediction via Hybrid SVM–SMOTE and Sparse Autoencoder Feature Reduction with Deep MLP Classification

Abstract

Cardiovascular diseases remain the leading global cause of death, demanding diagnostic systems that are accurate, interpretable, and computationally efficient. Traditional machine learning approaches frequently struggle with class imbalance, high-dimensional noise, and restricted generalization in clinical datasets. To tackle such issues, we propose a hybrid framework that combines SVM–SMOTE and neighborhood cleaning rule (NCL) for class rebalancing, a sparse autoencoder (SAE) with random forest (RF) selection for non-linear feature optimization, and a class-weighted multilayer perceptron (MLP) for final classification. We validate our framework on the Z-Alizadeh Sani (54 features) and Cleveland (13 features) datasets under stratified fivefold cross-validation, the model attains mean accuracies of 94.02 ± 2.77 % and 94.36 ± 1.47 %, with AUC–ROC = 0.988 and 0.982, outperforming prior baselines [4, 10, 14] by 7.6%–20.8%, and Bootstrap 95% confidence intervals and McNemar/DeLong tests (p < 0.001) confirms significance. Noteably, the ablation study demonstrates the contribution of each module (e.g., a 12% accuracy improvement without sampling). The optimized MLP reduced false negatives to ~5%, while training 40% faster than CNN–LSTM alternatives. The proposed framework provides a statistically robust and interpretable solution for predicting cardiovascular disease.

Author Biographies

Zaid Alaa, University of Kufa

Master's in Computer Science and Artificial Intelligence, and I'm currently working as a senior lecturer in the Department of Computer Science, Faculty of Education for Women, University of Kufa.

Ali Sabah, University of Kufa

I received a Ph.D. in Computer Science and Artificial Intelligence from the National University of Malaysia (UKM) in 2021, and I'm currently working in the Department of Computer Science, Faculty of Education for Women, University of Kufa.ORCID Link:https://orcid.org/0000-0002-7148-4976

Authors

  • Zaid Alaa University of Kufa
  • Ali Sabah University of Kufa

DOI:

https://doi.org/10.31449/inf.v50i5.10455

Downloads

Published

02/02/2026

How to Cite

Alaa, Z., & Sabah, A. (2026). Cardiovascular Disease Prediction via Hybrid SVM–SMOTE and Sparse Autoencoder Feature Reduction with Deep MLP Classification. Informatica, 50(5). https://doi.org/10.31449/inf.v50i5.10455