Comparison of Adaptive Boosting and Categorical Boosting in Heart Attack Diagnosis

Ali Amran, Suryani Suryani, Nadiva Azro Fathinah, Anita Desiani, Indri Ramayanti

Abstract


Heart disease is one of the leading causes of death worldwide, so accurate early detection methods are needed to help reduce mortality. One applicable approach is machine learning classification with ensemble boosting algorithms. This study compares the performance of two such algorithms, Adaptive Boosting (AdaBoost) and Categorical Boosting (CatBoost), in classifying heart attack cases into two labels, positive and negative. The models were evaluated with two testing techniques: a percentage split of 80% training data and 20% testing data, and 10-fold cross-validation. Performance was measured by accuracy, precision, and recall to give a fuller picture of classification capability than accuracy alone. With the percentage split, CatBoost achieved the highest accuracy, 98.88%; with 10-fold cross-validation it reached 98.43%. AdaBoost also performed well, with all evaluation metrics exceeding 90%. The study concludes that the best-performing model on the heart attack dataset is CatBoost with 10-fold cross-validation.
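
The evaluation protocol described in the abstract can be illustrated with a minimal, standard-library-only sketch. It shows how an 80/20 percentage split and a 10-fold cross-validation partition are typically built, and how accuracy, precision, and recall follow from the confusion counts for the positive and negative labels. The function names, the random seed, and the toy labels are assumptions made for illustration; the abstract does not state which tools or settings the authors used, and no classifier is trained here.

```python
import random


def percentage_split(n_samples, train_ratio=0.8, seed=0):
    """Shuffle sample indices and split them into train and test lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = int(round(n_samples * train_ratio))
    return indices[:cut], indices[cut:]


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train, test) index lists so each sample is tested exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices round-robin into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


def classification_metrics(y_true, y_pred, positive="positive"):
    """Compute accuracy, precision, and recall for a binary labelling."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        # Guard against division by zero when no positives are predicted/present.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


if __name__ == "__main__":
    # Toy labels (hypothetical, not taken from the study's dataset).
    y_true = ["positive", "positive", "negative", "negative", "positive"]
    y_pred = ["positive", "negative", "negative", "positive", "positive"]
    print(classification_metrics(y_true, y_pred))

    train_idx, test_idx = percentage_split(100)
    print(len(train_idx), len(test_idx))

    for fold, (train, test) in enumerate(k_fold_indices(100, k=10)):
        print(fold, len(train), len(test))
```

In a real comparison, each classifier would be fitted on the training indices and scored on the test indices; under cross-validation the per-fold metrics are usually averaged, which is why a cross-validated score is often treated as a more stable estimate than a single split.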

DOI: http://dx.doi.org/10.30811/jaise.v6i1.9051

Journal of Artificial Intelligence and Software Engineering (JAISE) is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.