Evaluating the Accuracy of Obesity Classification with Tree-Based Models: Decision Trees, Random Forests, and Gradient Boosting

Main Article Content

Viviana Herlita Vidiasari
B. Nadila Nuzululnisa
Neny Sulistianingsih

Abstract

Obesity is a condition of excess weight that can have a negative impact on health. This condition increases the risk of various diseases, such as heart disease, type 2 diabetes, hypertension, and other metabolic disorders. According to the World Health Organization (WHO) report, in 2022 there will be 2.5 billion adults aged 18 years and over who are overweight, including more than 890 million people who are obese. Seeing this problem, this study aims to develop an obesity classification model based on age, gender, body mass index (BMI), physical activity, and obesity category attributes. The models used in this study are decision tree, random forest, and gradient boosting, which are included in tree-based methods. The research stages include data collection, dataset processing, model building, and performance evaluation. In the final stage, the best method was selected from the three models used. The results showed that decision tree has an accuracy of 99.5%, random forest 99.7%, and gradient boosting 99.8%, making it the method with the best accuracy in obesity classification. With these results, gradient boosting can be used as a tool in health decision making, especially in detecting and categorizing individuals at risk of obesity more accurately. In addition, this model can help in developing more effective prevention and intervention strategies in dealing with obesity.

Article Details

Section

Articles

References

Al-afghoni, J. M. H. Y., Setiawan, W., Dwi, Y., Negara, P., Informasi, S., Madura, U. T., Matrix, C., Tree, D., & Rusdiana, L. (2025). KLASIFIKASI JENIS BENIH KACANG MENGGUNAKAN SMOTE DAN DECISION TREE C4. JATI (Jurnal Mahasiswa Teknik Informatika), 9(1), 462–469. https://doi.org/https://doi.org/10.36040/jati.v9i1.12366

Alifah, R. N., Najib, M. K., Nurdiati, S., Sari, A. P., Herlambang, K., Putri, T., Ginting, B., & Sya’adah, S. N. (2024). Perbandingan Metode Tree Based Classification untuk Masalah Klasifikasi Data Body Mass Index. Indones. J. Math. Nat. Sci, 47(1), 2024. https://doi.org/https://doi.org/10.15294/m2k97436

Arta, M. C., Anwar, N., Putri, Y. A., Suharjito, S., & Asroll, M. (2024). Implementasi Prediksi Penyakit Jantung Menggunakan Data Mining Untuk Dunia Kesehatan. Jurnal Optimalisasi, 10(1), 42. https://doi.org/10.35308/jopt.v10i1.9075

Azahra, R. A. (2024). Komparasi Metode Naïve Bayes dan Kolmogorov Arnold Networks dalam Klasifikasi Tingkat Kemungkinan Obesitas. TEKNOLOGI NUSANTARA, 6(2), 77–84. http://ojs.uninus.ac.id/index.php/TEKNOLOGINUSANTARA/article/view/3334

Dayera, Musa Bundaris Palungan, F. O. (2024). Optimalisasi Akurasi Algoritma C4.5 dengan Metode Adaptive Boosting Memprediksi Siswa dalam Menerima Dana Pendidikan. G-Tech : Jurnal Teknologi Terapan, 8(1), 186–195. https://doi.org/https://doi.org/10.70609/gtech.v8i4.5612

Hardwis, S., Studi, P., Keolahragaan, I., & Indonesia, U. P. (2024). Analisis Resiko Obesitas Berdasarkan Aktivitas Fisik : Implementasi Metode Artificial Intelligence Machine Learning. Jurnal Keolahragaan, 10(November), 29–36. https://doi.org/10.25157/jkor.v10i2.16884

Herdian, C., Kamila, A., & Agung Musa Budidarma, I. G. (2024). Studi Kasus Feature Engineering Untuk Data Teks: Perbandingan Label Encoding dan One-Hot Encoding Pada Metode Linear Regresi. Technologia : Jurnal Ilmiah, 15(1), 93. https://doi.org/10.31602/tji.v15i1.13457

Kartika Sari, A., Akhmad Irsyad, Dinda Nur Aini, Islamiyah, & Stephanie Elfriede Ginting. (2024). Analisis Sentimen Twitter Menggunakan Machine Learning untuk Identifikasi Konten Negatif. Adopsi Teknologi Dan Sistem Informasi (ATASI), 3(1), 64–73. https://doi.org/10.30872/atasi.v3i1.1373

Nadira, A., & Farida Utami, S. (2024). Implementasi Data Mining Dalam Mengidentifikasi Faktor Pasien Yang Berpotensi Mengalami Obesitas Menggunakan Algoritma C4.5. JATI (Jurnal Mahasiswa Teknik Informatika), 8(4), 7872–7876. https://doi.org/10.36040/jati.v8i4.10498

Nasrullah, A. H. (2021). Implementasi Algoritma Decision Tree Untuk Klasifikasi Data Peserta Didik. Jurnal Pilar Nusa Mandiri, 7(2), 217.

https://ejournal.nusamandiri.ac.id/index.php/pilar/article/view/70

Nida Winarti, Maula, L. H., Amalia, A. R., Pratiwi, N. L. A., & Nandang. (2022). Penerapan Model Pembelajaran Project Based Learning Untuk Meningkatkan Kemampuan Berpikir Kritis Siswa Kelas Iii Sekolah Dasar. Jurnal Cakrawala Pendas, 8(3), 552–563. https://doi.org/10.31949/jcp.v8i3.2419

Putri, A., Hardiana, C. S., Novfuja, E., Siregar, F. T. P., Rahmaddeni, R., Fatma, Y., & Wahyuni, R. (2023). Komparasi Algoritma K-NN, Naive Bayes dan SVM untuk Prediksi Kelulusan Mahasiswa Tingkat Akhir. MALCOM: Indonesian Journal of Machine Learning and Computer Science, 3(1), 20–26. https://doi.org/10.57152/malcom.v3i1.610

Rahardika, F., Putra, B., & Surahmanto, M. (2024). Implementation of Machine Learning Classification of Obesity Weight using Dicision Tree. IJISTECH (International Journal of Information System and Technology), 8 (158), 110–116.

https://ijistech.org/ijistech/index.php/ijistech/article/view/354/352

Rifai, A., Permana, S. E., & Hamonangan, R. (2024). Optimalisasi Klasifikasi Indeks Desa Membangun Menggunakan Metode Ensemble Dan Algoritma Random Forest. JATI (Jurnal Mahasiswa Teknik Informatika), 8(4), 8226–8234. https://doi.org/10.36040/jati.v8i4.10971

Sabili, N. L., & Umbara, F. R. (2024). Klasifikasi Penyakit Diabetes Menggunakan Algoritma Categorical Boosting Dengan Faktor Risiko Diabetes. JATI(Jurnal Mahasiswa Teknik Informatika), 8(6), 11391–11398. https://doi.org/https://doi.org/10.36040/jati.v8i6.11447

Santoso, H. T., Felmidi, F. A., Nur, A., Ristyawan, A., & Daniati, E. (2024). Analisis Kinerja Algoritma Data Mining pada Klasifikasi Tingkat Obesitas dengan K-Fold Cross Validation dan AUC. Prosiding SEMNAS INOTEK (Seminar Nasional Inovasi Teknologi), 8, 113–122. https://doi.org/https://doi.org/10.29407/inotek.v8i1.4917

Setiyani, L., Indahsari, A. N., & Roestam, R. (2023). Analisis Prediksi Level Obesitas Menggunakan Perbandingan Algoritma Machine Learning dan Deep Learning. JTERA (Jurnal Teknologi Rekayasa), 8(1), 139. https://doi.org/10.31544/jtera.v8.i1.2022.139-146

Sukmawati, C. E., Fitri, A., Masruriyah, N., & Juwita, A. R. (2024). Efektivitas algoritma AdaBoost dan XGBoost pada dataset obesitas populasi dewasa. 6(2), 101–111. https://doi.org/10.37905/jji.

Yamantri, A. B., & Ahmad, A. (2024). Penerapan Algoritma C4.5 Untuk Prediksi Faktor Risiko Obesitas Pada Penduduk Dewasa. Jurnal Komputer Antartika, 2(c), 118–125. https://doi.org/https://doi.org/10.70052/jka.v2i3.341