Evaluating the Accuracy of Obesity Classification with Tree-Based Models: Decision Trees, Random Forests, and Gradient Boosting
Main Article Content
Abstract
Obesity is a condition of excess weight that can have a negative impact on health. This condition increases the risk of various diseases, such as heart disease, type 2 diabetes, hypertension, and other metabolic disorders. According to the World Health Organization (WHO) report, in 2022 there will be 2.5 billion adults aged 18 years and over who are overweight, including more than 890 million people who are obese. Seeing this problem, this study aims to develop an obesity classification model based on age, gender, body mass index (BMI), physical activity, and obesity category attributes. The models used in this study are decision tree, random forest, and gradient boosting, which are included in tree-based methods. The research stages include data collection, dataset processing, model building, and performance evaluation. In the final stage, the best method was selected from the three models used. The results showed that decision tree has an accuracy of 99.5%, random forest 99.7%, and gradient boosting 99.8%, making it the method with the best accuracy in obesity classification. With these results, gradient boosting can be used as a tool in health decision making, especially in detecting and categorizing individuals at risk of obesity more accurately. In addition, this model can help in developing more effective prevention and intervention strategies in dealing with obesity.
Article Details
Section

This work is licensed under a Creative Commons Attribution 4.0 International License.
References
Al-afghoni, J. M. H. Y., Setiawan, W., Dwi, Y., Negara, P., Informasi, S., Madura, U. T., Matrix, C., Tree, D., & Rusdiana, L. (2025). KLASIFIKASI JENIS BENIH KACANG MENGGUNAKAN SMOTE DAN DECISION TREE C4. JATI (Jurnal Mahasiswa Teknik Informatika), 9(1), 462–469. https://doi.org/https://doi.org/10.36040/jati.v9i1.12366
Alifah, R. N., Najib, M. K., Nurdiati, S., Sari, A. P., Herlambang, K., Putri, T., Ginting, B., & Sya’adah, S. N. (2024). Perbandingan Metode Tree Based Classification untuk Masalah Klasifikasi Data Body Mass Index. Indones. J. Math. Nat. Sci, 47(1), 2024. https://doi.org/https://doi.org/10.15294/m2k97436
Arta, M. C., Anwar, N., Putri, Y. A., Suharjito, S., & Asroll, M. (2024). Implementasi Prediksi Penyakit Jantung Menggunakan Data Mining Untuk Dunia Kesehatan. Jurnal Optimalisasi, 10(1), 42. https://doi.org/10.35308/jopt.v10i1.9075
Azahra, R. A. (2024). Komparasi Metode Naïve Bayes dan Kolmogorov Arnold Networks dalam Klasifikasi Tingkat Kemungkinan Obesitas. TEKNOLOGI NUSANTARA, 6(2), 77–84. http://ojs.uninus.ac.id/index.php/TEKNOLOGINUSANTARA/article/view/3334
Dayera, Musa Bundaris Palungan, F. O. (2024). Optimalisasi Akurasi Algoritma C4.5 dengan Metode Adaptive Boosting Memprediksi Siswa dalam Menerima Dana Pendidikan. G-Tech : Jurnal Teknologi Terapan, 8(1), 186–195. https://doi.org/https://doi.org/10.70609/gtech.v8i4.5612
Hardwis, S., Studi, P., Keolahragaan, I., & Indonesia, U. P. (2024). Analisis Resiko Obesitas Berdasarkan Aktivitas Fisik : Implementasi Metode Artificial Intelligence Machine Learning. Jurnal Keolahragaan, 10(November), 29–36. https://doi.org/10.25157/jkor.v10i2.16884
Herdian, C., Kamila, A., & Agung Musa Budidarma, I. G. (2024). Studi Kasus Feature Engineering Untuk Data Teks: Perbandingan Label Encoding dan One-Hot Encoding Pada Metode Linear Regresi. Technologia : Jurnal Ilmiah, 15(1), 93. https://doi.org/10.31602/tji.v15i1.13457
Kartika Sari, A., Akhmad Irsyad, Dinda Nur Aini, Islamiyah, & Stephanie Elfriede Ginting. (2024). Analisis Sentimen Twitter Menggunakan Machine Learning untuk Identifikasi Konten Negatif. Adopsi Teknologi Dan Sistem Informasi (ATASI), 3(1), 64–73. https://doi.org/10.30872/atasi.v3i1.1373
Nadira, A., & Farida Utami, S. (2024). Implementasi Data Mining Dalam Mengidentifikasi Faktor Pasien Yang Berpotensi Mengalami Obesitas Menggunakan Algoritma C4.5. JATI (Jurnal Mahasiswa Teknik Informatika), 8(4), 7872–7876. https://doi.org/10.36040/jati.v8i4.10498
Nasrullah, A. H. (2021). Implementasi Algoritma Decision Tree Untuk Klasifikasi Data Peserta Didik. Jurnal Pilar Nusa Mandiri, 7(2), 217.
https://ejournal.nusamandiri.ac.id/index.php/pilar/article/view/70
Nida Winarti, Maula, L. H., Amalia, A. R., Pratiwi, N. L. A., & Nandang. (2022). Penerapan Model Pembelajaran Project Based Learning Untuk Meningkatkan Kemampuan Berpikir Kritis Siswa Kelas Iii Sekolah Dasar. Jurnal Cakrawala Pendas, 8(3), 552–563. https://doi.org/10.31949/jcp.v8i3.2419
Putri, A., Hardiana, C. S., Novfuja, E., Siregar, F. T. P., Rahmaddeni, R., Fatma, Y., & Wahyuni, R. (2023). Komparasi Algoritma K-NN, Naive Bayes dan SVM untuk Prediksi Kelulusan Mahasiswa Tingkat Akhir. MALCOM: Indonesian Journal of Machine Learning and Computer Science, 3(1), 20–26. https://doi.org/10.57152/malcom.v3i1.610
Rahardika, F., Putra, B., & Surahmanto, M. (2024). Implementation of Machine Learning Classification of Obesity Weight using Dicision Tree. IJISTECH (International Journal of Information System and Technology), 8 (158), 110–116.
https://ijistech.org/ijistech/index.php/ijistech/article/view/354/352
Rifai, A., Permana, S. E., & Hamonangan, R. (2024). Optimalisasi Klasifikasi Indeks Desa Membangun Menggunakan Metode Ensemble Dan Algoritma Random Forest. JATI (Jurnal Mahasiswa Teknik Informatika), 8(4), 8226–8234. https://doi.org/10.36040/jati.v8i4.10971
Sabili, N. L., & Umbara, F. R. (2024). Klasifikasi Penyakit Diabetes Menggunakan Algoritma Categorical Boosting Dengan Faktor Risiko Diabetes. JATI(Jurnal Mahasiswa Teknik Informatika), 8(6), 11391–11398. https://doi.org/https://doi.org/10.36040/jati.v8i6.11447
Santoso, H. T., Felmidi, F. A., Nur, A., Ristyawan, A., & Daniati, E. (2024). Analisis Kinerja Algoritma Data Mining pada Klasifikasi Tingkat Obesitas dengan K-Fold Cross Validation dan AUC. Prosiding SEMNAS INOTEK (Seminar Nasional Inovasi Teknologi), 8, 113–122. https://doi.org/https://doi.org/10.29407/inotek.v8i1.4917
Setiyani, L., Indahsari, A. N., & Roestam, R. (2023). Analisis Prediksi Level Obesitas Menggunakan Perbandingan Algoritma Machine Learning dan Deep Learning. JTERA (Jurnal Teknologi Rekayasa), 8(1), 139. https://doi.org/10.31544/jtera.v8.i1.2022.139-146
Sukmawati, C. E., Fitri, A., Masruriyah, N., & Juwita, A. R. (2024). Efektivitas algoritma AdaBoost dan XGBoost pada dataset obesitas populasi dewasa. 6(2), 101–111. https://doi.org/10.37905/jji.
Yamantri, A. B., & Ahmad, A. (2024). Penerapan Algoritma C4.5 Untuk Prediksi Faktor Risiko Obesitas Pada Penduduk Dewasa. Jurnal Komputer Antartika, 2(c), 118–125. https://doi.org/https://doi.org/10.70052/jka.v2i3.341