Perbandingan Algoritma Klasifikasi Data Mining Untuk Prediksi Penyakit Stroke

Authors

  • Yufis Azhar Universitas Muhammadiyah Malang
  • Aidia Khoiriyah Firdausy Universitas Muhammadiyah Malang
  • Putri Juli Amelia Universitas Muhammadiyah Malang

DOI:

https://doi.org/10.31598/sintechjournal.v5i2.1222

Keywords:

data mining, stroke, prediction

Abstract

Data mining is often called knowledge Discovery in Database (KDD). Data mining is usually used to improve future decision making based on information obtained from the past. For example for prediction, estimation, association, clustering, and description. Stroke is the second most deadly disease in the world according to WHO. The sufferer has an injury to the nervous system. Because of this, health experts, especially in the field of nursing, need special attention. Currently, the development of the Industrial Revolution Era 4.0 is collaborating in the fieldsof technology and health science so that it becomes something useful by using Machine Learning. There are so many benefits that are used in predicting several diseases that can be anticipated. In this study the dataset is dividedinto 2 parts, namely training data and testing data using split validation. Based on the results of the test that have been carried out in this study, the algorithm that has the highest accuracyvalue on balanced data is Logistic Regression with an accuracy rate of 75.65%, while for unbalanced data, the algorithm that has the highest accuracy results is Logistic Regression, Random Forest, SVM, and KNN with an accuracy rate of 98.63%. This testing process is carried out to identify stroke with data mining algorithms

Downloads

Download data is not yet available.

References

A. Byna and M. Basit, “Penerapan Metode Adaboost Untuk Mengoptimasi Prediksi Penyakit Stroke Dengan Algoritma Naïve Bayes,” J. Sisfokom (Sistem Inf. dan Komputer), vol. 9, no. 3, pp. 407–411, 2020, doi: 10.32736/sisfokom.v9i3.1023.

U. Amelia et al., “IMPLEMENTASI ALGORITMA SUPPORT VECTOR MACHINE ( SVM ) UNTUK PREDIKSI PENYAKIT STROKE DENGAN ATRIBUT BERPENGARUH,” vol. III, pp. 254–259, 2022.

R. S. Rohman, R. A. Saputra, and D. A. Firmansaha, “Komparasi Algoritma C4.5 Berbasis PSO Dan GA Untuk Diagnosa Penyakit Stroke,” CESS (Journal Comput. Eng. Syst. Sci., vol. 5, no. 1, p. 155, 2020, doi: 10.24114/cess.v5i1.15225.

R. E. Pambudi and F. I. Komputer, “Klasifikasi Penyakit Stroke Menggunakan Algoritma Decision Tree C.45 1,2,3,” vol. 16, no. x, pp. 221–226, 1978.

D. Andri and M. Reza, “Penerapan Algoritma K-Nearest Neighbord Untuk Prediksi Kematian Akibat Penyakit Gagal Jantung,” vol. III, no. 2020, pp. 105–112, 2022.

F. Karim, G. W. Nurcahyo, and S. Sumijan, “Sistem Pakar dalam Mengidentifikasi Gejala Stroke Menggunakan Metode Naive Bayes,” J. Sistim Inf. dan Teknol., vol. 3, pp. 221–226, 2021, doi: 10.37034/jsisfotek.v3i4.69.

F. Akbar, H. W. Saputra, and A. K. Maulaya, “Implementation of Decision Tree Algorithm C4 . 5 and Support Vector Regression for Stroke Disease Prediction Implementasi Algoritma Decision Tree C4 . 5 dan Support Vector Regression untuk Prediksi Penyakit Stroke,” vol. 2, no. October, pp. 61–67, 2022.

A. Puspitawuri, E. Santoso, and C. Dewi, “Diagnosis Tingkat Risiko Penyakit Stroke Menggunakan Metode K-Nearest Neighbor dan Naïve Bayes,” J. Pengemb. Teknol. Inf. dan Ilmu Komput., vol. 3, no. 4, pp. 3319–3324, 2019, [Online]. Available: e-issn: 2548-964X http://j-ptiik.ub.ac.id.

S. L. Pratiwi and H. Cahyono, “View metadata, citation and similar papers at core.ac.uk,” PENGARUH Pengguna. PASTA LABU KUNING (Cucurbita Moschata) UNTUK SUBSTITUSI TEPUNG TERIGU DENGAN PENAMBAHAN TEPUNG ANGKAK DALAM PEMBUATAN MIE KERING, vol. 1, no. 2, pp. 274–282, 2020.

N. Azwanti and E. Elisa, “Analisis Pola Penyakit Hipertensi Menggunakan Algoritma C4.5,” InfoTekJar (Jurnal Nas. Inform. dan Teknol. Jaringan), vol. 3, no. 2, pp. 116–123, 2019, doi: 10.30743/infotekjar.v3i2.944.

E. I. Utami Indarto; Raharjo, Suwanto, “Prediksi Risiko Kematian Pasien Stroke Perdarahan Dengan Menggunakan Teknik Klasifikasi Data Mining,” Inf. Interaktif, vol. 5, no. Vol 5, No 2 (2020): Jurnal Informasi Interaktif, pp. 86–91, 2020, [Online]. Available: http://e-journal.janabadra.ac.id/index.php/informasiinteraktif/article/view/1172/790.

Y. Tampil, H. Komaliq, and Y. Langi, “Analisis Regresi Logistik Untuk Menentukan Faktor-Faktor Yang Mempengaruhi Indeks Prestasi Kumulatif (IPK) Mahasiswa FMIPA Universitas Sam Ratulangi Manado,” d’CARTESIAN, vol. 6, no. 2, p. 56, 2017, doi: 10.35799/dc.6.2.2017.17023.

Tampil, Yumira, Hanny Komaliq, and Yohanes Langi. 2017. “Analisis Regresi Logistik Untuk Menentukan Faktor-Faktor Yang Mempengaruhi Indeks Prestasi Kumulatif (IPK) Mahasiswa FMIPA Universitas Sam Ratulangi Manado.” D’CARTESIAN 6 (2): 56. https://doi.org/10.35799/dc.6.2.2017.17023.

Downloads

Published

2022-10-28

How to Cite

[1]
Y. . Azhar, A. K. . Firdausy, and P. J. . Amelia, “Perbandingan Algoritma Klasifikasi Data Mining Untuk Prediksi Penyakit Stroke”, SINTECH Journal, vol. 5, no. 2, pp. 191-197, Oct. 2022.