IMPLEMENTATION OF C5.0 ALGORITHM FOR SHAPING PATTERNS OF DIAGNOSIS OF DIABETES MELLITUS DISEASE

  • Muhammad Latief Saputra mahasiswa
  • Irwan Budiman
  • Radityo Adi Nugroho
  • Dwi Kartini
  • Muliadi
Keywords: C5.0, Diabetes Mellitus, Confusion Matrix, Classification

Abstract

This study applies the C5.0 algorithm to form a decision tree pattern for diagnosing diabetes mellitus. C5.0 algorithm is a decision tree based classification algorithm. This algorithm focuses on the acquisition of information gain on all attributes. The data used is a diabetes mellitus dataset obtained from the Kaggle database website. Data preprocessing is done and data sharing is done 4 times with the distribution of training data 60% 70% 80% and 90%. Data sharing uses stratafied random sampling methods so that the distribution of training and testing data is in accordance with its portion. Calculation of accuracy performance using confusion matrix. Classification performance using C5.0 algorithm. With 90% training data get 72.73% accuracy of rules generated as many as 70 rules. With 80% training data the accuracy value is 74.03%. The rule is 64 rules. With 70% training data get an accuracy value of 76.52% of the rules generated 59 rules. With 60% training data get an accuracy value of 74.59% of the rules generated as many as 53 rules. From all the experiments that have been done, the best accuracy is found in experiments with 70% training data.

Published
2020-11-17
How to Cite
Saputra, M. L., Irwan Budiman, Radityo Adi Nugroho, Dwi Kartini, & Muliadi. (2020). IMPLEMENTATION OF C5.0 ALGORITHM FOR SHAPING PATTERNS OF DIAGNOSIS OF DIABETES MELLITUS DISEASE. Journal of Data Science and Software Engineering, 1(02), 85-97. Retrieved from https://jurnalmahasiswamipa.ulm.ac.id/index.php/integer/article/view/22
Section
Articles