| dc.description.abstract |
Diabetes, a potentially life-threatening condition if undetected, necessitates an early and accurate diagnosis. This study evaluates traditional machine learning (ML) models and deep neural networks (DNNs) for diabetes prediction, utilizing the Pima Indians Diabetes Dataset (PIDD) and the Sylhet Diabetes Hospital dataset (SDHD). We optimized six ML models—Logistic Regression, Decision Trees, Random Forests, Gradient Boosting, Support Vector Machines, and K-Nearest Neighbors—using GridSearchCV and ensemble learning, and deployed DNNs with varied train-test splits (90-10%, 80-20%, and 70-30%). Evaluations on accuracy, precision, recall, F1-score, and AUC-ROC highlighted RF’s outstanding performance after GridSearchCV on the SDHD with Accuracy: 98%, Precision: 100%, Recall: 96%, F1 Score: 98%, AUC: 100% and on PIDD with Accuracy: 76%, Precision: 63%, Recall: 80%, F1 Score:70%, AUC:81%. This underscores advanced ML’s effectiveness in handling complex datasets and enhancing early diabetes detection strategies. |
en_US |