A Machine Learning Approach for Diabetes Prediction Using Ensemble Feature Selection and Hyperparameter Tuning.

Al-Mozahid, Md.

DSpace Home
→
Faculty of Science and Information Technology
→
Department of Computer Science and Engineering
→
Project Report
→
View Item

dc.contributor.author	Al-Mozahid, Md.
dc.date.accessioned	2026-06-21T09:30:10Z
dc.date.available	2026-06-21T09:30:10Z
dc.date.issued	2025-01-13
dc.identifier.uri	http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/17323
dc.description	Project Report	en_US
dc.description.abstract	Early identification of diabetes is important for controlling the disease and avoiding problems. To improve the Predictive data mining of Diabetes prediction based on a dataset from Kaggle that focuses on diabetes, In this study we propose an ensemble feature selection method (EFSM) which is then used to enhance accuracy diabetes prediction. We have applied seven models to solve this problem, including Random Forest, Decision Tree, Logistic Regression, XGBoost, AdaBoost (DT weak learner),K-Nearest Neighbors (KNN) and Support Vector Machines (SVM). Performing 5-fold cross-validation, XGBoost provided the best model with an accuracy of 98% which further showcases its superior pattern recognition abilities in our medical data. The novel EFSM is a technique that efficiently combines and scores features according to how often they are selected by multiple selection techniques, and thus will improve the performance of our models. These findings emphasize the potential of our method in diabetes prediction, yielding a reliable model that has the potential to assist with early diagnostics and patient management.	en_US
dc.description.sponsorship	Daffodil International University	en_US
dc.language.iso	en_US	en_US
dc.publisher	Daffodil International University	en_US
dc.subject	Machine Learning	en_US
dc.subject	Logistic Regression	en_US
dc.subject	Xgboost	en_US
dc.subject	Data Mining	en_US
dc.subject	Early Identification	en_US
dc.title	A Machine Learning Approach for Diabetes Prediction Using Ensemble Feature Selection and Hyperparameter Tuning.	en_US
dc.type	Other	en_US