DSpace Repository

Advanced Feature Engineering andOptimizedRegression Models for predicting Heart DiseaseMortality Risk and Severity: AsupervisedMachine Learning Approach

Show simple item record

dc.contributor.author Akter, Maria
dc.date.accessioned 2026-05-10T07:28:57Z
dc.date.available 2026-05-10T07:28:57Z
dc.date.issued 2025-09-19
dc.identifier.citation SWT en_US
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/17172
dc.description Thesis Report en_US
dc.description.abstract Heart disease is a group of diseases that affect the heart and blood vessels. Someexamples are heart failure, arrhythmia's, valve disorders, and coronary arterydisease. It is a global health problem that needs to be found early and worked on together tobecontrolled and stopped. Heart disease is one of the major causes of mortality aroundtheworld, and hence early detection and severity determination are essential for propertreatment. The study aims at the application of supervised machine learning algorithmsfor the prediction of mortality risk and disease progression in patients with heart failureusing the Heart Failure Clinical Records dataset. Three machine learning algorithms—logistic regression, random forest, and XGBoost—were trained for classification(deathprediction) and regression (severity prediction). Cutting-edge feature engineeringtechniques, such as Principal Component Analysis (PCA), Shapely AdditiveExplanations (SHAP), and evolutionary algorithms, were employed in the selectionofsignificant predictors: age, serum creatinine, and ejection fraction. SHAPandLocal Interpretable Model-agnostic Explanations (LIME) were included to ensure model interpretability and clinical utility of the results. For classification tasks, theperformance was examined using precision, recall, accuracy, and ROC-AUC; forregression tasks, it was evaluated using mean squared error (MSE), root meansquarederror (RMSE), and R². With 85% accuracy, an ROC-AUC of 0.91 for mortalityprediction, and an R² of 0.75 for severity progression, XGBoost performed better thanthe other models. Logistic regression performed slightly worse compared torandomforest, which showed competitive performance. These results prove that XGBoost isauseful instrument for predicting the mortality and severity of heart disease whenpairedwith powerful feature engineering and interpretability techniques. Validatingthesemodels in larger, diverse cohorts and implementing them in medical settings shouldbethe main goals of future research. en_US
dc.description.sponsorship DIU en_US
dc.language.iso en_US en_US
dc.publisher Daffodil International University en_US
dc.subject Supervised Machine Learning en_US
dc.subject Feature Engineering en_US
dc.subject Regression Models en_US
dc.subject Heart Disease Prediction en_US
dc.subject Feature Engineeringin Clinical Data en_US
dc.subject Optimized Regression Models en_US
dc.title Advanced Feature Engineering andOptimizedRegression Models for predicting Heart DiseaseMortality Risk and Severity: AsupervisedMachine Learning Approach en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account