DSpace Repository

Multi-Head Self-Attention Mechanisms in Vision Transformers for Retinal Image Classification

Show simple item record

dc.contributor.author Oahidul Islam
dc.contributor.author Kumer, Kowshik
dc.contributor.author Akter, Sumaia
dc.contributor.author Uddin, Md Mohi
dc.date.accessioned 2025-12-01T09:56:48Z
dc.date.available 2025-12-01T09:56:48Z
dc.date.issued 2024-12-19
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/15937
dc.description Conference paper en_US
dc.description.abstract Diabetic Retinopathy (DR) is a serious condition that can lead to eyesight loss in those who have diabetes if not identified early. The traditional diagnosis approach relies on the manual scrutiny of retinal images by specialists, a process that can be both time-consuming and susceptible to errors. Artificial intelligence (AI) has the potential to revolutionize this field by providing automated, precise, and rapid identification of diabetic retinopathy (DR). In our study, we utilized Vision Transformers (ViTs) to achieve effective classification of retinal images. ViTs are advanced models that divide images into patches and utilize multi-head self-attention approaches to identify important features. The model demonstrated outstanding achievement, with a success rate of 96.13%, precision of 0.92, recall of 0.95, F1 score of 0.93, and ROC- AUC score of 0.969. The results of this study have significant implications for improving the detection of DR, enabling timely intervention, and potentially safeguarding the vision of a large number of individuals. Future research will focus on improving the resilience of the model by incorporating a wider range of datasets and optimizing its integration with medical procedures to ensure dependable and efficient performance in real-world scenarios. en_US
dc.language.iso en_US en_US
dc.subject Vision Transformer en_US
dc.subject Retinal Image Classification en_US
dc.subject Deep Learning en_US
dc.subject Diabetic Retinopathy en_US
dc.subject Medical Diagnostics en_US
dc.subject Fundus Image en_US
dc.subject Classification en_US
dc.title Multi-Head Self-Attention Mechanisms in Vision Transformers for Retinal Image Classification en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account