Patient Narratives To Specialist Prediction: A Bert-Based Nlp Approach For Automatic Medical Specialty Classification

Rashid, Md. Rejwan

DSpace Home
→
Faculty of Science and Information Technology
→
Department of Software Engineering
→
Thesis Report
→
View Item

dc.contributor.author	Rashid, Md. Rejwan
dc.date.accessioned	2026-04-25T09:23:42Z
dc.date.available	2026-04-25T09:23:42Z
dc.date.issued	2025-12-30
dc.identifier.citation	SWT	en_US
dc.identifier.uri	http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/17027
dc.description	Thesis Report	en_US
dc.description.abstract	Healthcare documentation management depends on medical specialty classification to achieve efficient organization and management of medical records. The research investigates the performance of transformer-based deep learning systems when used for medical transcription classification to achieve accurate medical domain identification. The dataset used consists of a large number of real clinical notes labeled with one of 29 medical specialties. The text-based data introduces an original approach to implement NLP technology for healthcare applications. The research creates automated medical specialty identification systems from unprocessed doctor-patient conversation recordings to enhance healthcare operational efficiency and patient referral accuracy. The research starts by reviewing existing studies about medical text classification and transformer models including DistilBERT. The research includes complete details about the dataset origin and volume and all preprocessing operations that were performed. Special attention is given to handling class imbalance, where underrepresented specialties are assigned appropriate class weights during training. The data preparation process is emphasized to ensure reliability and relevance of the content used in training. The model evaluation depends on four performance metrics which include top-1 accuracy and top-3 accuracy and macro F1-score and weighted F1-score. The proposed model reached a top-1 accuracy of 54.03% while delivering a top-3 accuracy of 95.61% which proves its capacity to generate multiple suitable predictions for each input. The evaluation process of these metrics takes place at various checkpoints to enable comparison. The model's output results undergo confusion matrix analysis to identify which specialties have the most misclassification errors. The research demonstrates that transformer-based NLP models achieve success in medical documentation automation and specialty classification work. The paper presents its current boundaries and future research paths which involve using domain- specific models together with ensemble methods. The research investigates actual clinical documentation problems to develop better AI healthcare solutions and build deep learning systems that understand medical texts.	en_US
dc.description.sponsorship	DIU	en_US
dc.language.iso	en_US	en_US
dc.publisher	Daffodil International University	en_US
dc.subject	Medical Text Classification	en_US
dc.subject	BERT Model	en_US
dc.subject	Patient Narratives	en_US
dc.subject	Clinical NLP	en_US
dc.subject	Specialty Prediction	en_US
dc.title	Patient Narratives To Specialist Prediction: A Bert-Based Nlp Approach For Automatic Medical Specialty Classification	en_US
dc.type	Thesis	en_US