DSpace Repository

A Machine Learning Based Approach to Classify Tense from English Text

Show simple item record

dc.contributor.author Ayman, Umme
dc.contributor.author Islam, Md. Shafiqul
dc.contributor.author Rahat, Md. Azmain Mahtab
dc.contributor.author Raza, Dewan Mamun
dc.contributor.author Chakraborty, Narayan Ranjan
dc.contributor.author Bijoy, Md. Hasan Imam
dc.date.accessioned 2025-12-20T07:41:47Z
dc.date.available 2025-12-20T07:41:47Z
dc.date.issued 2024-12-19
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/16179
dc.description Conference paper en_US
dc.description.abstract This paper investigates the classification of tense in English text using machine learning algorithms. Support Vector Machine (SVM), Random Forest (RF), Multinomial Naive Bayes (MNB), Decision Tree (DT), XGBoost, and K-Nearest Neighbors (KNN) are the six classifiers used in the study. The dataset was collected from diverse sources including novels, books, blogs, articles, social media platforms, newspapers, websites and some of them self-made. The data underwent preprocessing steps such as cleaning, normalization, and feature extraction using TfidfVectorizer. Among the other algorithms, SVM achieved the highest accuracy at 97.17%. Classifier performance was assessed with metrics such as F1-score, recall, accuracy, and precision. To evaluate performance, ROC curves, and confusion matrices were also examined. The study underlines the necessity for focused approaches and draws attention to the significant gaps in the field of natural language processing (NLP) regarding tense classification studies. By leveraging machine learning, this research aims to enhance the accuracy and contextual appropriateness of tense classification, thereby improving cross-cultural communication and understanding in machine translation systems. This research contributes to NLP by offering a robust approach to tense classification and demonstrates the potential of SVM in achieving high accuracy for this task. Future work will focus on addressing limitations such as short training data, overfitting and tense conversion. en_US
dc.language.iso en_US en_US
dc.publisher Scopus en_US
dc.subject Overfitting en_US
dc.subject Random forests en_US
dc.subject Machine translation en_US
dc.subject Nearest neighbor methods en_US
dc.subject Training data en_US
dc.subject Social networking (online) en_US
dc.subject Machine learning algorithms en_US
dc.subject Measurement en_US
dc.subject Support vector machines en_US
dc.title A Machine Learning Based Approach to Classify Tense from English Text en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account