DSpace Repository

A machine learning and deep learning approach for Bengali newspaper headline categorization

dc.contributor.author Dola, Shamrita Tazbin
dc.date.accessioned 2024-06-29T09:33:46Z
dc.date.available 2024-06-29T09:33:46Z
dc.date.issued 2024-01-01
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/12797
dc.description.abstract The internet is a vast repository of information and data that people can access from any place at any time. Because current technology makes information so readily available, the amount of online news has increased tremendously, and people are increasingly eager to read news articles directly from news websites. In general, the term online news portal covers Facebook, Twitter, WhatsApp, Telegram, Instagram, blogs, and other services. The quantity of news on these portals grows daily, and the number of readers grows with it. All of this online news is digital data, and as the volume of digital data grows, so does the need for data categorization. Numerous methods, including machine learning, deep learning, transfer learning, and other data mining techniques, may be used to classify data. Such classification lets readers deduce the primary idea of a news story from its headline alone, and natural language processing techniques make it possible for data in any language. This article divides Bengali news stories into six categories: politics, entertainment, sports, national, international, and IT, using both deep learning and machine learning techniques. The techniques used to assign these categories include BiLSTM, GRU, and Uni-Gram, as well as the conventional machine learning algorithms SVM, MNB, RF Classifier, and LR.
The models achieve the following accuracy: GRU 84.01%, BiLSTM 83.42%, Random Forest Classifier 65%, Support Vector Machine 65%, Logistic Regression 64%, and Multinomial Naive Bayes 61%. en_US
dc.publisher Daffodil International University en_US
dc.subject Natural Language Processing (NLP) en_US
dc.subject Artificial Intelligence (AI) en_US
dc.subject Machine Learning en_US
dc.subject Deep Learning en_US
dc.subject Bengali Linguistics en_US
dc.subject Data Science en_US
dc.title A machine learning and deep learning approach for Bengali newspaper headline categorization en_US
dc.type Other en_US
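
The abstract lists Multinomial Naive Bayes (MNB) and unigrams among the techniques used for headline categorization. The record does not include the author's code or dataset, so the following is only a minimal sketch of how a unigram MNB headline classifier might look. The class name UnigramNaiveBayes, the whitespace tokenizer, the Laplace smoothing value, and the English toy headlines are all assumptions made for illustration; they are not taken from the thesis.

```python
import math
from collections import Counter, defaultdict


def tokenize(headline):
    # Assumption: unigram features are lower-cased whitespace-separated tokens.
    return headline.lower().split()


class UnigramNaiveBayes:
    """Illustrative multinomial Naive Bayes over unigram counts (Laplace smoothing)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing constant (assumed value)

    def fit(self, headlines, labels):
        self.class_counts = Counter(labels)      # number of headlines per category
        self.word_counts = defaultdict(Counter)  # token counts per category
        self.vocab = set()
        for text, label in zip(headlines, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def log_scores(self, headline):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + self.alpha * vocab_size
            score = math.log(n_docs / total_docs)  # log prior
            for token in tokenize(headline):
                if token in self.vocab:  # tokens never seen in training are skipped
                    score += math.log((counts[token] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, headline):
        scores = self.log_scores(headline)
        return max(scores, key=scores.get)


def accuracy(model, headlines, labels):
    # Fraction of headlines whose predicted category matches the label.
    correct = sum(model.predict(h) == y for h, y in zip(headlines, labels))
    return correct / len(labels)


# Invented placeholder headlines, NOT the thesis dataset.
train_headlines = [
    "parliament passes new budget bill",
    "opposition party calls election rally",
    "national team wins cricket series",
    "striker scores twice in football final",
    "new smartphone chip launched",
    "software update fixes security bug",
]
train_labels = ["politics", "politics", "sports", "sports", "it", "it"]

model = UnigramNaiveBayes().fit(train_headlines, train_labels)
print(model.predict("cricket team wins final"))
```

On this toy data the final call prints sports. Accuracy figures such as those in the abstract would be computed with a function like accuracy on a held-out test split, not on the training headlines.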

