Identifying hate speech in Banglish Language

Siddika, Ayesha; Akter, Sumaiya

DSpace Home
→
Faculty of Science and Information Technology
→
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING
→
Project Report
→
View Item

dc.contributor.author	Siddika, Ayesha
dc.contributor.author	Akter, Sumaiya
dc.date.accessioned	2026-06-11T09:56:27Z
dc.date.available	2026-06-11T09:56:27Z
dc.date.issued	2025-01-12
dc.identifier.uri	http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/17289
dc.description	Project Report	en_US
dc.description.abstract	Social media's growth has facilitated the quick spread of both positive and negative content, and hate speech is one of the most harmful forms of online expression. The goal of this study is to detect hate speech in Banglish, a code-mixed language commonly used in social media conversations that blends English and Bangla. The research is about to develop a machine learning and deep learning-based system for identifying hate speech in Banglish in order to get beyond the unique challenges caused by the combination of languages and everyday idioms. A collection of Banglish text from several social media platforms was preprocessed using techniques like tokenization, lemmatization, and normalization. The expansion of words such as "pic" to "picture",img to “image” and "u" to "you" The method of lemmatization decreased linguistic variances. We have applied six deep learning models which are LSTM, GRU, Bi-LSTM, Bi-GRU, GRU+LSTM and Bi-LSTM+Bi-GRU. These models' performance was evaluated using confusion matrices, F1-score, accuracy, precision, and recall. The multiclass classification job has an accuracy of 83.93% in “Bi-LSTM+Bi-GRU” model which is a hybrid model in differentiating between seven different classification types. In particular, this study advances automated hate speech detection systems for code-mixed languages like Banglish. The findings indicate that while current models have promise, more study is needed to address problems including data imbalance and the identification of more subtle kinds of hate speech. Future initiatives to improve the precision and robustness of hate speech detection systems across other languages are made possible by this research.	en_US
dc.description.sponsorship	Daffodil International University	en_US
dc.language.iso	en_US	en_US
dc.publisher	Daffodil International University	en_US
dc.subject	Social Media	en_US
dc.subject	Hate Speech Detection	en_US
dc.subject	Banglish Language Processing	en_US
dc.subject	Machine Learning	en_US
dc.subject	Deep Learning Models	en_US
dc.subject	Natural Language Processing (NLP)	en_US
dc.title	Identifying hate speech in Banglish Language	en_US
dc.type	Other	en_US