Abstract:
This project presents a system for classifying the topic and sentiment of Bengali
comments using both machine learning and deep learning algorithms. The system's
primary use is detecting toxic behavior and hate speech. Every step of the
experiment followed widely accepted standards.
One of the main challenges was working with Bengali, a low-resource language,
which required extensive custom preprocessing and model design. All trained
models were evaluated using standard metrics.
Beyond the technical aspects, this project also aims to improve online safety
and moderation for the Bengali language. A web-based comment analysis tool was
also developed as part of this project.