DSpace Repository

Redefining Bangla Text Processing with a New Stemming Methodology

Show simple item record

dc.contributor.author Tanvir, Md Istiak
dc.date.accessioned 2026-06-25T04:34:23Z
dc.date.available 2026-06-25T04:34:23Z
dc.date.issued 2025-01-13
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/17455
dc.description Project Report en_US
dc.description.abstract Stemming is a basic NLP technique that normalizes the linguistic structure and improves the performance of text analysis by stripping words to their roots or base forms by removing suffixes which are quite vital in applications such as text normalization, information retrieval and keeping linguistic consistency. The Bangla language has a special stemming problem due to its extensive morphological structure with many inflectional changes and intricate grammatical rules. Hence, finding the root form of a word in Bangla involves much more complex affixation patterns and compound word creation than is evident in other languages with simpler grammatical systems. To handle the linguistic complexities of Bangla better than the previous approaches. In this work a new stemming method is developed for the language. It is more adaptable and practical for root word extraction in the real world because it has been equipped with the most updated methods to recognize and handle both the Bangla suffixes and morphological changes. Very encouraging results are obtained from the complex experiment carried out on a dataset containing 1,000 unique Bangla words. It’s extremely high F1-score of 85.66% and high accuracy rate of 87.2% do, in fact, justify its superior performance over traditional stemming algorithms. It is apparent from the present study that this newly proposed stemming strategy significantly enhances the efficacy and efficiency of all Bangla text processing systems, including search engines, information retrieval platforms, and all NLP applications. This study paves the way for further development in the Bangla Language Processing problem and emphasizes the cruciality of continued research in developing language-specific Natural Language Processing tools. en_US
dc.description.sponsorship Daffodil International University en_US
dc.language.iso en_US en_US
dc.publisher Daffodil International University en_US
dc.subject Bangla Stemming en_US
dc.subject Natural Language Processing (NLP) en_US
dc.subject Morphological Analysis en_US
dc.subject Root Word Extraction en_US
dc.subject Bangla Language Processing en_US
dc.subject Information Retrieval en_US
dc.title Redefining Bangla Text Processing with a New Stemming Methodology en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account