Gender Detection from Bengali Voice Using Machine Learning Approach"

Sarker, Neela

DSpace Home
→
Faculty of Science and Information Technology
→
Department of Computer Science and Engineering
→
Project Report
→
View Item

dc.contributor.author	Sarker, Neela
dc.date.accessioned	2025-09-14T06:09:47Z
dc.date.available	2025-09-14T06:09:47Z
dc.date.issued	2024-07-13
dc.identifier.uri	http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/14471
dc.description	Project report	en_US
dc.description.abstract	Gender identification presents a notable challenge in signal processing, with recent attention shifting towards vocal feature analysis over traditional image classification methods. Acknowledging that gender classification encompasses more than fundamental frequency and pitch, the research explores the importance of feature selection, akin to dimensionality reduction, crucial for identifying gender-specific traits. This study delves into the efficacy and significance of machine learning algorithms in addressing voice-based gender identification, examining the relationship between vocal fold thickness, wavelength, and pitch perception, particularly in distinguishing male and female voices. Leveraging machine learning techniques, the feasibility of gender identification from voice signals is demonstrated, employing methods such as Discrete Fourier Transform, Mel- spaced filter-bank, and log filter-bank energies to extract MFCC features. Gender identification from natural voice holds considerable implications, especially in practical applications where gender differentiation is vital. While conventional voice-to-text conversion may not require gender detection, real-world scenarios demand it, aligning with Natural Language Processing, a subset of artificial intelligence. The methodology entails a systematic workflow, encompassing input audio files, pre-processing, feature extraction, model training, and testing with separate datasets. The study yields a remarkable dataset of 2384 samples from more than 50 speakers, including 607 male,663 child,451 third gender, and 663 female voices. This research underscores machine learning's potential in addressing gender identification from voice, emphasizing its significance across various artificial intelligence applications. Our dataset comprises individuals aged 0 to 42+ from diverse locations, meticulously categorized into age groups and gender categories. Utilizing a smartphone and audio recording software, we collect a comprehensive database, strategically divided into training and testing sets. Notably, SVM and Random Forest emerge as top performers, achieving accuracies of 96.28% and 97.72%, respectively.	en_US
dc.description.sponsorship	DIU	en_US
dc.language.iso	en_US	en_US
dc.publisher	Daffodil International University	en_US
dc.subject	Natural language processing (NLP)	en_US
dc.subject	Machine Learning	en_US
dc.subject	Computational linguistics	en_US
dc.title	Gender Detection from Bengali Voice Using Machine Learning Approach"	en_US
dc.type	Other	en_US