DSpace Repository

Revisiting the Class Imbalance Issue in Software Defect Prediction

Show simple item record

dc.contributor.author Sohan, Md. Fahimuzzman
dc.contributor.author Kabir, Md Alamgir
dc.contributor.author Jabiullah, Md. Ismail
dc.contributor.author Rahman, Sheikh Shah Mohammad Motiur
dc.date.accessioned 2021-10-26T09:32:50Z
dc.date.available 2021-10-26T09:32:50Z
dc.date.issued 2019-04-04
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/6293
dc.description.abstract Software defect prediction is related to the testing area of software industry. Several methods have been developed for the prediction of bugs in software source codes. The objective of this study is to find the inconsistency of performance between imbalances and balance data set and to find the distinction of performance between single classifier and aggregate classifier (voting). In this investigation, eight publicly available data sets have collected, also seven algorithms and hard voting are used for finding precision, recall and F-1 score to predict software defect. In these collected data, two sets are almost balanced. For this investigation, these balanced data sets have converted into imbalanced sets as average non-defective and defective ratio of the other 6 data sets. The experiment result shows that performance of the two balanced data sets is lower than other six sets. After conversion of two data sets, the performance has increased as like as other six data sets. Another observation is the performance metric that shows the results of precision, recall and F1-score for voting are 0.92, 0.84 and 0.87 respectively, which are better than other single classifier. This study has been able to shows that- imbalance of non-defective and defective classes have a big impact on software defect prediction and the voting is the best performer among the classifiers. en_US
dc.language.iso en_US en_US
dc.publisher 2nd International Conference on Electrical, Computer and Communication Engineering, ECCE 2019, IEEE en_US
dc.subject Artificial neural networks en_US
dc.subject Machine learning algorithms en_US
dc.subject Artificial intelligence en_US
dc.subject Pattern classification en_US
dc.subject Software quality en_US
dc.title Revisiting the Class Imbalance Issue in Software Defect Prediction en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account

Statistics