DSpace Repository

Missing Information in Imbalanced Data Stream

Show simple item record

dc.contributor.author Halder, Bohnishikha
dc.contributor.author Ahmed, Md Manjur
dc.contributor.author Amagasa, Toshiyuki
dc.contributor.author Isa, Nor Ashidi Mat
dc.contributor.author Faisal, Rahat Hossain
dc.contributor.author Rahman, Md. Mostafijur
dc.date.accessioned 2022-03-14T09:38:16Z
dc.date.available 2022-03-14T09:38:16Z
dc.date.issued 2021
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/7500
dc.description.abstract From a real-world perspective, missing information is an ordinary scenario in data stream. Generally, missing data generate diverse problems in recognizing the pattern of data (i.e., clustering and classification). Particularly, missing data in data stream is a challenging topic. With imbalanced data, the problem of missing data greatly affects pattern recognition. As a solution to all these issues, this study puts forward an adaptive technique with fuzzy-based information decomposition method, which simultaneously solves the problem of incomplete data and overcomes the imbalanced data stream in a dataset. The main purpose of the proposed fuzzy adaptive imputation approach (FAIA) is to represent the effect of missing values in imbalance data stream and handle the missing data problem in imbalance data stream. FAIA is a single pass method. It considers adaptive selection of intervals based on all observed instances by using the interrelationship of attributes to identify correct interval for computing missing instances. Here, the interrelationship of two attributes means one attribute’s value of an instance depends on another attribute’s value of the same instance. In FAIA, after measuring all interval distances from a certain missing value, the least distance is selected for this missing value. Synthetic data of minority classes are generated using the same process of missing value imputation for balancing data that is called oversampling. Instances of the datasets are divided into the chunks in the data stream to balance data without any ensemble of previous chunks because missing values may misguide the future chunks. To demonstrate the performance of FAIA, the experiment is divided into three parts: missing data imputation, imbalanced information for offline data for the data stream, and imbalanced information with a missing value for offline data. Eleven numerical datasets with different dimensions from various repositories are considered for the computing performance of missing data imputation and imbalanced data without data stream. Four different datasets are also used to measure the performance of an imbalanced data stream. In maximum measuring cases, the proposed method outperforms. en_US
dc.language.iso en_US en_US
dc.publisher Scopus en_US
dc.subject interrelationship en_US
dc.subject FAIA en_US
dc.subject decomposition en_US
dc.subject simultaneously en_US
dc.title Missing Information in Imbalanced Data Stream en_US
dc.title.alternative Fuzzy Adaptive Imputation Approach en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account

Statistics