DSpace Repository

Comparative Analysis of Feature Selection Algorithms for Computational Personality Prediction from Social Media

Show simple item record

dc.contributor.author Marouf, Ahmed Al
dc.contributor.author Mahmud, Hasan
dc.contributor.author Hasan, Md. Kamrul
dc.date.accessioned 2022-01-08T08:40:28Z
dc.date.available 2022-01-08T08:40:28Z
dc.date.issued 2020-02-19
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/6695
dc.description.abstract With the rapid growth of social media, users are getting involved in virtual socialism, generating a huge volume of textual and image contents. Considering the contents such as status updates/tweets and shared posts/retweets, liking other posts is reflecting the online behavior of the users. Predicting personality of a user from these digital footprints has become a computationally challenging problem. In a profile-based approach, utilizing the user-generated textual contents could be useful to reflect the personality in social media. Using huge number of features of different categories, such as traditional linguistic features (character-level, word-level, structural, and so on), psycholinguistic features (emotional affects, perceptions, social relationships, and so on) or social network features (network size, betweenness, and so on) could be useful to predict personality traits from social media. According to a widely popular personality model, namely, big-five-factor model (BFFM), the five factors are openness-to-experience, conscientiousness, extraversion, agreeableness, and neuroticism. Predicting personality is redefined as predicting each of these traits separately from the extracted features. Traditionally, it takes huge number of features to get better accuracy on any prediction task although applying feature selection algorithms may improve the performance of the model. In this article, we have compared the performance of five feature selection algorithms, namely the Pearson correlation coefficient (PCC), correlation-based feature subset (CFS), information gain (IG), symmetric uncertainly (SU) evaluator, and chi-squared (CHI) method. The performance is evaluated using the classic metrics, namely, precision, recall, f-measure, and accuracy as evaluation matrices. en_US
dc.language.iso en_US en_US
dc.publisher IEEE Transactions on Computational Social Systems, IEEE en_US
dc.subject Chi-squared (CHI) method en_US
dc.subject Computational personality prediction en_US
dc.subject Feature selection algorithms en_US
dc.subject Information gain (IG) en_US
dc.subject Pearson correlation coefficient (PCC) en_US
dc.subject Social media en_US
dc.title Comparative Analysis of Feature Selection Algorithms for Computational Personality Prediction from Social Media en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account

Statistics