DSpace Repository

An Investigation and Evaluation of N-Gram, TF-IDF and Ensemble Methods in Sentiment Classification

Show simple item record

dc.contributor.author Rahman, Sheikh Shah Mohammad Motiur
dc.contributor.author Biplob, Khalid Been Md. Badruzzaman
dc.contributor.author Rahman, Md. Habibur
dc.contributor.author Sarker, Kaushik
dc.contributor.author Islam, Takia
dc.date.accessioned 2022-01-08T08:39:09Z
dc.date.available 2022-01-08T08:39:09Z
dc.date.issued 2020
dc.identifier.uri http://dspace.daffodilvarsity.edu.bd:8080/handle/123456789/6678
dc.description.abstract In the area of sentiment analysis and classification, the performance of the classification tasks can be varied based on the usage of text vectorization and feature extraction methods. This paper represents a detailed investigation and analysis of the impact on feature extraction methods to attain the highest classification accuracy of the sentiment from user reviews. Unigram, Bigram and Trigram are applied as n-gram vectorization models with TF-IDF features extraction method individually. Accuracy, misclassification rate, Receiver Operating Characteristics (ROC) and recall-precision are used in this study to evaluate which are counted as the most important performance measurement parameters in machine learning based approaches. Parameters are measured by the output obtained from Bagged Decision Tree (BDT), Random Forest (RF), Ada Boost (ADA), Gradient Boost (GB) and Extra Tree (ET). The outcomes of this study is to find out the best fitted combination of term frequency–inverse document frequency (TF-IDF) and n-grams for different data size. en_US
dc.language.iso en_US en_US
dc.publisher Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST en_US
dc.subject N-Gram en_US
dc.subject TF-IDF en_US
dc.subject Ensemble Methods en_US
dc.title An Investigation and Evaluation of N-Gram, TF-IDF and Ensemble Methods in Sentiment Classification en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account

Statistics