• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

F_MixBERT: Sentiment Analysis Model using Focal Loss for Imbalanced E-commerce Reviews

Vol. 18, No. 2, February 29, 2024
10.3837/tiis.2024.02.001, Download Paper (Free):

Abstract

Users' comments after online shopping are critical to product reputation and business improvement. These comments, sometimes known as e-commerce reviews, influence other customers' purchasing decisions. To confront large amounts of e-commerce reviews, automatic analysis based on machine learning and deep learning draws more and more attention. A core task therein is sentiment analysis. However, the e-commerce reviews exhibit the following characteristics: (1) inconsistency between comment content and the star rating; (2) a large number of unlabeled data, i.e., comments without a star rating, and (3) the data imbalance caused by the sparse negative comments. This paper employs Bidirectional Encoder Representation from Transformers (BERT), one of the best natural language processing models, as the base model. According to the above data characteristics, we propose the F_MixBERT framework, to more effectively use inconsistently low-quality and unlabeled data and resolve the problem of data imbalance. In the framework, the proposed MixBERT incorporates the MixMatch approach into BERT’s high-dimensional vectors to train the unlabeled and low-quality data with generated pseudo labels. Meanwhile, data imbalance is resolved by Focal loss, which penalizes the contribution of large-scale data and easily-identifiable data to total loss. Comparative experiments demonstrate that the proposed framework outperforms BERT and MixBERT for sentiment analysis of e-commerce comments.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
F. Pang, X. Chen, L. Li, X. Xu, Z. Xing, "F_MixBERT: Sentiment Analysis Model using Focal Loss for Imbalanced E-commerce Reviews," KSII Transactions on Internet and Information Systems, vol. 18, no. 2, pp. 263-283, 2024. DOI: 10.3837/tiis.2024.02.001.

[ACM Style]
Fengqian Pang, Xi Chen, Letong Li, Xin Xu, and Zhiqiang Xing. 2024. F_MixBERT: Sentiment Analysis Model using Focal Loss for Imbalanced E-commerce Reviews. KSII Transactions on Internet and Information Systems, 18, 2, (2024), 263-283. DOI: 10.3837/tiis.2024.02.001.

[BibTeX Style]
@article{tiis:90548, title="F_MixBERT: Sentiment Analysis Model using Focal Loss for Imbalanced E-commerce Reviews", author="Fengqian Pang and Xi Chen and Letong Li and Xin Xu and Zhiqiang Xing and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2024.02.001}, volume={18}, number={2}, year="2024", month={February}, pages={263-283}}