KSII Transactions on Internet and Information Systems
Monthly Online Journal (eISSN: 1976-7277)

Semi-supervised Software Defect Prediction Model Based on Tri-training


Abstract

To address the difficulty of software defect prediction caused by a shortage of labelled defect samples and by class imbalance, this paper proposes a semi-supervised software defect prediction model that combines feature normalization, oversampling, and the Tri-training algorithm. First, feature normalization rescales the feature data so that excessively large or small feature values do not distort the model's classification performance. Second, oversampling expands the labelled data to correct the class imbalance among labelled samples. Finally, the Tri-training algorithm learns from the training samples and builds the defect prediction model. The novelty of the model lies in combining feature normalization, oversampling, and Tri-training to address both the shortage of labelled samples and the class imbalance problem. Simulation experiments on the NASA software defect prediction dataset show that the proposed method outperforms four existing supervised and semi-supervised learning methods in terms of Precision, Recall, and F-Measure.
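
The three stages described in the abstract can be illustrated with a minimal sketch. The abstract does not name the specific normalization, oversampling, or base-learner choices, so everything below is an assumption made for illustration: min-max scaling, oversampling by random duplication, a hypothetical nearest-centroid base learner, and a simplified Tri-training loop that omits the error-estimate conditions of the full algorithm. The toy data are invented and are not taken from the NASA dataset.

```python
import random

def min_max_normalize(rows):
    """Rescale every feature column to [0, 1] (min-max is an assumed choice)."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [(max(c) - min(c)) or 1.0 for c in cols]  # guard constant columns
    return [[(v - lo) / sp for v, lo, sp in zip(r, lows, spans)] for r in rows]

def random_oversample(X, y, rng):
    """Duplicate randomly chosen rows of smaller classes until all classes match."""
    by_class = {}
    for xi, yi in zip(X, y):
        by_class.setdefault(yi, []).append(xi)
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        X_out += rows + [rng.choice(rows) for _ in range(target - len(rows))]
        y_out += [label] * target
    return X_out, y_out

class NearestCentroid:
    """Stand-in base learner: predicts the class whose mean vector is closest."""
    def fit(self, X, y):
        groups = {}
        for xi, yi in zip(X, y):
            groups.setdefault(yi, []).append(xi)
        self.centroids = {c: [sum(col) / len(rows) for col in zip(*rows)]
                          for c, rows in groups.items()}
        return self

    def predict_one(self, x):
        def dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, self.centroids[c]))
        return min(self.centroids, key=dist)

def tri_train(X_lab, y_lab, X_unlab, rng, rounds=3):
    """Simplified tri-training: each learner is retrained on the labelled data
    plus the unlabelled rows on which the other two learners agree."""
    def bootstrap():
        idx = [rng.randrange(len(X_lab)) for _ in X_lab]
        return [X_lab[i] for i in idx], [y_lab[i] for i in idx]

    models = [NearestCentroid().fit(*bootstrap()) for _ in range(3)]
    for _ in range(rounds):
        updated = []
        for i in range(3):
            a, b = [m for j, m in enumerate(models) if j != i]
            X_new, y_new = list(X_lab), list(y_lab)
            for x in X_unlab:
                label = a.predict_one(x)
                if label == b.predict_one(x):  # peers agree: accept pseudo-label
                    X_new.append(x)
                    y_new.append(label)
            updated.append(NearestCentroid().fit(X_new, y_new))
        models = updated
    return models

def predict(models, x):
    """Majority vote of the three learners."""
    votes = [m.predict_one(x) for m in models]
    return max(set(votes), key=votes.count)

def precision_recall_f(y_true, y_pred, positive=1):
    """Precision, recall, and F-measure for the positive (defective) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f

# Toy data (invented): two metrics per module, label 1 = defective.
rng = random.Random(0)
labelled = [[10, 1], [12, 2], [11, 1], [9, 2], [90, 9]]
labels = [0, 0, 0, 0, 1]
unlabelled = [[88, 8], [13, 1], [92, 10], [8, 2]]

scaled = min_max_normalize(labelled + unlabelled)
X_lab, X_unlab = scaled[:len(labelled)], scaled[len(labelled):]
X_bal, y_bal = random_oversample(X_lab, labels, rng)
models = tri_train(X_bal, y_bal, X_unlab, rng)
predictions = [predict(models, x) for x in X_unlab]
```

In this sketch the labelled and unlabelled rows are normalized together so that both share one feature scale, and oversampling is applied only to the labelled rows, matching the order of steps in the abstract. A real experiment would likely substitute stronger base learners and a more careful oversampling method.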



Cite this article

[IEEE Style]
F. Meng, W. Cheng, J. Wang, "Semi-supervised Software Defect Prediction Model Based on Tri-training," KSII Transactions on Internet and Information Systems, vol. 15, no. 11, pp. 4028-4042, 2021. DOI: 10.3837/tiis.2021.11.009.

[ACM Style]
Fanqi Meng, Wenying Cheng, and Jingdong Wang. 2021. Semi-supervised Software Defect Prediction Model Based on Tri-training. KSII Transactions on Internet and Information Systems, 15, 11, (2021), 4028-4042. DOI: 10.3837/tiis.2021.11.009.

[BibTeX Style]
@article{tiis:25101, title="Semi-supervised Software Defect Prediction Model Based on Tri-training", author="Fanqi Meng and Wenying Cheng and Jingdong Wang", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2021.11.009}, volume={15}, number={11}, year="2021", month={November}, pages={4028-4042}}