KSII Transactions on Internet and Information Systems
Monthly Online Journal (eISSN: 1976-7277)

Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects

Vol. 16, No. 1, January 31, 2022
DOI: 10.3837/tiis.2022.01.014

Abstract

To address insufficient feature extraction, low detection accuracy, and frequent misdetection in Thangka image defect recognition, this paper proposes a YOLOv5 prediction algorithm fused with an attention mechanism. First, the Backbone network extracts features, and the attention mechanism is fused in to represent different features so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused to reduce information loss. Next, the weighted, fused features are passed to the Neck network, where FPN fuses the semantic and texture features of different layers and PAN locates the defect target more accurately. In the detection network, the CIOU loss function replaces the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that, compared with the original network, YOLOv5-SE and YOLOv5-CBAM improve detection accuracy by 8.95% and 12.87%, respectively. The improved networks identify the location and category of defects more accurately and greatly improve the accuracy of defect detection in Thangka images.
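The swap from GIOU to CIOU loss mentioned in the abstract can be illustrated with a small sketch. The code below is not taken from the paper; it follows the commonly published definitions of the two losses, and the box format `(x1, y1, x2, y2)`, the function names, and the assumption that every box has positive width and height are all illustrative choices.

```python
import math


def _iou_terms(a, b):
    """Return (iou, union_area, enclosing_box) for two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = a
    bx1, by1, bx2, by2 = b
    # Width and height of the overlap, clamped at zero when boxes are disjoint.
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    # Smallest axis-aligned box that contains both inputs.
    enclose = (min(ax1, bx1), min(ay1, by1), max(ax2, bx2), max(ay2, by2))
    return inter / union, union, enclose


def giou_loss(pred, gt):
    """GIoU loss: 1 - (IoU - (|C| - |U|) / |C|), with C the enclosing box."""
    iou, union, (ex1, ey1, ex2, ey2) = _iou_terms(pred, gt)
    c_area = (ex2 - ex1) * (ey2 - ey1)
    return 1.0 - (iou - (c_area - union) / c_area)


def ciou_loss(pred, gt):
    """CIoU loss: 1 - (IoU - rho^2 / c^2 - alpha * v)."""
    iou, _, (ex1, ey1, ex2, ey2) = _iou_terms(pred, gt)
    # rho^2: squared distance between the two box centres.
    rho2 = (((pred[0] + pred[2]) - (gt[0] + gt[2])) ** 2
            + ((pred[1] + pred[3]) - (gt[1] + gt[3])) ** 2) / 4.0
    # c^2: squared diagonal of the enclosing box.
    c2 = (ex2 - ex1) ** 2 + (ey2 - ey1) ** 2
    # v: aspect-ratio mismatch; alpha: its trade-off weight.
    v = (4.0 / math.pi ** 2) * (
        math.atan((gt[2] - gt[0]) / (gt[3] - gt[1]))
        - math.atan((pred[2] - pred[0]) / (pred[3] - pred[1]))
    ) ** 2
    alpha = v / ((1.0 - iou) + v) if v > 0 else 0.0
    return 1.0 - (iou - rho2 / c2 - alpha * v)


gt = (0, 0, 4, 4)
for pred in [(1, 1, 3, 3), (0, 0, 2, 2)]:
    print(pred, giou_loss(pred, gt), ciou_loss(pred, gt))
```

When the predicted box lies entirely inside the ground-truth box, the enclosing box equals the ground truth, so the GIoU loss is the same wherever the prediction sits. The centre-distance term in CIoU still separates the centred prediction from the off-centre one, which is one commonly cited reason for preferring CIoU in localization.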




Cite this article

[IEEE Style]
Y. Fan, Y. Li, Y. Shi, S. Wang, "Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects," KSII Transactions on Internet and Information Systems, vol. 16, no. 1, pp. 245-265, 2022. DOI: 10.3837/tiis.2022.01.014.

[ACM Style]
Yao Fan, Yubo Li, Yingnan Shi, and Shuaishuai Wang. 2022. Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects. KSII Transactions on Internet and Information Systems, 16, 1, (2022), 245-265. DOI: 10.3837/tiis.2022.01.014.

[BibTeX Style]
@article{tiis:25256,
  title={Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects},
  author={Yao Fan and Yubo Li and Yingnan Shi and Shuaishuai Wang},
  journal={KSII Transactions on Internet and Information Systems},
  DOI={10.3837/tiis.2022.01.014},
  volume={16},
  number={1},
  year={2022},
  month={January},
  pages={245-265}
}