KSII Transactions on Internet and Information Systems
Monthly Online Journal (eISSN: 1976-7277)

Representative Batch Normalization for Scene Text Recognition


Abstract

Scene text recognition has important practical value and has attracted the interest of many researchers. Many existing methods achieve good results, but most of them attempt to improve recognition performance at the image level. They work well on regular scene text. However, many obstacles remain in recognizing text in low-quality images, such as curved, occluded, or blurred text. Because image quality is uneven, feature extraction becomes more difficult. In addition, test results depend heavily on the training data, so there is still room for improvement in scene text recognition methods. In this work, we present a natural scene text recognizer that improves recognition performance at the feature level through feature representation and feature enhancement. For feature representation, we propose an efficient feature extractor that combines Representative Batch Normalization with ResNet. It reduces the model's dependence on training data and improves the feature representation of different instances. For feature enhancement, we use a feature enhancement network to expand the receptive field of the feature maps so that they contain richer feature information. The enhanced feature representation helps improve the recognition performance of the model. Experiments on seven benchmarks show that the method is highly competitive in recognizing both regular and irregular text. The method achieves top-1 recognition accuracy on four benchmarks: IC03, IC13, IC15, and SVTP.
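
The abstract does not spell out how the Representative Batch Normalization layer is computed. The following is a minimal NumPy sketch of the commonly described formulation (a centering calibration before standardization and a sigmoid-gated scaling calibration after it), not the authors' implementation. The function name, the parameter names (`w_m`, `w_v`, `w_b`, `gamma`, `beta`), and the use of the per-instance spatial mean as the calibration statistic are assumptions made for illustration; running statistics for inference are omitted.

```python
import numpy as np


def representative_batch_norm(x, w_m, w_v, w_b, gamma, beta, eps=1e-5):
    """Training-mode sketch for x of shape (N, C, H, W).

    All parameters are assumed to have shape (1, C, 1, 1).
    """
    # Centering calibration: shift each instance by a learned multiple
    # of its own per-channel spatial mean (an instance-level statistic).
    k_m = x.mean(axis=(2, 3), keepdims=True)
    x_cm = x + w_m * k_m

    # Ordinary batch-norm standardization with mini-batch statistics.
    mu = x_cm.mean(axis=(0, 2, 3), keepdims=True)
    var = x_cm.var(axis=(0, 2, 3), keepdims=True)
    x_s = (x_cm - mu) / np.sqrt(var + eps)

    # Scaling calibration: gate each instance and channel with a sigmoid
    # of its own statistic, computed on the standardized features.
    k_s = x_s.mean(axis=(2, 3), keepdims=True)
    gate = 1.0 / (1.0 + np.exp(-(w_v * k_s + w_b)))
    x_cs = x_s * gate

    # Learned affine transform, as in standard batch normalization.
    return gamma * x_cs + beta
```

With `w_m`, `w_v`, and `w_b` all zero, the gate is a constant 0.5 and the layer reduces to ordinary batch normalization scaled by one half, which shows that the two calibration steps are the only per-instance parts of the computation. In a ResNet-based extractor, a layer of this kind would stand in for the usual batch normalization layers.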


Cite this article

[IEEE Style]
Y. Sun, X. Cao, and Y. Sun, "Representative Batch Normalization for Scene Text Recognition," KSII Transactions on Internet and Information Systems, vol. 16, no. 7, pp. 2390-2406, 2022. DOI: 10.3837/tiis.2022.07.015.

[ACM Style]
Yajie Sun, Xiaoling Cao, and Yingying Sun. 2022. Representative Batch Normalization for Scene Text Recognition. KSII Transactions on Internet and Information Systems, 16, 7, (2022), 2390-2406. DOI: 10.3837/tiis.2022.07.015.

[BibTeX Style]
@article{tiis:25848, title="Representative Batch Normalization for Scene Text Recognition", author="Yajie Sun and Xiaoling Cao and Yingying Sun", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2022.07.015}, volume={16}, number={7}, year="2022", month={July}, pages={2390-2406}}