• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution Alignment


Abstract

Scene graphs serve as semantic abstractions of images and play a crucial role in enhancing visual comprehension and reasoning. However, the performance of Scene Graph Generation is often compromised when working with biased data in real-world situations. While many existing systems focus on a single stage of learning for both feature extraction and classification, some employ Class-Balancing strategies, such as Re-weighting, Data Resampling, and Transfer Learning from head to tail. In this paper, we propose a novel approach that decouples the feature extraction and classification phases of the scene graph generation process. For feature extraction, we leverage a transformer-based architecture and design an adaptive calibration function specifically for predicate classification. This function enables us to dynamically adjust the classification scores for each predicate category. Additionally, we introduce a Distribution Alignment technique that effectively balances the class distribution after the feature extraction phase reaches a stable state, thereby facilitating the retraining of the classification head. Importantly, our Distribution Alignment strategy is model-independent and does not require additional supervision, making it applicable to a wide range of SGG models. Using the scene graph diagnostic toolkit on Visual Genome and several popular models, we achieved significant improvements over the previous state-of-the-art methods with our model. Compared to the TDE model, our model improved mR@100 by 70.5% for PredCls, by 84.0% for SGCls, and by 97.6% for SGDet tasks.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
D. Jia, M. Zhou, W. WEI, D. Wang, Z. Bai, "A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution Alignment," KSII Transactions on Internet and Information Systems, vol. 17, no. 12, pp. 3383-3397, 2023. DOI: 10.3837/tiis.2023.12.009.

[ACM Style]
Dongdong Jia, Meili Zhou, Wei WEI, Dong Wang, and Zongwen Bai. 2023. A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution Alignment. KSII Transactions on Internet and Information Systems, 17, 12, (2023), 3383-3397. DOI: 10.3837/tiis.2023.12.009.

[BibTeX Style]
@article{tiis:56488, title="A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution Alignment", author="Dongdong Jia and Meili Zhou and Wei WEI and Dong Wang and Zongwen Bai and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2023.12.009}, volume={17}, number={12}, year="2023", month={December}, pages={3383-3397}}