• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

3D Cross-Modal Retrieval Using Noisy Center Loss and SimSiam for Small Batch Training


Abstract

3D Cross-Modal Retrieval (3DCMR) is a task that retrieves 3D objects regardless of modalities, such as images, meshes, and point clouds. One of the most prominent methods used for 3DCMR is the Cross-Modal Center Loss Function (CLF) which applies the conventional center loss strategy for 3D cross-modal search and retrieval. Since CLF is based on center loss, the center features in CLF are also susceptible to subtle changes in hyperparameters and external inferences. For instance, performance degradation is observed when the batch size is too small. Furthermore, the Mean Squared Error (MSE) used in CLF is unable to adapt to changes in batch size and is vulnerable to data variations that occur during actual inference due to the use of simple Euclidean distance between multi-modal features. To address the problems that arise from small batch training, we propose a Noisy Center Loss (NCL) method to estimate the optimal center features. In addition, we apply the simple Siamese representation learning method (SimSiam) during optimal center feature estimation to compare projected features, making the proposed method robust to changes in batch size and variations in data. As a result, the proposed approach demonstrates improved performance in ModelNet40 dataset compared to the conventional methods.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
Y. Choo, B. Kim, H. Kim, Y. Park, "3D Cross-Modal Retrieval Using Noisy Center Loss and SimSiam for Small Batch Training," KSII Transactions on Internet and Information Systems, vol. 18, no. 3, pp. 670-684, 2024. DOI: 10.3837/tiis.2024.03.008.

[ACM Style]
Yeon-Seung Choo, Boeun Kim, Hyun-Sik Kim, and Yong-Suk Park. 2024. 3D Cross-Modal Retrieval Using Noisy Center Loss and SimSiam for Small Batch Training. KSII Transactions on Internet and Information Systems, 18, 3, (2024), 670-684. DOI: 10.3837/tiis.2024.03.008.

[BibTeX Style]
@article{tiis:90675, title="3D Cross-Modal Retrieval Using Noisy Center Loss and SimSiam for Small Batch Training", author="Yeon-Seung Choo and Boeun Kim and Hyun-Sik Kim and Yong-Suk Park and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2024.03.008}, volume={18}, number={3}, year="2024", month={March}, pages={670-684}}