• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Memory Propagation-based Target-aware Segmentation Tracker with Adaptive Mask-attention Decision Network


Abstract

Siamese-based segmentation and tracking algorithms improve accuracy and stability for video object segmentation and tracking tasks simultaneously. Although effective, variability in target appearance and background clutter can still affect segmentation accuracy and further influence the performance of tracking. In this paper, we present a memory propagation-based target-aware and mask-attention decision network for robust object segmentation and tracking. Firstly, a mask propagation-based attention module (MPAM) is constructed to explore the inherent correlation among image frames, which can mine mask information of the historical frames. By retrieving a memory bank (MB) that stores features and binary masks of historical frames, target attention maps are generated to highlight the target region on backbone features, thus suppressing the adverse effects of background clutter. Secondly, an attention refinement pathway (ARP) is designed to further refine the segmentation profile in the process of mask generation. A lightweight attention mechanism is introduced to calculate the weight of low-level features, paying more attention to low-level features sensitive to edge detail so as to obtain segmentation results. Finally, a mask fusion mechanism (MFM) is proposed to enhance the accuracy of the mask. By utilizing a mask quality assessment decision network, the corresponding quality scores of the "initial mask" and the "previous mask" can be obtained adaptively, thus achieving the assignment of weights and the fusion of masks. Therefore, the final mask enjoys higher accuracy and stability. Experimental results on multiple benchmarks demonstrate that our algorithm performs outstanding performance in a variety of challenging tracking tasks.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
H. Zhang, W. Fu, B. Zhou, K. Zhou, X. Yang, S. Liu, "Memory Propagation-based Target-aware Segmentation Tracker with Adaptive Mask-attention Decision Network," KSII Transactions on Internet and Information Systems, vol. 18, no. 9, pp. 2605-2625, 2024. DOI: 10.3837/tiis.2024.09.008.

[ACM Style]
Huanlong Zhang, Weiqiang Fu, Bin Zhou, Keyan Zhou, Xiangbo Yang, and Shanfeng Liu. 2024. Memory Propagation-based Target-aware Segmentation Tracker with Adaptive Mask-attention Decision Network. KSII Transactions on Internet and Information Systems, 18, 9, (2024), 2605-2625. DOI: 10.3837/tiis.2024.09.008.

[BibTeX Style]
@article{tiis:101206, title="Memory Propagation-based Target-aware Segmentation Tracker with Adaptive Mask-attention Decision Network", author="Huanlong Zhang and Weiqiang Fu and Bin Zhou and Keyan Zhou and Xiangbo Yang and Shanfeng Liu and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2024.09.008}, volume={18}, number={9}, year="2024", month={September}, pages={2605-2625}}