SGAT: Shuffle and graph attention based Siamese networks for visual tracking

PLoS One. 2022 Nov 23;17(11):e0277064. doi: 10.1371/journal.pone.0277064. eCollection 2022.

Abstract

Siamese-based trackers have achieved excellent performance and attracted extensive attention, which regard the tracking task as a similarity learning between the target template and search regions. However, most Siamese-based trackers do not effectively exploit correlations of the spatial and channel-wise information to represent targets. Meanwhile, the cross-correlation is a linear matching method and neglects the structured and part-level information. In this paper, we propose a novel tracking algorithm for feature extraction of target templates and search region images. Based on convolutional neural networks and shuffle attention, the tracking algorithm computes the similarity between the template and a search region through a graph attention matching. The proposed tracking algorithm exploits the correlations between the spatial and channel-wise information to highlight the target region. Moreover, the graph matching can greatly alleviate the influences of appearance variations such as partial occlusions. Extensive experiments demonstrate that the proposed tracking algorithm achieves excellent tracking results on multiple challenging benchmarks. Compared with other state-of-the-art methods, the proposed tracking algorithm achieves excellent tracking performance.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Learning
  • Neural Networks, Computer*

Grants and funding

Yuanyun Wang, Wenshuang Zhang, Limin Zhang are funded by the Jiangxi Science and Technology Research Project of Education within the Department of China (No: GJJ190955), and the National Natural Science Foundation of China (No: 61861032) for the study design, the experiments and the paper publishing. Jun Wang is funded by the National Natural Science Foundation of China (No: 61865012) for the study and the publication.