MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Wang, Xiao; Shu, Xiujun; Zhang, Shiliang; Jiang, Bo; Wang, Yaowei; Tian, Yonghong; Wu, Feng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2107.10433 (cs)

[Submitted on 22 Jul 2021 (v1), last revised 9 May 2022 (this version, v2)]

Title:MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Authors:Xiao Wang, Xiujun Shu, Shiliang Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu

View PDF

Abstract:Many RGB-T trackers attempt to attain robust feature representation by utilizing an adaptive weighting scheme (or attention mechanism). Different from these works, we propose a new dynamic modality-aware filter generation module (named MFGNet) to boost the message communication between visible and thermal data by adaptively adjusting the convolutional kernels for various input images in practical tracking. Given the image pairs as input, we first encode their features with the backbone network. Then, we concatenate these feature maps and generate dynamic modality-aware filters with two independent networks. The visible and thermal filters will be used to conduct a dynamic convolutional operation on their corresponding input feature maps respectively. Inspired by residual connection, both the generated visible and thermal feature maps will be summarized with input feature maps. The augmented feature maps will be fed into the RoI align module to generate instance-level features for subsequent classification. To address issues caused by heavy occlusion, fast motion and out-of-view, we propose to conduct a joint local and global search by exploiting a new direction-aware target driven attention mechanism. The spatial and temporal recurrent neural network is used to capture the direction-aware context for accurate global attention prediction. Extensive experiments on three large-scale RGB-T tracking benchmark datasets validated the effectiveness of our proposed algorithm. The source code of this paper is available at \textcolor{magenta}{\url{this https URL}}.

Comments:	Accepted by IEEE TMM 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2107.10433 [cs.CV]
	(or arXiv:2107.10433v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2107.10433

Submission history

From: Xiao Wang [view email]
[v1] Thu, 22 Jul 2021 03:10:51 UTC (12,647 KB)
[v2] Mon, 9 May 2022 11:06:22 UTC (14,374 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators