SPS Webinar: YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer

Date: 23 May 2025
Time: 7:30 AM ET (New York Time)
Presenter(s): Dr. Wei Tang

Based on the IEEE Xplore® article: 
"YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer"
Published: IEEE Transactions on Multimedia, July 2022.

Download article: Original article will be made publicly available for download on the day of the webinar for 48 hours.

Abstract

Given its remarkable capability for feature extraction in computer vision tasks, deep learning (DL) has been extensively utilized to fuse infrared and visible images. However, the existing DL-based methods generally extract complementary information from source images through convolutional operations, which results in limited preservation of global features. To this end, the presenter will propose a novel infrared and visible image fusion method, i.e., the Y-shape dynamic Transformer (YDTR). Specifically, a dynamic Transformer module is designed to acquire not only the local features but also the significant context information. Furthermore, the proposed network is devised in a Y-shape to comprehensively maintain the thermal radiation information from the infrared image and scene details from the visible image. Considering the specific information provided by the source images, we design a loss function that consists of two terms to improve fusion quality: a structural similarity term and a spatial frequency term. Extensive experiments on mainstream datasets illustrate that the proposed method outperforms both classical and state-of-the-art approaches in both qualitative and quantitative assessments. She further extends the YDTR to address other infrared and RGB-visible images and multi-focus images without fine-tuning, and the satisfactory fusion results demonstrate that the proposed method has good generalization capability.

Biography

Julius Richter

Wei Tang received the bachelor’s degree in biomedical engineering from Wannan Medical College, Wuhu, China, in 2018, the master’s degree in biomedical instrument from Hefei University of Technology, Hefei, China, in 2021, and the Ph.D. degree in artificial intelligence from Wuhan University, Wuhan, China, in 2024.

She is currently an Assistant Professor in the School of Computer Science and Technology at Tongji University, Shanghai, China. Her current research interests include artificial intelligence, computer vision, image processing, information fusion, and biomedical image analysis.

Dr. Tang has first-authored several refereed journal articles, including IEEE Transactions on Image Processing, IEEE Transactions on Multimedia, IEEE Transactions on Circuits and Systems for Video Technology, and IEEE Transactions on Computational Imaging.