Given its remarkable capability for feature extraction in computer vision tasks, deep learning (DL) has been extensively utilized to fuse infrared and visible images. However, the existing DL-based methods generally extract complementary information from source images through convolutional operations, which results in limited preservation of global features. To this end, we propose a novel infrared and visible image fusion method, i.e., the Y-shape dynamic Transformer (YDTR). Speciï¬