Global-Feature Encoding U-Net (GEU-Net) for Multi-Focus Image Fusion

IEEE Trans Image Process. 2021;30:163-175. doi: 10.1109/TIP.2020.3033158. Epub 2020 Nov 18.

Abstract

Convolutional neural network (CNN)-based multi-focus image fusion methods, which learn the focus map from the source images, have greatly enhanced fusion performance compared with traditional methods. However, these methods have not yet achieved fully satisfactory fusion results, since the convolution operation concentrates too heavily on local regions and treats focus map generation as a local classification problem (classifying each pixel as focused or defocused). In this article, a global-feature encoding U-Net (GEU-Net) is proposed for multi-focus image fusion. In the proposed GEU-Net, a U-Net is employed to treat focus map generation as a global two-class segmentation task, which segments the focused and defocused regions from a global view. To improve the global feature encoding capability of the U-Net, a global feature pyramid extraction module (GFPE) and a global attention connection upsample module (GACU) are introduced to effectively extract and exploit global semantic and edge information. A perceptual loss is added to the loss function, and a large-scale dataset is constructed to boost the performance of GEU-Net. Experimental results show that the proposed GEU-Net achieves fusion performance superior to several state-of-the-art methods in human visual quality, objective assessment, and network complexity.
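Since the abstract frames fusion as predicting a two-class focus map and compositing the source images, the following is a minimal PyTorch sketch of that pipeline. It is not the authors' GEU-Net: FocusSegNet is a small placeholder for the U-Net backbone (without GFPE/GACU), and the 0.5 binarization threshold, the feature-extractor argument, and the L1 perceptual distance are all illustrative assumptions.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class FocusSegNet(nn.Module):
        """Placeholder for the GEU-Net backbone: any encoder-decoder that
        maps a concatenated source pair to a 1-channel focus probability."""
        def __init__(self):
            super().__init__()
            self.body = nn.Sequential(
                nn.Conv2d(6, 32, 3, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(32, 1, 1),
            )
        def forward(self, a, b):
            # Focus map: values near 1 favor source a, near 0 favor source b.
            return torch.sigmoid(self.body(torch.cat([a, b], dim=1)))

    def fuse(net, img_a, img_b, hard=True):
        """Compose the fused image from the predicted focus map."""
        m = net(img_a, img_b)
        if hard:                     # binarize at inference time (assumed threshold)
            m = (m > 0.5).float()
        return m * img_a + (1 - m) * img_b

    def perceptual_loss(feat_extractor, fused, reference):
        """Perceptual term of the kind the abstract adds to the loss: distance
        between fixed pretrained-network features of fused and reference images."""
        return F.l1_loss(feat_extractor(fused), feat_extractor(reference))

    if __name__ == "__main__":
        net = FocusSegNet()
        a = torch.rand(1, 3, 128, 128)
        b = torch.rand(1, 3, 128, 128)
        print(fuse(net, a, b).shape)  # torch.Size([1, 3, 128, 128])

In practice the feature extractor would be a frozen pretrained network (e.g. VGG-style features), so the perceptual term penalizes semantic rather than purely pixel-wise differences; the abstract does not specify which network or distance the authors use.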