Multi-Modal Convolutional Parameterisation Network for Guided Image Inverse Problems

Mikolaj Czerkawski; Priti Upadhyay; Christopher Davison; Robert Atkinson; Craig Michie; Ivan Andonovic; Malcolm Macdonald; Javier Cardona; Christos Tachtatzis

doi:10.3390/jimaging10030069

J Imaging. 2024 Mar 12;10(3):69. doi: 10.3390/jimaging10030069.

Authors

Mikolaj Czerkawski¹, Priti Upadhyay¹, Christopher Davison¹, Robert Atkinson¹, Craig Michie¹, Ivan Andonovic¹, Malcolm Macdonald¹, Javier Cardona², Christos Tachtatzis¹

Affiliations

¹ Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK.
² Department of Chemical Engineering, University of Strathclyde, Glasgow G1 1XJ, UK.

Abstract

Several image inverse tasks, such as inpainting and super-resolution, can be solved using deep internal learning, a paradigm in which a deep neural network finds a solution by learning from the sample itself rather than from a dataset. For example, Deep Image Prior is a technique based on fitting a convolutional neural network to output the known parts of the image (such as non-inpainted regions or a low-resolution version of the image). However, this approach is not well suited to samples composed of multiple modalities. In some domains, such as satellite image processing, accommodating multi-modal representations could be beneficial or even essential. In this work, the Multi-Modal Convolutional Parameterisation Network (MCPN) is proposed, in which a convolutional neural network approximates shared information between multiple modes by combining a core shared network with modality-specific head networks. The results demonstrate that these approaches can significantly outperform the single-mode adoption of a convolutional parameterisation network on guided image inverse problems of inpainting and super-resolution.

Keywords: image inpainting; image super-resolution; image synthesis; internal learning; multi-modal learning.
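
The structure described in the abstract (a shared core network feeding modality-specific heads, fitted only to the known parts of the sample) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation. The layer sizes, the single 3x3 core layer, the 1x1 heads, the fixed random input `z`, the modality names `target` and `guide`, and the summed masked mean-squared-error objective are all assumptions made for illustration. No optimisation loop is shown; in practice the weights would be fitted by gradient descent in an automatic-differentiation framework.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv3x3(x, w):
    """Same-padded 3x3 convolution. x: (C_in, H, W); w: (C_out, C_in, 3, 3)."""
    _, h, wd = x.shape
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))  # zero-pad the spatial dims
    out = np.zeros((w.shape[0], h, wd))
    for dy in range(3):
        for dx in range(3):
            patch = xp[:, dy:dy + h, dx:dx + wd]
            out += np.einsum("oc,chw->ohw", w[:, :, dy, dx], patch)
    return out

def forward(z, core_w, head_ws):
    """Shared core (assumed: one conv + ReLU), then one 1x1-conv head per modality."""
    features = np.maximum(conv3x3(z, core_w), 0.0)
    return {name: np.einsum("oc,chw->ohw", w, features)
            for name, w in head_ws.items()}

def masked_mse(pred, target, mask):
    """Mean squared error over known pixels only (mask == 1)."""
    m = np.broadcast_to(mask, pred.shape)
    return float(np.sum(m * (pred - target) ** 2) / np.sum(m))

# Hypothetical sizes: 8x8 sample, 4-channel fixed input, 8 shared feature maps.
H = W = 8
z = rng.standard_normal((4, H, W))
core_w = 0.1 * rng.standard_normal((8, 4, 3, 3))
head_ws = {
    "target": 0.1 * rng.standard_normal((3, 8)),  # e.g. a 3-channel image to inpaint
    "guide": 0.1 * rng.standard_normal((1, 8)),   # e.g. a 1-channel guidance image
}

# Synthetic stand-ins for the observed sample.
targets = {
    "target": rng.standard_normal((3, H, W)),
    "guide": rng.standard_normal((1, H, W)),
}

# The target modality has a missing 4x4 region; the guide is fully observed.
masks = {"target": np.ones((H, W)), "guide": np.ones((H, W))}
masks["target"][2:6, 2:6] = 0.0

outputs = forward(z, core_w, head_ws)
loss = sum(masked_mse(outputs[k], targets[k], masks[k]) for k in head_ws)
print(f"summed masked loss: {loss:.4f}")
```

Because the mask zeroes the missing region, the target values inside the hole never influence the loss. Any content the network produces there comes from the shared features, which are also constrained by the fully observed guidance modality.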

Grants and funding

825355/European Union Horizon 2020 Research and Innovation Programme