FPGA-based systolic deconvolution architecture for upsampling

Alex Noel Joseph Raj; Lianhong Cai; Wei Li; Zhemin Zhuang; Tardi Tjahjadi

doi:10.7717/peerj-cs.973

FPGA-based systolic deconvolution architecture for upsampling

PeerJ Comput Sci. 2022 May 11:8:e973. doi: 10.7717/peerj-cs.973. eCollection 2022.

Authors

Alex Noel Joseph Raj¹, Lianhong Cai¹, Wei Li¹, Zhemin Zhuang¹, Tardi Tjahjadi²

Affiliations

¹ Department of Electronic Engineering, Shantou University, Shantou City, Guangdong Province, China.
² School of Engineering, University of Warwick, Coventry, United Kingdom.

Abstract

A deconvolution accelerator is proposed to upsample n × n input to 2n × 2n output by convolving with a k × k kernel. Its architecture avoids the need for insertion and padding of zeros and thus eliminates the redundant computations to achieve high resource efficiency with reduced number of multipliers and adders. The architecture is systolic and governed by a reference clock, enabling the sequential placement of the module to represent a pipelined decoder framework. The proposed accelerator is implemented on a Xilinx XC7Z020 platform, and achieves a performance of 3.641 giga operations per second (GOPS) with resource efficiency of 0.135 GOPS/DSP for upsampling 32 × 32 input to 256 × 256 output using a 3 × 3 kernel at 200 MHz. Furthermore, its high peak signal to noise ratio of almost 80 dB illustrates that the upsampled outputs of the bit truncated accelerator are comparable to IEEE double precision results.

Keywords: Deep learning; FPGA; Transposed convolution; Upsample.

Associated data

figshare/10.6084/m9.figshare.19387118

Grants and funding

This research was financially supported by The Scientific Research Grant of Shantou University, China, (Grant No: NTF17016); the National Natural Science Foundation of China (No.82071992); Basic and Applied Basic Research Foundation of Guangdong Province [grant number 2020B1515120061]; National Key R&D Program of China [grant number 2020YFC0122103] and the Guangdong Province University Priority Field (Artificial Intelligence) Project [grant number 2019KZDZX1013]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.