A Bi-FPN-Based Encoder-Decoder Model for Lung Nodule Image Segmentation

Chandra Sekhara Rao Annavarapu; Samson Anosh Babu Parisapogu; Nikhil Varma Keetha; Praveen Kumar Donta; Gurindapalli Rajita

doi:10.3390/diagnostics13081406

A Bi-FPN-Based Encoder-Decoder Model for Lung Nodule Image Segmentation

Diagnostics (Basel). 2023 Apr 13;13(8):1406. doi: 10.3390/diagnostics13081406.

Authors

Chandra Sekhara Rao Annavarapu¹, Samson Anosh Babu Parisapogu², Nikhil Varma Keetha¹, Praveen Kumar Donta³, Gurindapalli Rajita⁴

Affiliations

¹ Indian Institute of Technology (Indian School of Mines), Dhanbad 826004, India.
² Krishna Chaitanya Institute of Technology and Sciences, Markapur 523316, India.
³ Distributed Systems Group, TU Wien, 1040 Vienna, Austria.
⁴ Department of ECE, GIET University, Gunupur 765022, India.

Abstract

Early detection and analysis of lung cancer involve a precise and efficient lung nodule segmentation in computed tomography (CT) images. However, the anonymous shapes, visual features, and surroundings of the nodules as observed in the CT images pose a challenging and critical problem to the robust segmentation of lung nodules. This article proposes a resource-efficient model architecture: an end-to-end deep learning approach for lung nodule segmentation. It incorporates a Bi-FPN (bidirectional feature network) between an encoder and a decoder architecture. Furthermore, it uses the Mish activation function and class weights of masks with the aim of enhancing the efficiency of the segmentation. The proposed model was extensively trained and evaluated on the publicly available LUNA-16 dataset consisting of 1186 lung nodules. To increase the probability of the suitable class of each voxel in the mask, a weighted binary cross-entropy loss of each sample of training was utilized as network training parameter. Moreover, on the account of further evaluation of robustness, the proposed model was evaluated on the QIN Lung CT dataset. The results of the evaluation show that the proposed architecture outperforms existing deep learning models such as U-Net with a Dice Similarity Coefficient of 82.82% and 81.66% on both datasets.

Keywords: computed tomography; deep learning; medical image analysis; segmentation.

Grants and funding

This research received no external funding.