A study of generalization and compatibility performance of 3D U-Net segmentation on multiple heterogeneous liver CT datasets

Baochun He; Dalong Yin; Xiaoxia Chen; Huoling Luo; Deqiang Xiao; Mu He; Guisheng Wang; Chihua Fang; Lianxin Liu; Fucang Jia

doi:10.1186/s12880-021-00708-y

BMC Med Imaging. 2021 Nov 24;21(1):178. doi: 10.1186/s12880-021-00708-y.

Authors

Baochun He^#^{1,2}, Dalong Yin^#^{3,4}, Xiaoxia Chen⁵, Huoling Luo^{1,2}, Deqiang Xiao^{1,2}, Mu He⁶, Guisheng Wang⁵, Chihua Fang⁶, Lianxin Liu^{7,8}, Fucang Jia^{9,10,11}

Affiliations

¹ Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
² Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
³ Department of Hepatobiliary Surgery, The First Affiliated Hospital, Harbin Medical University, Harbin, China.
⁴ Department of Hepatobiliary Surgery, The First Affiliated Hospital, University of Science and Technology of China, Hefei, China.
⁵ Department of Radiology, The Third Medical Center, General Hospital of PLA, Beijing, China.
⁶ First Hepatobiliary Surgery, Zhujiang Hospital, Southern Medical University, Guangzhou, China.
⁷ Department of Hepatobiliary Surgery, The First Affiliated Hospital, Harbin Medical University, Harbin, China. liulx@ustc.edu.cn.
⁸ Department of Hepatobiliary Surgery, The First Affiliated Hospital, University of Science and Technology of China, Hefei, China. liulx@ustc.edu.cn.
⁹ Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China. fc.jia@siat.ac.cn.
¹⁰ Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China. fc.jia@siat.ac.cn.
¹¹ Pazhou Lab, Guangzhou, China. fc.jia@siat.ac.cn.

^# Contributed equally.

Abstract

Background: Most existing algorithms have focused on segmenting a few public liver CT datasets acquired under regular scanning conditions (no pneumoperitoneum, horizontal supine position). This study instead segmented datasets with unconventional liver shapes and intensities caused by contrast phases, irregular scanning conditions, and different scanning subjects (pigs, and patients with large pathological tumors), which together make up the multiple heterogeneity of the datasets used in this study.

Methods: The multiple heterogeneous datasets used in this paper include: (1) one public contrast-enhanced CT dataset and one public non-contrast CT dataset; (2) a contrast-enhanced dataset with abnormal liver shapes, including very long left liver lobes and large liver tumors with abnormal presets caused by microvascular invasion; (3) one artificial pneumoperitoneum dataset scanned under pneumoperitoneum in three scanning positions (horizontal/left/right recumbent); (4) two porcine datasets, of Bama type and domestic type, that contain pneumoperitoneum cases but differ greatly from human anatomy. The study aimed to investigate the segmentation performance of 3D U-Net in terms of: (1) generalization ability between the multiple heterogeneous datasets, assessed by cross-testing experiments; (2) compatibility when hybrid training on all datasets under different sampling and encoder-layer-sharing schemes. We further investigated compatibility at the encoder level by giving each dataset its own separate level (i.e., dataset-wise convolutions) while sharing the decoder.
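The encoder sharing scheme described above can be sketched as a routing rule: at some encoder levels every dataset uses the same block, while at the dataset-wise levels each dataset gets its own block. The following is a minimal illustration only, not the authors' implementation. The class name DatasetWiseEncoder, the number of levels, and the toy "scale" layers standing in for 3D convolution blocks are all hypothetical; only the dataset names LiTS and Zhujiang come from the abstract.

```python
from typing import Callable, Dict, Iterable, List

Layer = Callable[[List[float]], List[float]]


def make_scale_layer(weight: float) -> Layer:
    """Toy stand-in for a convolution block: scales every value by `weight`."""
    return lambda xs: [weight * x for x in xs]


class DatasetWiseEncoder:
    """Hypothetical encoder with shared and dataset-wise levels.

    Levels listed in `dataset_wise_levels` hold one block per dataset;
    all other levels hold a single block shared by every dataset.
    """

    def __init__(self, num_levels: int, dataset_names: List[str],
                 dataset_wise_levels: Iterable[int]) -> None:
        self.num_levels = num_levels
        self.dataset_wise_levels = set(dataset_wise_levels)
        # One shared block per level that is not dataset-wise.
        self.shared: Dict[int, Layer] = {
            lvl: make_scale_layer(1.0)
            for lvl in range(num_levels)
            if lvl not in self.dataset_wise_levels
        }
        # One block per (level, dataset) pair at dataset-wise levels.
        # The differing weights are arbitrary and only make the branches
        # distinguishable in this toy example.
        self.per_dataset: Dict[int, Dict[str, Layer]] = {
            lvl: {name: make_scale_layer(1.0 + 0.1 * i)
                  for i, name in enumerate(dataset_names)}
            for lvl in self.dataset_wise_levels
        }

    def layer_for(self, level: int, dataset: str) -> Layer:
        """Return the block a sample from `dataset` uses at `level`."""
        if level in self.dataset_wise_levels:
            return self.per_dataset[level][dataset]
        return self.shared[level]

    def forward(self, xs: List[float], dataset: str) -> List[float]:
        """Run the input through every level, routing by dataset name."""
        for level in range(self.num_levels):
            xs = self.layer_for(level, dataset)(xs)
        return xs


# Assumed setup: 4 encoder levels, only the highest level is dataset-wise.
encoder = DatasetWiseEncoder(4, ["LiTS", "Zhujiang"], dataset_wise_levels={3})
features = encoder.forward([1.0, 2.0], dataset="Zhujiang")
```

In a real 3D U-Net the toy layers would be convolution blocks and the routed features would feed a single decoder shared by all datasets; which levels to make dataset-wise is the design choice the study compares.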

Results: Models trained on different datasets showed different segmentation performance. The prediction accuracy between the LiTS dataset and the Zhujiang dataset was about 0.955 and 0.958, which indicates good mutual generalization, because both are contrast-enhanced clinical patient datasets scanned under regular conditions. For the datasets scanned under pneumoperitoneum, the corresponding datasets scanned without pneumoperitoneum showed good generalization ability. A dataset-wise convolution module at the high levels can mitigate the dataset imbalance problem. These experimental results should help researchers design solutions when segmenting such special datasets.
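A cross-testing experiment of the kind reported above can be sketched as a matrix of scores, one per (training dataset, test dataset) pair. The abstract does not name the accuracy metric, so the Dice overlap used here is an assumption, and the function names dice and cross_test and the flat-list mask representation are hypothetical choices made for illustration.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Mask = Sequence[int]                     # flattened binary mask (assumed format)
Case = Tuple[Sequence[float], Mask]      # (image voxels, ground-truth mask)
Predictor = Callable[[Sequence[float]], Mask]


def dice(pred: Mask, truth: Mask) -> float:
    """Dice overlap 2|A∩B| / (|A| + |B|); defined as 1.0 when both masks are empty."""
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    return 1.0 if total == 0 else 2.0 * intersection / total


def cross_test(models: Dict[str, Predictor],
               test_sets: Dict[str, List[Case]]) -> Dict[Tuple[str, str], float]:
    """Score every trained model on every test set.

    `models` maps a training-dataset name to its prediction function;
    the result maps (train_name, test_name) to the mean Dice score.
    """
    matrix: Dict[Tuple[str, str], float] = {}
    for train_name, predict in models.items():
        for test_name, cases in test_sets.items():
            if not cases:
                continue  # skip empty test sets rather than divide by zero
            scores = [dice(predict(image), mask) for image, mask in cases]
            matrix[(train_name, test_name)] = sum(scores) / len(scores)
    return matrix
```

Off-diagonal entries of the resulting matrix, where the training and test datasets differ, are the ones that measure generalization between datasets.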

Conclusions: (1) Regularly scanned datasets generalize well to irregularly scanned ones. (2) Hybrid training is beneficial, but the dataset imbalance problem always exists due to the multi-domain heterogeneity. The higher levels encoded more domain-specific information than the lower levels and were therefore less compatible across our datasets.

Keywords: Dataset-wise convolution; Generalization; Liver segmentation; U-Net.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Animals
Contrast Media
Datasets as Topic
Humans
Imaging, Three-Dimensional*
Liver / diagnostic imaging*
Liver Neoplasms / diagnostic imaging*
Machine Learning*
Pneumoperitoneum / diagnostic imaging
Radiographic Image Interpretation, Computer-Assisted / methods*
Swine
Tomography, X-Ray Computed / methods*

Substances

Contrast Media