Portrait Segmentation Using Ensemble of Heterogeneous Deep-Learning Models

Yong-Woon Kim; Yung-Cheol Byun; Addapalli V N Krishna

doi:10.3390/e23020197

Portrait Segmentation Using Ensemble of Heterogeneous Deep-Learning Models

Entropy (Basel). 2021 Feb 5;23(2):197. doi: 10.3390/e23020197.

Authors

Yong-Woon Kim¹, Yung-Cheol Byun², Addapalli V N Krishna³

Affiliations

¹ Centre for Digital Innovation, CHRIST (Deemed to be University), Bangalore, Karnataka 560029, India.
² Department of Computer Engineering, Jeju National University, 102 Jejudaehak-ro, Jeju-si 63243, Korea.
³ Department of Computer Science and Engineering, CHRIST (Deemed to be University), Bangalore, Karnataka 560029, India.

Abstract

Image segmentation plays a central role in a broad range of applications, such as medical image analysis, autonomous vehicles, video surveillance and augmented reality. Portrait segmentation, which is a subset of semantic image segmentation, is widely used as a preprocessing step in multiple applications such as security systems, entertainment applications, video conferences, etc. A substantial amount of deep learning-based portrait segmentation approaches have been developed, since the performance and accuracy of semantic image segmentation have improved significantly due to the recent introduction of deep learning technology. However, these approaches are limited to a single portrait segmentation model. In this paper, we propose a novel approach using an ensemble method by combining multiple heterogeneous deep-learning based portrait segmentation models to improve the segmentation performance. The Two-Models ensemble and Three-Models ensemble, using a simple soft voting method and weighted soft voting method, were experimented. Intersection over Union (IoU) metric, IoU standard deviation and false prediction rate were used to evaluate the performance. Cost efficiency was calculated to analyze the efficiency of segmentation. The experiment results show that the proposed ensemble approach can perform with higher accuracy and lower errors than single deep-learning-based portrait segmentation models. The results also show that the ensemble of deep-learning models typically increases the use of memory and computing power, although it also shows that the ensemble of deep-learning models can perform more efficiently than a single model with higher accuracy using less memory and less computing power.

Keywords: deep learning; efficiency; ensemble; portrait segmentation; simple soft voting; stacking; weighted soft voting.

Grants and funding

S2855401/The Ministry of Small and Medium-sized Enterprises(SMEs) and Startups(MSS)