Wall segmentation in 2D images using convolutional neural networks

Mihailo Bjekic; Ana Lazovic; Venkatachalam K; Nebojsa Bacanin; Miodrag Zivkovic; Goran Kvascev; Bosko Nikolic

doi:10.7717/peerj-cs.1565

Wall segmentation in 2D images using convolutional neural networks

PeerJ Comput Sci. 2023 Sep 11:9:e1565. doi: 10.7717/peerj-cs.1565. eCollection 2023.

Authors

Mihailo Bjekic¹, Ana Lazovic², Venkatachalam K³, Nebojsa Bacanin⁴, Miodrag Zivkovic⁴, Goran Kvascev⁵, Bosko Nikolic⁵

Affiliations

¹ Everseen, Belgrade, Serbia.
² Daon, Belgrade, Serbia.
³ Department of Applied Cybernetics, University of Hradec Králové, Faculty of Science, Hradec Králové, Czech Republic.
⁴ Department of Informatics and Computing, Singidunum University, Belgrade, Serbia.
⁵ University of Belgrade, School of Electrical Engineering, Belgrade, Serbia.

Abstract

Wall segmentation is a special case of semantic segmentation, and the task is to classify each pixel into one of two classes: wall and no-wall. The segmentation model returns a mask showing where objects like windows and furniture are located, as well as walls. This article proposes the module's structure for semantic segmentation of walls in 2D images, which can effectively address the problem of wall segmentation. The proposed model achieved higher accuracy and faster execution than other solutions. An encoder-decoder architecture of the segmentation module was used. Dilated ResNet50/101 network was used as an encoder, representing ResNet50/101 network in which dilated convolutional layers replaced the last convolutional layers. The ADE20K dataset subset containing only interior images, was used for model training, while only its subset was used for model evaluation. Three different approaches to model training were analyzed in the research. On the validation dataset, the best approach based on the proposed structure with the ResNet101 network resulted in an average accuracy at the pixel level of 92.13% and an intersection over union (IoU) of 72.58%. Moreover, all proposed approaches can be applied to recognize other objects in the image to solve specific tasks.

Keywords: ADE20K; Encoder-decoder; PSPNet; Semantic segmentation; Wall segmentation.

Grants and funding

This research was funded by the Ministry of Education, Science and Technological Development of the Republic of Serbia, grant number 451-03-68/2022-14/200103. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.