Structural inference embedded adversarial networks for scene parsing

PLoS One. 2018 Apr 12;13(4):e0195114. doi: 10.1371/journal.pone.0195114. eCollection 2018.

Abstract

Explicit structural inference is one key point to improve the accuracy of scene parsing. Meanwhile, adversarial training method is able to reinforce spatial contiguity in output segmentations. To take both advantages of the structural learning and adversarial training simultaneously, we propose a novel deep learning network architecture called Structural Inference Embedded Adversarial Networks (SIEANs) for pixel-wise scene labeling. The generator of our SIEANs, a novel designed scene parsing network, makes full use of convolutional neural networks and long short-term memory networks to learn the global contextual information of objects in four different directions from RGB-(D) images, which is able to describe the (three-dimensional) spatial distributions of objects in a more comprehensive and accurate way. To further improve the performance, we explore the adversarial training method to optimize the generator along with a discriminator, which can not only detect and correct higher-order inconsistencies between the predicted segmentations and corresponding ground truths, but also exploit full advantages of the generator by fine-tuning its parameters so as to obtain higher consistencies. The experimental results demonstrate that our proposed SIEANs is able to achieve a better performance on PASCAL VOC 2012, SIFT FLOW, PASCAL Person-Part, Cityscapes, Stanford Background, NYUDv2, and SUN-RGBD datasets compared to the most of state-of-the-art methods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Image Processing, Computer-Assisted / methods*
  • Imaging, Three-Dimensional
  • Machine Learning
  • Models, Statistical
  • Neural Networks, Computer*
  • Pattern Recognition, Automated / methods*
  • Software
  • User-Computer Interface

Grants and funding

This work was supported by the National Key Research and Development Program, http://program.most.gov.cn/, Award Number: 2016YFB1000400, Recipient: YanXia Wu; the Central University Free Exploration Fund, http://www.hrbeu.edu.cn/, Award Number: HEUCF170605, Recipient: YanXia Wu; the Harbin Outstanding Young Talents Fund, http://www.hljkjt.gov.cn/, Award Number: 2017RAYXJ016, Recipient: YanXia Wu; the National Natural Science Foundation of China, http://www.nsfc.gov.cn/, Award Number: 60903098, Recipient: ShuHui Bu; and the National Natural Science Foundation of China, http://www.nsfc.gov.cn/, Award Number: 61573284, Recipient: ShuHui Bu. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.