Integrating spatial configuration into heatmap regression based CNNs for landmark localization

Med Image Anal. 2019 May:54:207-219. doi: 10.1016/j.media.2019.03.007. Epub 2019 Mar 25.

Abstract

In many medical image analysis applications, only a limited amount of training data is available due to the cost of image acquisition and the large manual annotation effort required from experts. Training recent state-of-the-art machine learning methods such as convolutional neural networks (CNNs) on small datasets is challenging. In this work on anatomical landmark localization, we propose a CNN architecture that learns to split the localization task into two simpler sub-problems, reducing the overall need for large training datasets. Our fully convolutional SpatialConfiguration-Net (SCN) learns this simplification by multiplying the heatmap predictions of its two components and by training the network end-to-end. Thus, the SCN dedicates one component to locally accurate but ambiguous candidate predictions, while the other component improves robustness to ambiguities by incorporating the spatial configuration of landmarks. In our extensive experimental evaluation, we show that the proposed SCN outperforms related methods in terms of landmark localization error on a variety of size-limited 2D and 3D landmark localization datasets, namely hand radiographs, lateral cephalograms, hand MRIs, and spine CTs.
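The effect of multiplying the two heatmap predictions can be illustrated with a small NumPy sketch. This is not the SCN implementation: the two heatmaps below are synthetic Gaussians standing in for network outputs, and the names `gaussian_heatmap`, `combine_heatmaps`, and `locate`, as well as all coordinates and widths, are assumptions made for illustration. The local heatmap has a sharp peak at the true landmark and a slightly stronger peak at a look-alike structure; the spatial heatmap is coarse but centered near the true landmark.

```python
import numpy as np


def gaussian_heatmap(shape, center, sigma):
    """Return a 2D Gaussian blob with peak value 1.0 at `center` (row, col)."""
    rows, cols = np.indices(shape)
    dist_sq = (rows - center[0]) ** 2 + (cols - center[1]) ** 2
    return np.exp(-dist_sq / (2.0 * sigma ** 2))


def combine_heatmaps(local, spatial):
    """Element-wise product: a response survives only where both heatmaps agree."""
    return local * spatial


def locate(heatmap):
    """Return the (row, col) of the maximum response."""
    idx = np.unravel_index(np.argmax(heatmap), heatmap.shape)
    return tuple(int(i) for i in idx)


shape = (64, 64)

# Hypothetical "local" prediction: sharp but ambiguous.
# True landmark at (20, 30); a look-alike at (45, 30) responds slightly stronger.
local = gaussian_heatmap(shape, (20, 30), sigma=1.5) \
    + 1.1 * gaussian_heatmap(shape, (45, 30), sigma=1.5)

# Hypothetical "spatial" prediction: blurry, but in roughly the right region.
spatial = gaussian_heatmap(shape, (22, 28), sigma=8.0)

combined = combine_heatmaps(local, spatial)

print("local only:", locate(local))
print("combined:  ", locate(combined))
```

In this toy setup the local heatmap alone selects the look-alike, while the product suppresses it and keeps the sharp peak at the true landmark. In the network described in the abstract, both heatmaps are learned jointly through end-to-end training rather than constructed by hand.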

Keywords: Anatomical landmarks; Fully convolutional networks; Heatmap regression; Localization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Anatomic Landmarks*
  • Cephalometry
  • Hand / diagnostic imaging
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Imaging, Three-Dimensional
  • Magnetic Resonance Imaging
  • Neural Networks, Computer*
  • Spine / diagnostic imaging
  • Tomography, X-Ray Computed