Adapting content-based image retrieval techniques for the semantic annotation of medical images

Comput Med Imaging Graph. 2016 Apr:49:37-45. doi: 10.1016/j.compmedimag.2016.01.001. Epub 2016 Feb 4.

Abstract

The automatic annotation of medical images is a prerequisite for building comprehensive semantic archives that can be used to enhance evidence-based diagnosis, physician education, and biomedical research. Annotation also has important applications in the automatic generation of structured radiology reports. Much of the prior research has focused on annotating images with properties such as the modality of the image, or the biological system or body region being imaged. However, many challenges remain for the annotation of high-level semantic content in medical images (e.g., presence of calcification, vessel obstruction, etc.) due to the difficulty in discovering relationships and associations between low-level image features and high-level semantic concepts. This difficulty is further compounded by the lack of labelled training data. In this paper, we present a method for the automatic semantic annotation of medical images that leverages techniques from content-based image retrieval (CBIR). CBIR is a well-established image search technology that uses quantifiable low-level image features to represent the high-level semantic content depicted in those images. Our method extends CBIR techniques to retrieve a collection of labelled images that have similar low-level features and then uses this collection to determine the best high-level semantic annotations. We demonstrate our annotation method using two retrieval strategies, weighted nearest-neighbour retrieval and multi-class classification, to show that our approach is viable regardless of the underlying retrieval strategy. We experimentally compared our method with several well-established baseline techniques (classification and regression) and showed that our method achieved the highest accuracy in the annotation of liver computed tomography (CT) images.
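
The retrieve-then-annotate idea described above can be sketched in a few lines of Python. This is an illustration only, not the authors' implementation: the abstract does not specify the feature set, distance measure, or weighting scheme, so the Euclidean distance, the inverse-distance weights, the function name `annotate`, the parameter `k`, and the toy feature vectors are all assumptions. Only the concept name "calcification" is taken from the abstract.

```python
from collections import defaultdict

import numpy as np


def annotate(query, features, annotations, k=3, eps=1e-9):
    """Pick the highest-weighted value per concept among the k nearest labelled images.

    query       -- 1-D feature vector of the unlabelled image (hypothetical features)
    features    -- 2-D array, one row of low-level features per labelled image
    annotations -- list of dicts mapping concept name -> value, one per labelled image
    """
    # Retrieval step: rank labelled images by Euclidean distance (assumed metric).
    dists = np.linalg.norm(features - query, axis=1)
    nearest = np.argsort(dists)[:k]

    # Annotation step: each retrieved image votes with inverse-distance weight (assumed).
    votes = defaultdict(lambda: defaultdict(float))
    for i in nearest:
        weight = 1.0 / (dists[i] + eps)
        for concept, value in annotations[i].items():
            votes[concept][value] += weight

    # For each concept, keep the value with the largest accumulated weight.
    return {concept: max(vals, key=vals.get) for concept, vals in votes.items()}


# Toy data: four labelled images with made-up 2-D features.
features = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
annotations = [
    {"calcification": "present"},
    {"calcification": "present"},
    {"calcification": "absent"},
    {"calcification": "absent"},
]

result = annotate(np.array([0.05, 0.1]), features, annotations, k=3)
print(result)
```

In this toy example the two closest labelled images dominate the vote, so the query inherits their annotation. A classifier-based retrieval strategy, which the abstract also mentions, would replace only the ranking step.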

Keywords: Computed tomography; Content-based image retrieval; Image annotation; ImageCLEF; Liver.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Data Mining / methods*
  • Database Management Systems
  • Documentation / methods*
  • Humans
  • Imaging, Three-Dimensional / methods
  • Liver / diagnostic imaging*
  • Machine Learning
  • Natural Language Processing*
  • Pattern Recognition, Automated / methods
  • Radiographic Image Enhancement / methods
  • Radiographic Image Interpretation, Computer-Assisted / methods
  • Radiology Information Systems / organization & administration*
  • Reproducibility of Results
  • Semantics
  • Sensitivity and Specificity
  • Terminology as Topic
  • Tomography, X-Ray Computed / methods*