Using an ontology of the human cardiovascular system to improve the classification of histological images

Sci Rep. 2020 Jul 23;10(1):12276. doi: 10.1038/s41598-020-69037-4.

Abstract

The advantages of automatically recognition of fundamental tissues using computer vision techniques are well known, but one of its main limitations is that sometimes it is not possible to classify correctly an image because the visual information is insufficient or the descriptors extracted are not discriminative enough. An Ontology could solve in part this problem, because it gathers and makes possible to use the specific knowledge that allows detecting clear mistakes in the classification, occasionally simply by pointing out impossible configurations in that domain. One of the main contributions of this work is that we used a Histological Ontology to correct, and therefore improve the classification of histological images. First, we described small regions of images, denoted as blocks, using Local Binary Pattern (LBP) based descriptors and we determined which tissue of the cardiovascular system was present using a cascade Support Vector Machine (SVM). Later, we built Resource Description Framework (RDF) triples for the occurrences of each discriminant class. Based on that, we used a Histological Ontology to correct, among others, "not possible" situations, improving in this way the global accuracy in the blocks first and in tissues classification later. For the experimental validation, we used a set of 6000 blocks of [Formula: see text] pixels, obtaining F-Scores between 0.769 and 0.886. Thus, there is an improvement between 0.003 and [Formula: see text] when compared to the approach without the histological ontology. The methodology improves the automatic classification of histological images using a histological ontology. Another advantage of our proposal is that using the Ontology, we were capable of recognising the epithelial tissue, previously not detected by any of the computer vision methods used, including a CNN proposal called HistoResNet evaluated in the experiments. Finally, we also have created and made publicly available a dataset consisting of 6000 blocks of labelled histological tissues.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Cardiovascular Physiological Phenomena*
  • Cardiovascular System / cytology*
  • Cardiovascular System / pathology
  • Computational Biology* / methods
  • Gene Ontology*
  • Histocytochemistry* / methods
  • Humans
  • Image Processing, Computer-Assisted
  • Support Vector Machine