Detection and annotation of plant organs from digitised herbarium scans using deep learning

Biodivers Data J. 2020 Dec 10:8:e57090. doi: 10.3897/BDJ.8.e57090. eCollection 2020.

Abstract

As herbarium specimens are increasingly becoming digitised and accessible in online repositories, advanced computer vision techniques are being used to extract information from them. The presence of certain plant organs on herbarium sheets is useful information in various scientific contexts and automatic recognition of these organs will help mobilise such information. In our study, we use deep learning to detect plant organs on digitised herbarium specimens with Faster R-CNN. For our experiment, we manually annotated hundreds of herbarium scans with thousands of bounding boxes for six types of plant organs and used them for training and evaluating the plant organ detection model. The model worked particularly well on leaves and stems, while flowers were also present in large numbers in the sheets, but were not equally well recognised.

Keywords: convolutional neural networks; deep learning; digitisation; herbarium specimens; image annotation; object detection and localisation; plant organ detection.