Envisaging a global infrastructure to exploit the potential of digitised collections

Biodivers Data J. 2023 Nov 30:11:e109439. doi: 10.3897/BDJ.11.e109439. eCollection 2023.

Abstract

Tens of millions of images from biological collections have become available online over the last two decades. In parallel, there has been a dramatic increase in the capabilities of image analysis technologies, especially those involving machine learning and computer vision. While image analysis has become mainstream in consumer applications, it is still used only on an artisanal basis in the biological collections community, largely because the image corpora are dispersed. Yet, there is massive untapped potential for novel applications and research if images of collection objects could be made accessible in a single corpus. In this paper, we make the case for infrastructure that could support image analysis of collection objects. We show that such infrastructure is entirely feasible and well worth investing in.

Keywords: biodiversity; computer vision; functional traits; machine learning; species identification; specimens.