Labeled temperate hardwood tree stomatal image datasets from seven taxa of Populus and 17 hardwood species

Sci Data. 2024 Jan 2;11(1):1. doi: 10.1038/s41597-023-02657-3.

Abstract

Machine learning (ML) algorithms have shown potential in automatically detecting and measuring stomata. However, ML algorithms require substantial data to efficiently train and optimize models, but their potential is restricted by the limited availability and quality of stomatal images. To overcome this obstacle, we have compiled a collection of around 11,000 unique images of temperate broadleaf angiosperm tree leaf stomata from various projects conducted between 2015 and 2022. The dataset includes over 7,000 images of 17 commonly encountered hardwood species, such as oak, maple, ash, elm, and hickory, and over 3,000 images of 55 genotypes from seven Populus taxa. Inner_guard_cell_walls and whole_stomata (stomatal aperture and guard cells) were labeled and had a corresponding YOLO label file that can be converted into other annotation formats. With the use of our dataset, users can (1) employ state-of-the-art machine learning models to identify, count, and quantify leaf stomata; (2) explore the diverse range of stomatal characteristics across different types of hardwood trees; and (3) develop new indices for measuring stomata.

Publication types

  • Dataset

MeSH terms

  • Genotype
  • Plant Leaves
  • Plant Stomata* / genetics
  • Populus* / genetics
  • Trees