A tea bud segmentation, detection and picking point localization based on the MDY7-3PTB model

Fenyun Zhang; Hongwei Sun; Shuang Xie; Chunwang Dong; You Li; Yiting Xu; Zhengwei Zhang; Fengnong Chen

doi:10.3389/fpls.2023.1199473

Front Plant Sci. 2023 Sep 28;14:1199473. doi: 10.3389/fpls.2023.1199473. eCollection 2023.

Authors

Fenyun Zhang¹, Hongwei Sun¹, Shuang Xie¹, Chunwang Dong², You Li¹, Yiting Xu¹, Zhengwei Zhang¹, Fengnong Chen¹

Affiliations

¹ School of Automation, Hangzhou Dianzi University, Hangzhou, China.
² Tea Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China.

Abstract

Introduction: The identification and localization of tea picking points is a prerequisite for the automatic picking of famous tea. However, because tea buds, young leaves, and old leaves are similar in color, it is difficult for the human eye to distinguish them accurately.

Methods: To address the segmentation, detection, and localization of tea picking points in the complex environment of mechanical picking of famous tea, this paper proposes a new model, MDY7-3PTB, which combines the high-precision segmentation capability of DeepLabv3+ with the rapid detection capability of YOLOv7. The model first segments the tea buds, then detects them, and finally localizes them, yielding accurate identification of the tea bud picking point. The DeepLabv3+ feature extraction network was replaced with the more lightweight MobileNetV2 network to improve computation speed. In addition, multiple attention mechanisms (CBAM) were fused into the feature extraction and ASPP modules to further optimize model performance. Moreover, the Focal Loss function was used to correct the class imbalance in the dataset and to improve segmentation, detection, and positioning accuracy.
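As an illustration of why Focal Loss helps with class imbalance, the following is a minimal sketch of the standard binary form, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t). It is not the authors' implementation: the function names and the values gamma = 2.0 and alpha = 0.25 are assumptions chosen for illustration, and the abstract does not state which settings were used.

```python
import math

def cross_entropy(p, y):
    """Plain binary cross-entropy for one prediction.

    p: predicted probability of the positive class (0 < p < 1)
    y: ground-truth label, 1 or 0
    """
    p_t = p if y == 1 else 1.0 - p
    return -math.log(p_t)

def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss for one prediction.

    gamma and alpha are illustrative defaults, not values from the paper.
    The factor (1 - p_t) ** gamma shrinks the loss of well-classified
    examples, so hard examples dominate the total loss.
    """
    p_t = p if y == 1 else 1.0 - p              # probability of the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

# An easy example (p_t = 0.9) is down-weighted far more than a hard one (p_t = 0.1).
easy_ratio = focal_loss(0.9, 1) / cross_entropy(0.9, 1)
hard_ratio = focal_loss(0.1, 1) / cross_entropy(0.1, 1)
print(easy_ratio < hard_ratio)
```

With gamma set to 0, the modulating factor disappears and the loss reduces to alpha-weighted cross-entropy, which is a convenient sanity check.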

Results and discussion: The MDY7-3PTB model achieved a mean intersection over union (mIoU) of 86.61%, a mean pixel accuracy (mPA) of 93.01%, and a mean recall (mRecall) of 91.78% on the tea bud segmentation dataset, outperforming common segmentation models such as PSPNet, U-Net, and DeepLabv3+. In terms of tea bud picking point recognition and positioning, the model achieved a mean average precision (mAP) of 93.52%, an F1 score (the harmonic mean of precision and recall) of 93.17%, a precision of 97.27%, and a recall of 89.41%. The model showed significant improvements in all aspects compared to existing mainstream YOLO series detection models, with strong versatility and robustness. The method eliminates the influence of the background and directly detects the tea bud picking points with almost no missed detections, providing accurate two-dimensional coordinates for the picking points with a positioning precision of 96.41%. This provides a strong theoretical basis for future tea bud picking.
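The reported metrics can be related to one another with a short sketch. The F1 score is the harmonic mean of precision and recall, and mIoU is the per-class intersection over union averaged over classes. The helper names and the toy confusion matrix below are hypothetical, introduced only for illustration; only the precision and recall values come from the abstract.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (same units in, same units out)."""
    return 2 * precision * recall / (precision + recall)

def mean_iou(conf):
    """Mean intersection over union from a square confusion matrix.

    conf[i][j] = number of pixels of true class i predicted as class j.
    Assumes every class occurs at least once, so no denominator is zero.
    """
    n = len(conf)
    ious = []
    for c in range(n):
        tp = conf[c][c]                               # correctly labeled pixels
        fn = sum(conf[c]) - tp                        # class c pixels missed
        fp = sum(conf[r][c] for r in range(n)) - tp   # pixels wrongly labeled c
        ious.append(tp / (tp + fp + fn))
    return sum(ious) / n

# Precision and recall as reported in the abstract, in percent.
print(f"{f1_score(97.27, 89.41):.2f}")  # prints 93.17, matching the reported F1

# Toy two-class matrix (background, bud); values are made up for illustration.
toy = [[50, 10],
       [5, 35]]
print(mean_iou(toy))
```

The first print confirms that the reported F1 score of 93.17% is consistent with the reported precision and recall.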

Keywords: DeepLabv3+; YOLOv7; deep learning; focal loss; multi-attention mechanism; tea bud picking point.

Grants and funding

This project was supported by the Zhejiang Provincial Natural Science Foundation of China under Grant No. LQ21C130007, and by the Open Fund of the Key Laboratory of Transplanting Equipment and Technology of Zhejiang Province under Grant No. 2023E10013-05.