Evaluating YOLO architectures for detecting road killed endangered Brazilian animals

Gabriel Souto Ferrante; Luis Hideo Vasconcelos Nakamura; Sandra Sampaio; Geraldo Pereira Rocha Filho; Rodolfo Ipolito Meneguette

doi:10.1038/s41598-024-52054-y

Evaluating YOLO architectures for detecting road killed endangered Brazilian animals

Sci Rep. 2024 Jan 16;14(1):1353. doi: 10.1038/s41598-024-52054-y.

Authors

Gabriel Souto Ferrante¹, Luis Hideo Vasconcelos Nakamura², Sandra Sampaio³, Geraldo Pereira Rocha Filho⁴, Rodolfo Ipolito Meneguette⁵

Affiliations

¹ Institute of Science Mathematics and Computer Science, University of São Paulo, 400 Trabalhador São-carlense Avenue, São Carlos, São Paulo, 13566-590, Brazil. g.ferrante@usp.br.
² Departament of Informatics, Federal Institute of São Paulo - Campus Catanduva, 239 Pastor José Dutra de Moraes Avenue, Catanduva, São Paulo, 15808-305, Brazil.
³ Department of Computer Science, University of Manchester, Oxford Rd, Manchester, M13 9PL, UK.
⁴ Department of Exact and Technological Sciences, State University of Southwest Bahia, Estr. Bem Querer, Km-04, Vitória da Conquista, BA, 45083-900, Brazil.
⁵ Institute of Science Mathematics and Computer Science, University of São Paulo, 400 Trabalhador São-carlense Avenue, São Carlos, São Paulo, 13566-590, Brazil.

Abstract

Wildlife roadkill is a recurring, dangerous problem that affects both humans and animals and has received increasing attention from environmentalists worldwide. Addressing this problem is difficult due to the high investments required in road infrastructure to effectively reduce wildlife vehicle collisions. Despite recent applications of machine learning techniques in low-cost and economically viable detection systems, e.g., for alerting drivers about the presence of animals and collecting statistics on endangered animal species, the success and wide adoption of these systems depend heavily on the availability of data for system training. The lack of training data negatively impacts the feature extraction of machine learning models, which is crucial for successful animal detection and classification. In this paper, we evaluate the performance of several state-of-the-art object detection models on limited data for model training. The selected models are based on the YOLO architecture, which is well-suited for and commonly used in real-time object detection. These include the YoloV4, Scaled-YoloV4, YoloV5, YoloR, YoloX, and YoloV7 models. We focus on Brazilian endangered animal species and use the BRA-Dataset for model training. We also assess the effectiveness of data augmentation and transfer learning techniques in our evaluation. The models are compared using summary metrics such as precision, recall, mAP, and FPS and are qualitatively analyzed considering classic computer vision problems. The results show that the architecture with the best results against false negatives is Scaled-YoloV4, while the best FPS detection score is the nano version of YoloV5.

MeSH terms

Animals
Animals, Wild*
Benchmarking*
Brazil
Compulsive Behavior
Endangered Species
Humans

Abstract

MeSH terms

Grants and funding