Leveraging cross-view geo-localization with ensemble learning and temporal awareness

Abdulrahman Ghanem; Ahmed Abdelhay; Noor Eldeen Salah; Ahmed Nour Eldeen; Mohammed Elhenawy; Mahmoud Masoud; Ammar M Hassan; Abdallah A Hassan

doi:10.1371/journal.pone.0283672


PLoS One. 2023 Mar 30;18(3):e0283672. doi: 10.1371/journal.pone.0283672. eCollection 2023.

Authors

Abdulrahman Ghanem¹, Ahmed Abdelhay¹, Noor Eldeen Salah¹, Ahmed Nour Eldeen¹, Mohammed Elhenawy², Mahmoud Masoud³, Ammar M Hassan⁴, Abdallah A Hassan¹

Affiliations

¹ Computer and Systems Engineering Department, Faculty of Engineering, Minia University, Minia, Egypt.
² Centre for Accident Research and Road Safety-Queensland (CARRS-Q), Queensland University of Technology, Brisbane, Australia.
³ Department of Information Systems & Operations Management, and Interdisciplinary Research Center for Smart Mobility and Logistics, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia.
⁴ Arab Academy for Science, Technology, and Maritime Transport, South Valley Branch, Aswan, Egypt.

Abstract

The Global Navigation Satellite System (GNSS) is unreliable in some situations. To compensate for a poor GNSS signal, an autonomous vehicle can self-localize by matching a ground image against a database of geotagged aerial images. However, this approach is challenging because of the dramatic difference in viewpoint between aerial and ground views, harsh weather and lighting conditions, and the lack of orientation information in training and deployment environments. In this paper, it is shown that previous models in this area are complementary, not competitive, and that each model solves a different aspect of the problem, which calls for a holistic approach. An ensemble model is proposed to aggregate the predictions of multiple independently trained state-of-the-art (SOTA) models. Previous SOTA temporal-aware models used a heavyweight network to fuse temporal information into the query process. Here, the effect of making the query process temporal-aware is explored and exploited by an efficient meta block: naive history. Because none of the existing benchmark datasets was suitable for extensive temporal-awareness experiments, a new derivative dataset based on the BDD100K dataset is generated. The proposed ensemble model achieves a recall accuracy R@1 (Recall@1, which considers only the topmost prediction) of 97.74% on the CVUSA dataset and 91.43% on the CVACT dataset, surpassing the current SOTA. The temporal awareness algorithm converges to an R@1 of 100% by looking only a few steps back in the trip history.
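
As an illustration of the idea that complementary models can be aggregated and scored with Recall@1, a minimal sketch follows. It is not the paper's implementation: the abstract does not specify how the predictions are aggregated, so the score averaging, the per-query min-max normalization, the function names, and the toy similarity values are all assumptions made for this example.

```python
from typing import List, Sequence

# scores[q][r] = similarity between ground query q and aerial reference r
Matrix = List[List[float]]


def normalize_rows(scores: Matrix) -> Matrix:
    """Min-max scale each query's scores to [0, 1] so models are comparable."""
    out = []
    for row in scores:
        lo, hi = min(row), max(row)
        span = hi - lo
        out.append([(s - lo) / span if span > 0 else 0.0 for s in row])
    return out


def ensemble_scores(per_model: Sequence[Matrix]) -> Matrix:
    """Average the normalized scores of all models (assumed aggregation rule)."""
    normed = [normalize_rows(m) for m in per_model]
    n_queries = len(normed[0])
    n_refs = len(normed[0][0])
    return [
        [sum(m[q][r] for m in normed) / len(normed) for r in range(n_refs)]
        for q in range(n_queries)
    ]


def recall_at_k(scores: Matrix, true_index: Sequence[int], k: int = 1) -> float:
    """Fraction of queries whose true reference is among the k highest scores."""
    hits = 0
    for row, truth in zip(scores, true_index):
        top_k = sorted(range(len(row)), key=lambda r: row[r], reverse=True)[:k]
        hits += truth in top_k
    return hits / len(true_index)


# Hypothetical toy data: 3 ground queries, 3 aerial references.
truth = [0, 1, 2]
model_a = [[0.9, 0.1, 0.2], [0.2, 0.8, 0.3], [0.1, 0.6, 0.5]]  # misses query 2
model_b = [[0.7, 0.2, 0.1], [0.6, 0.5, 0.1], [0.1, 0.2, 0.9]]  # misses query 1

print(recall_at_k(model_a, truth))
print(recall_at_k(model_b, truth))
print(recall_at_k(ensemble_scores([model_a, model_b]), truth))
```

In this toy setting each model alone retrieves two of the three queries correctly, and because they fail on different queries, the averaged scores retrieve all three. Real gains depend on how complementary the trained models actually are.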

Copyright: © 2023 Ghanem et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

MeSH terms

Algorithms*
Autonomous Vehicles
Benchmarking
Learning*
Machine Learning

Grants and funding

The author(s) received no specific funding for this work.