Deep learning exoplanets detection by combining real and synthetic data

PLoS One. 2022 May 25;17(5):e0268199. doi: 10.1371/journal.pone.0268199. eCollection 2022.

Abstract

Scientists and astronomers have attached great importance to the task of discovering new exoplanets, even more so if they are in the habitable zone. To date, more than 4300 exoplanets have been confirmed by NASA, using various discovery techniques, including planetary transits, in addition to the use of various databases provided by space and ground-based telescopes. This article proposes the development of a deep learning system for detecting planetary transits in Kepler Telescope light curves. The approach is based on related work from the literature and enhanced to validation with real light curves. A CNN classification model is trained from a mixture of real and synthetic data. The model is then validated only with unknown real data. The best ratio of synthetic data is determined by the performance of an optimisation technique and a sensitivity analysis. The precision, accuracy and true positive rate of the best model obtained are determined and compared with other similar works. The results demonstrate that the use of synthetic data on the training stage can improve the transit detection performance on real light curves.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning*
  • Exobiology / methods
  • Extraterrestrial Environment
  • Planets
  • Telescopes*

Grants and funding

This research was supported in part by the Chilean National Agency for Research and Development (ANID) under Projects FONDECYT 1191188 and 1190486, and PhD Scholarship 21221393. The National University of Distance Education under Project 2021V/-TAJOV/00 and Ministry of Science and Innovation of Spain under Project PID2019-108377RB-C32.