Next-generation deep learning based on simulators and synthetic data

Trends Cogn Sci. 2022 Feb;26(2):174-187. doi: 10.1016/j.tics.2021.11.008. Epub 2021 Dec 23.

Abstract

Deep learning (DL) is being successfully applied across multiple domains, yet these models learn in a most artificial way: they require large quantities of labeled data to grasp even simple concepts. Thus, the main bottleneck is often access to supervised data. Here, we highlight a trend in a potential solution to this challenge: synthetic data. Synthetic data are becoming accessible due to progress in rendering pipelines, generative adversarial models, and fusion models. Moreover, advancements in domain adaptation techniques help close the statistical gap between synthetic and real data. Paradoxically, this artificial solution is also likely to enable more natural learning, as seen in biological systems, including continual, multimodal, and embodied learning. Complementary to this, simulators and deep neural networks (DNNs) will also have a critical role in providing insight into the cognitive and neural functioning of biological systems. We also review the strengths of, and opportunities and novel challenges associated with, synthetic data.

Keywords: deep neural networks; domain adaptation; generative adversarial networks; graphics-rendering pipelines; next-generation learning; synthetic data.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Deep Learning*
  • Humans
  • Neural Networks, Computer