What Does DALL-E 2 Know About Radiology?

J Med Internet Res. 2023 Mar 16:25:e43110. doi: 10.2196/43110.

Abstract

Generative models, such as DALL-E 2 (OpenAI), could represent promising future tools for image generation, augmentation, and manipulation for artificial intelligence research in radiology, provided that these models have sufficient medical domain knowledge. Herein, we show that DALL-E 2 has learned relevant representations of x-ray images, with promising capabilities in terms of zero-shot text-to-image generation of new images, the continuation of an image beyond its original boundaries, and the removal of elements; however, its capabilities for the generation of images with pathological abnormalities (eg, tumors, fractures, and inflammation) or computed tomography, magnetic resonance imaging, or ultrasound images are still limited. The use of generative models for augmenting and generating radiological data thus seems feasible, even if the further fine-tuning and adaptation of these models to their respective domains are required first.

Keywords: DALL-E; artificial intelligence; creating images from text; diagnostic imaging; generative model; image creation; image generation; machine learning; medical imaging; radiology; text-to-image; transformer language model; x-ray.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence*
  • Humans
  • Magnetic Resonance Imaging / methods
  • Radiology*
  • Tomography, X-Ray Computed / methods
  • Ultrasonography