Survey on natural language processing in medical image analysis

Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2022 Aug 28;47(8):981-993. doi: 10.11817/j.issn.1672-7347.2022.220376.
[Article in English, Chinese]

Abstract

Recent advancement in natural language processing (NLP) and medical imaging empowers the wide applicability of deep learning models. These developments have increased not only data understanding, but also knowledge of state-of-the-art architectures and their real-world potentials. Medical imaging researchers have recognized the limitations of only targeting images, as well as the importance of integrating multimodal inputs into medical image analysis. The lack of comprehensive surveys of the current literature, however, impedes the progress of this domain. Existing research perspectives, as well as the architectures, tasks, datasets, and performance measures examined in the present literature, are reviewed in this work, and we also provide a brief description of possible future directions in the field, aiming to provide researchers and healthcare professionals with a detailed summary of existing academic research and to provide rational insights to facilitate future research.

自然语言处理和医学影像的发展使深度学习模型能够在各种领域和各种数据模态中表现出出色的通用性。这些进步不仅加深了对数据的理解,而且促进了学界对最先进架构及其前景的认识。医学影像研究人员已经认识到仅针对图像研究的不足之处,以及对多模态输入进行综合分析的重要意义。但是,目前相关综述论文的缺乏不利于这个研究方向的发展。本篇综述介绍了自然语言处理和医学图像结合这一领域的背景,并分5个主题回顾现有文献的研究目标、模型架构、目标任务、实验数据和性能指标,还对该领域未来可能的发展方向进行简要描述,旨在为研究人员和医护人员提供现有学术研究的详细总结,提出理性的见解,进而促进未来的研究。.

Keywords: deep learning; medical imaging; multimodal input; natural language processing.

MeSH terms

  • Humans
  • Natural Language Processing*
  • Surveys and Questionnaires