Transfer language space with similar domain adaptation: a case study with hepatocellular carcinoma

Amara Tariq; Omar Kallas; Patricia Balthazar; Scott Jeffery Lee; Terry Desser; Daniel Rubin; Judy Wawira Gichoya; Imon Banerjee

doi:10.1186/s13326-022-00262-8

J Biomed Semantics. 2022 Feb 23;13(1):8. doi: 10.1186/s13326-022-00262-8.

Authors

Amara Tariq¹, Omar Kallas², Patricia Balthazar², Scott Jeffery Lee², Terry Desser³, Daniel Rubin³ ⁴, Judy Wawira Gichoya⁵ ², Imon Banerjee⁵ ²

Affiliations

¹ Machine Intelligence in Medicine and Imaging (MI ∙2) Lab, Mayo Clinic, Phoenix, AZ, USA. tariq.amara2@mayo.edu.
² Department of Radiology, Emory University, Atlanta, GA, USA.
³ Department of Radiology, Stanford University, Palo Alto, CA, USA.
⁴ Department of Biomedical Data Science, Stanford University, Palo Alto, CA, USA.
⁵ Machine Intelligence in Medicine and Imaging (MI ∙2) Lab, Mayo Clinic, Phoenix, AZ, USA.

Abstract

Background: Transfer learning is common practice in deep-learning image classification, where the available data are often too limited to train a complex model with millions of parameters. Transferring language models, however, requires special attention: vocabularies from different domains (e.g., MR and US reports) do not always overlap, whereas pixel intensity ranges largely overlap across images.
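The vocabulary mismatch described in the Background can be made concrete with a small sketch. The following is only an illustration under stated assumptions, not the authors' method: the two report snippets are invented, the helper names `vocabulary` and `overlap_stats` are hypothetical, and Jaccard overlap is just one plausible way to quantify how much two report vocabularies share.

```python
import re


def vocabulary(reports):
    """Return the set of lowercase word tokens found in a list of report strings."""
    tokens = set()
    for text in reports:
        tokens.update(re.findall(r"[a-z]+", text.lower()))
    return tokens


def overlap_stats(source_vocab, target_vocab):
    """Return (Jaccard overlap, fraction of target words absent from the source)."""
    shared = source_vocab & target_vocab
    union = source_vocab | target_vocab
    jaccard = len(shared) / len(union) if union else 0.0
    missing = target_vocab - source_vocab
    oov_rate = len(missing) / len(target_vocab) if target_vocab else 0.0
    return jaccard, oov_rate


# Hypothetical report snippets, invented for illustration only.
us_reports = ["Liver echogenicity is normal. No focal lesion."]
mr_reports = ["No arterial enhancement or washout. No focal lesion in the liver."]

us_vocab = vocabulary(us_reports)
mr_vocab = vocabulary(mr_reports)
jaccard, oov_rate = overlap_stats(us_vocab, mr_vocab)
print(f"Jaccard overlap: {jaccard:.2f}; MR words unseen in US: {oov_rate:.2f}")
```

A high out-of-vocabulary rate for the target modality would suggest that a language model trained on the source reports has no embeddings for many target terms, which is the situation that motivates adapting the language space rather than reusing it unchanged.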

Method: We present the concept of similar domain adaptation, in which we transfer inter-institutional language models (context-dependent and context-independent) between two different modalities (ultrasound and MRI) to capture liver abnormalities.

Results: Using MR and US screening exam reports for hepatocellular carcinoma as the use case, we apply the transfer language space strategy to automatically label imaging exams, with and without a structured template, achieving an average F1-score > 0.9.

Conclusion: We conclude that transfer learning combined with fine-tuning of the discriminative model is often more effective for shared targeted tasks than training a language space from scratch.

Keywords: BERT; Language model; Radiology report; Transfer learning; Word2vec.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Carcinoma, Hepatocellular* / diagnostic imaging
Humans
Language
Liver Neoplasms* / diagnostic imaging
Natural Language Processing