Categorizing metadata to help mobilize computable biomedical knowledge

Brian S Alper; Allen Flynn; Bruce E Bray; Marisa L Conte; Christina Eldredge; Sigfried Gold; Robert A Greenes; Peter Haug; Kim Jacoby; Gunes Koru; James McClay; Marc L Sainvil; Davide Sottara; Mark Tuttle; Shyam Visweswaran; Robin Ann Yurk

doi:10.1002/lrh2.10271

Categorizing metadata to help mobilize computable biomedical knowledge

Learn Health Syst. 2021 May 9;6(1):e10271. doi: 10.1002/lrh2.10271. eCollection 2022 Jan.

Authors

Brian S Alper¹, Allen Flynn², Bruce E Bray³, Marisa L Conte⁴, Christina Eldredge⁵, Sigfried Gold⁶, Robert A Greenes⁷, Peter Haug⁸, Kim Jacoby⁹, Gunes Koru¹⁰, James McClay¹¹, Marc L Sainvil¹², Davide Sottara¹², Mark Tuttle¹³, Shyam Visweswaran¹⁴, Robin Ann Yurk¹⁵

Affiliations

¹ Computable Publishing LLC Ipswich Massachusetts USA.
² Medical School University of Michigan Ann Arbor Michigan USA.
³ Biomedical Informatics and Cardiovascular Medicine School of Medicine, University of Utah Salt Lake City Utah USA.
⁴ Taubman Health Sciences Library, University of Michigan Ann Arbor Michigan USA.
⁵ School of Information University of South Florida Tampa Florida USA.
⁶ College of Information Studies University of Maryland College Park Maryland USA.
⁷ Arizona State University and Mayo Clinic. Scottsdale Arizona USA.
⁸ Intermountain Healthcare University of Utah Salt Lake City Utah USA.
⁹ Komodo Health San Francisco California USA.
¹⁰ Department of Information Systems University of Maryland Baltimore Maryland USA.
¹¹ Emergency Medicine University of Nebraska Medical Center Omaha Nebraska USA.
¹² Mayo Clinic Scottsdale Arizona USA.
¹³ Apelon Hartford Connecticut USA.
¹⁴ Department of Biomedical Informatics University of Pittsburgh Pittsburgh Pennsylvania USA.
¹⁵ MDyurk West Bloomfield Michigan USA.

Abstract

Introduction: Computable biomedical knowledge artifacts (CBKs) are digital objects conveying biomedical knowledge in machine-interpretable structures. As more CBKs are produced and their complexity increases, the value obtained from sharing CBKs grows. Mobilizing CBKs and sharing them widely can only be achieved if the CBKs are findable, accessible, interoperable, reusable, and trustable (FAIR+T). To help mobilize CBKs, we describe our efforts to outline metadata categories to make CBKs FAIR+T.

Methods: We examined the literature regarding metadata with the potential to make digital artifacts FAIR+T. We also examined metadata available online today for actual CBKs of 12 different types. With iterative refinement, we came to a consensus on key categories of metadata that, when taken together, can make CBKs FAIR+T. We use subject-predicate-object triples to more clearly differentiate metadata categories.

Results: We defined 13 categories of CBK metadata most relevant to making CBKs FAIR+T. Eleven of these categories (type, domain, purpose, identification, location, CBK-to-CBK relationships, technical, authorization and rights management, provenance, evidential basis, and evidence from use metadata) are evident today where CBKs are stored online. Two additional categories (preservation and integrity metadata) were not evident in our examples. We provide a research agenda to guide further study and development of these and other metadata categories.

Conclusion: A wide variety of metadata elements in various categories is needed to make CBKs FAIR+T. More work is needed to develop a common framework for CBK metadata that can make CBKs FAIR+T for all stakeholders.

Keywords: FAIR principles; computable biomedical knowledge; digital objects; metadata; trust.