Semantic Annotation of Experimental Methods in Analytical Chemistry

Anal Chem. 2022 Nov 8;94(44):15464-15471. doi: 10.1021/acs.analchem.2c03565. Epub 2022 Oct 25.

Abstract

A major obstacle for reusing and integrating existing data is finding the data that is most relevant in a given context. The primary metadata resource is the scientific literature describing the experiments that produced the data. To stimulate the development of natural language processing methods for extracting this information from articles, we have manually annotated 100 recent open access publications in Analytical Chemistry as semantic graphs. We focused on articles mentioning mass spectrometry in their experimental sections, as we are particularly interested in the topic, which is also within the domain of several ontologies and controlled vocabularies. The resulting gold standard dataset is publicly available and directly applicable to validating automated methods for retrieving this metadata from the literature. In the process, we also made a number of observations on the structure and description of experiments and open access publication in this journal.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chemistry, Analytic
  • Natural Language Processing*
  • Research Design
  • Semantics*