Synthetic heparan sulfate standards and machine learning facilitate the development of solid-state nanopore analysis

Proc Natl Acad Sci U S A. 2021 Mar 16;118(11):e2022806118. doi: 10.1073/pnas.2022806118.

Abstract

The application of solid-state (SS) nanopore devices to single-molecule nucleic acid sequencing has been challenging. The early successes in applying SS nanopore devices to a more difficult class of biopolymers, glycosaminoglycans (GAGs), have therefore been surprising. They motivated us to examine the potential use of an SS nanopore to analyze synthetic heparan sulfate GAG chains of controlled composition and sequence, prepared through a promising, recently developed chemoenzymatic route. A minimal representation of the nanopore data, using only signal magnitude and duration, revealed clear differences between the signals generated by four synthetic GAGs, both by eye and by image recognition algorithms. Subsequent machine learning made it possible to determine the disaccharide and even monosaccharide composition of these four synthetic GAGs using as few as 500 events, corresponding to a zeptomole of sample. These data suggest that ultrasensitive GAG analysis may be possible using SS nanopore detection and well-characterized molecular training sets.
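The two quantitative ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes that each detected event corresponds to one molecule, and that the "minimal representation" can be approximated by a coarse two-dimensional histogram of signal magnitude against log duration. The function names, bin count, and units are hypothetical.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole (exact SI definition)


def events_to_zeptomoles(n_events):
    """Amount of analyte in zeptomoles, ASSUMING one molecule per event."""
    moles = n_events / AVOGADRO
    return moles / 1e-21


def event_histogram(events, n_bins=4):
    """Bin (magnitude, duration) pairs into an n_bins x n_bins count grid.

    Hypothetical stand-in for an image-like representation of nanopore
    events: rows index signal magnitude, columns index log10(duration).
    Durations must be positive; units are whatever the caller uses.
    """
    mags = [m for m, _ in events]
    log_durs = [math.log10(d) for _, d in events]

    def bin_index(value, lo, hi):
        # Map value in [lo, hi] to a bin; the top edge falls in the last bin.
        if hi == lo:
            return 0
        idx = int((value - lo) / (hi - lo) * n_bins)
        return min(idx, n_bins - 1)

    m_lo, m_hi = min(mags), max(mags)
    d_lo, d_hi = min(log_durs), max(log_durs)

    grid = [[0] * n_bins for _ in range(n_bins)]
    for m, d in zip(mags, log_durs):
        grid[bin_index(m, m_lo, m_hi)][bin_index(d, d_lo, d_hi)] += 1
    return grid


if __name__ == "__main__":
    # 500 events is a little under one zeptomole under the stated assumption.
    print(round(events_to_zeptomoles(500), 2))
    # Made-up events: (relative magnitude, duration in seconds).
    demo = [(0.1, 1e-4), (0.1, 1e-4), (0.9, 1e-2)]
    print(event_histogram(demo, n_bins=2))
```

Under the one-molecule-per-event assumption, 500 events correspond to about 0.83 zmol, consistent with the "zeptomole" figure quoted above. A grid like the one returned by `event_histogram` could then be passed to an image classifier, though the classifier actually used in the paper is not described in this abstract.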

Keywords: glycosaminoglycan; polysaccharide; sequencing; single-molecule analysis; solid-state nanopore.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Carbohydrate Sequence
  • Disaccharides / chemistry
  • Glycomics / methods
  • Glycomics / standards
  • Heparitin Sulfate / chemical synthesis
  • Heparitin Sulfate / chemistry*
  • Machine Learning*
  • Monosaccharides / chemistry
  • Nanopores*

Substances

  • Disaccharides
  • Monosaccharides
  • Heparitin Sulfate