High-throughput cryo-ET structural pattern mining by unsupervised deep iterative subtomogram clustering

Proc Natl Acad Sci U S A. 2023 Apr 11;120(15):e2213149120. doi: 10.1073/pnas.2213149120. Epub 2023 Apr 7.

Abstract

Cryoelectron tomography directly visualizes heterogeneous macromolecular structures in their native and complex cellular environments. However, existing computer-assisted structure sorting approaches are low throughput or inherently limited due to their dependency on available templates and manual labels. Here, we introduce a high-throughput template-and-label-free deep learning approach, Deep Iterative Subtomogram Clustering Approach (DISCA), that automatically detects subsets of homogeneous structures by learning and modeling 3D structural features and their distributions. Evaluation on five experimental cryo-ET datasets shows that an unsupervised deep learning based method can detect diverse structures with a wide range of molecular sizes. This unsupervised detection paves the way for systematic unbiased recognition of macromolecular complexes in situ.

Keywords: cryoelectron tomography; image clustering; macromolecular complexes; unsupervised learning.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Cluster Analysis
  • Cryoelectron Microscopy / methods
  • Electron Microscope Tomography* / methods
  • Image Processing, Computer-Assisted* / methods
  • Macromolecular Substances / chemistry
  • Molecular Structure

Substances

  • Macromolecular Substances