An integration of fast alignment and maximum-likelihood methods for electron subtomogram averaging and classification

Bioinformatics. 2018 Jul 1;34(13):i227-i236. doi: 10.1093/bioinformatics/bty267.

Abstract

Motivation: Cellular Electron CryoTomography (CECT) is an emerging 3D imaging technique that visualizes subcellular organization of single cells at sub-molecular resolution and in near-native state. CECT captures large numbers of macromolecular complexes of highly diverse structures and abundances. However, the structural complexity and imaging limits complicate the systematic de novo structural recovery and recognition of these macromolecular complexes. Efficient and accurate reference-free subtomogram averaging and classification represent the most critical tasks for such analysis. Existing subtomogram alignment based methods are prone to the missing wedge effects and low signal-to-noise ratio (SNR). Moreover, existing maximum-likelihood based methods rely on integration operations, which are in principle computationally infeasible for accurate calculation.

Results: Built on existing works, we propose an integrated method, Fast Alignment Maximum Likelihood method (FAML), which uses fast subtomogram alignment to sample sub-optimal rigid transformations. The transformations are then used to approximate integrals for maximum-likelihood update of subtomogram averages through expectation-maximization algorithm. Our tests on simulated and experimental subtomograms showed that, compared to our previously developed fast alignment method (FA), FAML is significantly more robust to noise and missing wedge effects with moderate increases of computation cost. Besides, FAML performs well with significantly fewer input subtomograms when the FA method fails. Therefore, FAML can serve as a key component for improved construction of initial structural models from macromolecules captured by CECT.

Availability and implementation: http://www.cs.cmu.edu/mxu1.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Electron Microscope Tomography / methods*
  • Image Interpretation, Computer-Assisted / methods
  • Image Processing, Computer-Assisted / methods*
  • Imaging, Three-Dimensional / methods
  • Likelihood Functions
  • Macromolecular Substances / chemistry*
  • Macromolecular Substances / classification
  • Macromolecular Substances / metabolism
  • Models, Molecular*
  • Molecular Conformation
  • Proteasome Endopeptidase Complex / chemistry
  • Proteasome Endopeptidase Complex / metabolism
  • Ribosomes / chemistry
  • Ribosomes / metabolism
  • Signal-To-Noise Ratio
  • Software*

Substances

  • Macromolecular Substances
  • Proteasome Endopeptidase Complex