DomainFit: Identification of Protein Domains in cryo-EM maps at Intermediate Resolution using AlphaFold2-predicted Models

bioRxiv [Preprint]. 2023 Nov 28:2023.11.28.569001. doi: 10.1101/2023.11.28.569001.

Abstract

Cryo-electron microscopy (cryo-EM) has revolutionized our understanding of macromolecular complexes, enabling high-resolution structure determination. With the paradigm shift to in situ structural biology recently driven by the ground-breaking development of cryo-focused ion beam milling and cryo-electron tomography, there are an increasing number of structures at sub-nanometer resolution of complexes solved directly within their cellular environment. These cellular complexes often contain unidentified proteins, related to different cellular states or processes. Identifying proteins at resolutions lower than 4 Å remains challenging because the side chains cannot be visualized reliably. Here, we present DomainFit, a program for automated domain-level protein identification from cryo-EM maps at resolutions lower than 4 Å. By fitting domains from artificial intelligence-predicted models such as AlphaFold2-predicted models into cryo-EM maps, the program performs statistical analyses and attempts to identify the proteins forming the density. Using DomainFit, we identified two microtubule inner proteins, one of them, a CCDC81 domain-containing protein, is exclusively localized in the proximal region of the doublet microtubule from the ciliate Tetrahymena thermophila. The flexibility and capability of DomainFit makes it a valuable tool for analyzing in situ structures.

Keywords: AlphaFold2 modelling; cilia; cryo-EM; cryo-ET; doublet; microtubule inner protein; protein identification; subtomogram averaging.

Publication types

  • Preprint