The AlphaFold Protein Structure Database (AlphaFold DB) archives a vast number of predicted models. We systematically mined AlphaFold DB and discovered an uncharacterized P-loop NTPase family. The structure of this protein family was surprisingly novel, showing an atypical topology for P-loop NTPases, noticeable twofold symmetry, and two pairs of independent putative active sites. Our findings show that structural data mining is a powerful approach to identifying undiscovered protein families.
Keywords: NTPase; dark proteome; protein discovery; structure mining.
© 2024 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.