Identification of putative essential protein domains from high-density transposon insertion sequencing

A S M Zisanur Rahman; Lukas Timmerman; Flyn Gallardo; Silvia T Cardona

doi:10.1038/s41598-022-05028-x

Identification of putative essential protein domains from high-density transposon insertion sequencing

Sci Rep. 2022 Jan 19;12(1):962. doi: 10.1038/s41598-022-05028-x.

Authors

A S M Zisanur Rahman¹, Lukas Timmerman², Flyn Gallardo¹, Silvia T Cardona^{3

4}

Affiliations

¹ Department of Microbiology, University of Manitoba, Winnipeg, MB, Canada.
² Department of Computer Science, University of Manitoba, Winnipeg, MB, Canada.
³ Department of Microbiology, University of Manitoba, Winnipeg, MB, Canada. silvia.cardona@umanitoba.ca.
⁴ Department of Medical Microbiology and Infectious Diseases, University of Manitoba, Winnipeg, Canada. silvia.cardona@umanitoba.ca.

Abstract

A first clue to gene function can be obtained by examining whether a gene is required for life in certain standard conditions, that is, whether a gene is essential. In bacteria, essential genes are usually identified by high-density transposon mutagenesis followed by sequencing of insertion sites (Tn-seq). These studies assign the term "essential" to whole genes rather than the protein domain sequences that encode the essential functions. However, genes can code for multiple protein domains that evolve their functions independently. Therefore, when essential genes code for more than one protein domain, only one of them could be essential. In this study, we defined this subset of genes as "essential domain-containing" (EDC) genes. Using a Tn-seq data set built-in Burkholderia cenocepacia K56-2, we developed an in silico pipeline to identify EDC genes and the essential protein domains they encode. We found forty candidate EDC genes and demonstrated growth defect phenotypes using CRISPR interference (CRISPRi). This analysis included two knockdowns of genes encoding the protein domains of unknown function DUF2213 and DUF4148. These putative essential domains are conserved in more than two hundred bacterial species, including human and plant pathogens. Together, our study suggests that essentiality should be assigned to individual protein domains rather than genes, contributing to a first functional characterization of protein domains of unknown function.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bacterial Proteins / genetics*
Burkholderia cenocepacia / genetics*
Burkholderia cenocepacia / growth & development
Escherichia coli
Genes, Essential*
Protein Domains*

Substances

Bacterial Proteins

Grants and funding

5211/CIHR/Canada