In Silico Characterization of Uncharacterized Proteins From Multiple Strains of Clostridium Difficile

Front Genet. 2022 Aug 11:13:878012. doi: 10.3389/fgene.2022.878012. eCollection 2022.

Abstract

Clostridium difficile (C. difficile) is a multi-strain, spore-forming, Gram-positive, opportunistic enteropathogen bacteria, majorly associated with nosocomial infections, resulting in severe diarrhoea and colon inflammation. Several antibiotics including penicillin, tetracycline, and clindamycin have been employed to control C. difficile infection, but studies have suggested that injudicious use of antibiotics has led to the development of resistance in C. difficile strains. However, many proteins from its genome are still considered uncharacterized proteins that might serve crucial functions and assist in the biological understanding of the organism. In this study, we aimed to annotate and characterise the 6 C. difficile strains using in silico approaches. We first analysed the complete genome of 6 C. difficile strains using standardised approaches and analysed hypothetical proteins (HPs) employing various bioinformatics approaches coalescing, including identifying contigs, coding sequences, phage sequences, CRISPR-Cas9 systems, antimicrobial resistance determination, membrane helices, instability index, secretory nature, conserved domain, and vaccine target properties like comparative homology analysis, allergenicity, antigenicity determination along with structure prediction and binding-site analysis. This study provides crucial supporting information about the functional characterization of the HPs involved in the pathophysiology of the disease. Moreover, this information also aims to assist in mechanisms associated with bacterial pathogenesis and further design candidate inhibitors and bona fide pharmaceutical targets.

Keywords: annotation; clostridium difficile; essential genes; function abbreviations C. difficile-clostridium difficile CDI-C. difficile infection; uncharacterized proteins.