Reconstruction of Acetogenesis Pathway Using Short-Read Sequencing of Clostridium aceticum Genome

J Nanosci Nanotechnol. 2015 May;15(5):3852-61. doi: 10.1166/jnn.2015.9537.

Abstract

Clostridium aceticum is an anaerobic homoacetogen, able to reduce CO2 to multi-carbon products using the reductive acetyl-CoA pathway. This unique ability to use CO2 or CO makes the microbe a potential platform for the biotech industry. However, the development of genetically engineered homoacetogen for the large-scale production of commodity chemicals is hampered by the limited amount of their genetic and metabolic information. Here we exploited next-generation sequencing to reveal C. aceticum genome. The short-read sequencing produced 44,871,196 high quality reads with an average length of 248 bases. Following sequence trimming step, 30,256,976 reads were assembled into 12,563 contigs with 168-fold coverage and 1,971 bases in length using de Bruijn graph algorithm. Since the k-mer hash length in the algorithm is an important factor for the quality of output contigs, a window of k-mers (k-51 to k-201) was tested to obtain high quality contigs. In addition to the assembly metrics, the functional annotation of the contigs was investigated to select the k-mer optimum. Metabolic pathway mapping using the functional annotation identified the majority of central metabolic pathways, such as the glycolysis and TCA cycle. Further, these analyses elucidated the enzymes consisting of Wood-Ljungdahl pathway, in which CO2 is fixed into acetyl-CoA. Thus, the metabolic reconstruction based on the draft genome assembly provides a foundation for the functional genomics required to engineer C. aceticum.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Clostridium / genetics*
  • Clostridium / metabolism
  • DNA, Bacterial / analysis
  • DNA, Bacterial / genetics
  • Genome, Bacterial / genetics*
  • High-Throughput Nucleotide Sequencing / methods*
  • Metabolic Networks and Pathways / genetics*
  • Molecular Sequence Data
  • Sequence Alignment
  • Sequence Analysis, DNA / methods*

Substances

  • Bacterial Proteins
  • DNA, Bacterial