Genome/transcriptome collection of plethora of economically important, previously unexplored organisms from India and abroad

Data Brief. 2019 Jun 5:25:104099. doi: 10.1016/j.dib.2019.104099. eCollection 2019 Aug.

Abstract

Genome and transcriptome sequencing data are extremely useful resources for researchers carrying out biological experiments that involve cloning and characterizing genes. We present here genome sequence data from different clades of life, including photosynthetic prokaryotes, oomycete pathogens, probiotic bacteria, endophytic yeasts and a filamentous fungus, and the pathogenic protozoan Leishmania donovani. In addition, we present paired control and treated stress-response transcriptomes of Cyanobacteria growing in extreme conditions. The cyanobacterial species included in this dataset were isolated from extreme environments, including desiccated monuments, hot springs and saline archipelagos. The probiotic Lactobacillus paracasei was isolated from the Indian subcontinent. The dataset also includes the early infectious stage of the kala-azar-causing protozoan Leishmania donovani. The endophyte Arthrinium malaysianum, which was isolated as a contaminant, has significant bioremediation properties. Our collaborators isolated the endophyte Rhodotorula mucilaginosa JGTA1 from the uranium-contaminated Jaduguda mines, West Bengal, India. Phytophthora ramorum, a heterozygous diploid oomycete pathogen isolated by our collaborators that causes sudden oak death along the coast of California, USA, is also part of the data. This dataset presents a unique heterogeneous collection from various sources, analyzed using "Genome Annotator Light (GAL): A Docker-based package for genome analysis and visualization" (Panda et al., 2019) and presented in a web site automatically created by GAL at http://www.eumicrobedb.org/cglab.

Keywords: Annotation; Genome; Transcriptome.