Metagenomic data of bacterial community from different land uses at the river basin, Kelantan

Data Brief. 2020 Sep 28:33:106351. doi: 10.1016/j.dib.2020.106351. eCollection 2020 Dec.

Abstract

The data provided in the article includes the sequence of bacterial 16S rRNA gene from a high conservation value forest, logged forest, rubber plantation and oil palm plantation collected at Kelantan river basin. The logged forest area was previously notified as a flooding region. The total gDNA of bacterial community was amplified via polymerase chain reaction at V3-V4 regions using a pair of specific universal primer. Amplicons were sequenced on Illumina HiSeq paired-end platform to generate 250 bp paired-end raw reads. Several bioinformatics tools such as FLASH, QIIME and UPARSE were used to process the reads generated for OTU analysis. Meanwhile, R&D software was used to construct the taxonomy tree for all samples. Raw data files are available at the Sequence Read Archive (SRA), NCBI and data information can be found at the BioProject and BioSample, NCBI. The data shows the comparison of bacterial community between the natural forest and different land uses.

Keywords: Clustering analysis; Kelantan river basin; Land-uses; Metagenomics; Taxonomy tree.