ISOdb: A Comprehensive Database of Full-Length Isoforms Generated by Iso-Seq

Shang-Qian Xie; Yue Han; Xiao-Zhou Chen; Tai-Yu Cao; Kai-Kai Ji; Jie Zhu; Peng Ling; Chuan-Le Xiao

doi:10.1155/2018/9207637

Int J Genomics. 2018 Nov 19;2018:9207637. doi: 10.1155/2018/9207637. eCollection 2018.

Authors

Shang-Qian Xie¹, Yue Han², Xiao-Zhou Chen³, Tai-Yu Cao¹, Kai-Kai Ji¹, Jie Zhu¹, Peng Ling¹, Chuan-Le Xiao²

Affiliations

¹ Research Center for Terrestrial Biodiversity of the South China Sea, Institute of Tropical Agriculture and Forestry, Hainan University, Haikou 570228, China.
² State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou 510060, China.
³ School of Mathematics and Computer Science, Yunnan Minzu University, Kunming 650031, China.

Abstract

An accurate landscape of transcript isoforms is important for understanding gene function and gene regulation. However, reconstructing complete transcripts from the short reads generated by next-generation sequencing is very challenging. Isoform sequencing (Iso-Seq) using single-molecule sequencing technologies, such as PacBio SMRT, provides long reads that span entire transcript isoforms and therefore do not require assembly. We have therefore developed ISOdb, a comprehensive resource database for hosting Iso-Seq datasets, analysing them in depth, and visualising full-length transcript isoforms. The current version of ISOdb contains 93 publicly available Iso-Seq samples from eight species and presents them at two levels: (1) the sample level, including meta-information, long-read distribution, isoform numbers, and alternative splicing (AS) events for each sample; and (2) the gene level, including the total number of isoforms, the number of novel isoforms, the number of novel AS events, and isoform visualisation for each gene. In addition, the ISOdb website provides a user interface for uploading sample information, to facilitate the collection and analysis of researchers' datasets. Currently, ISOdb is the first freely available repository that offers comprehensive resources and convenient public access for hosting, analysing, and visualising Iso-Seq data.
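
The two-level presentation described above can be pictured as a nested record: each sample holds its own summary fields plus a list of per-gene summaries. The sketch below is purely illustrative and is not ISOdb's actual schema. Every class name, field name, identifier, and value in it is a hypothetical chosen for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class GeneRecord:
    """Gene-level summary (hypothetical field names)."""
    gene_id: str
    total_isoforms: int
    novel_isoforms: int
    novel_as_events: int


@dataclass
class SampleRecord:
    """Sample-level summary (hypothetical field names)."""
    sample_id: str
    species: str
    read_lengths: List[int] = field(default_factory=list)   # long-read lengths
    as_events: Dict[str, int] = field(default_factory=dict)  # AS type -> count
    genes: List[GeneRecord] = field(default_factory=list)

    def isoform_count(self) -> int:
        # Total isoforms in the sample, summed over its genes.
        return sum(g.total_isoforms for g in self.genes)

    def novel_fraction(self) -> float:
        # Share of isoforms flagged as novel; 0.0 for an empty sample.
        total = self.isoform_count()
        if total == 0:
            return 0.0
        return sum(g.novel_isoforms for g in self.genes) / total


# Made-up placeholder data, not taken from ISOdb.
sample = SampleRecord(
    sample_id="SAMPLE_1",
    species="example species",
    read_lengths=[1200, 2500, 3100],
    as_events={"exon_skipping": 2},
    genes=[
        GeneRecord("GENE_A", total_isoforms=3, novel_isoforms=1, novel_as_events=2),
        GeneRecord("GENE_B", total_isoforms=2, novel_isoforms=0, novel_as_events=0),
    ],
)
print(sample.isoform_count(), sample.novel_fraction())
```

Keeping the gene-level records nested inside the sample record mirrors the abstract's description, in which gene-level figures are reported per sample. A real database would more likely store these as separate tables linked by a sample identifier.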