Transcriptome Mining to Identify Genes of Interest: From Local Databases to Phylogenetic Inference

Daniele De Luca; Chiara Lauritano

doi:10.1007/978-1-0716-2313-8_3

Transcriptome Mining to Identify Genes of Interest: From Local Databases to Phylogenetic Inference

Methods Mol Biol. 2022:2498:43-51. doi: 10.1007/978-1-0716-2313-8_3.

Authors

Daniele De Luca¹, Chiara Lauritano²

Affiliations

¹ Department of Biology, University of Naples Federico II, Botanic Garden of Naples, Naples, Italy. daniele.deluca@unina.it.
² Department of Ecosustainable Marine Biotechnology, Stazione Zoologica Anton Dohrn, Naples, Italy. chiara.lauritano@szn.it.

PMID: 35727539
DOI: 10.1007/978-1-0716-2313-8_3

Abstract

The advancement in next-generation sequencing technologies and the dropping of sequencing costs have seen an increase in the amount of transcriptome data generated each year. These data are of big potential for identifying genes and molecular pathways of interest across a plethora of organisms. However, navigating these resources requires some bioinformatics and evolutionary skills. Here, we describe a protocol of transcriptome data mining for genes of interest, from the creation of a protein database to the inference of phylogenetic trees, which was used for marine protists, but can be used as general pipeline across different taxa.

Keywords: Bioinformatics; Biosynthetic pathways; Genes of interest; Local databases; Marine protists; Phylogenies; Sequence alignment; Transcriptome mining.

MeSH terms

Computational Biology / methods
Data Mining / methods
High-Throughput Nucleotide Sequencing*
Phylogeny
Transcriptome*