CF-Seq, an accessible web application for rapid re-analysis of cystic fibrosis pathogen RNA sequencing studies

Sci Data. 2022 Jun 16;9(1):343. doi: 10.1038/s41597-022-01431-1.

Abstract

Researchers studying cystic fibrosis (CF) pathogens have produced numerous RNA-seq datasets which are available in the gene expression omnibus (GEO). Although these studies are publicly available, substantial computational expertise and manual effort are required to compare similar studies, visualize gene expression patterns within studies, and use published data to generate new experimental hypotheses. Furthermore, it is difficult to filter available studies by domain-relevant attributes such as strain, treatment, or media, or for a researcher to assess how a specific gene responds to various experimental conditions across studies. To reduce these barriers to data re-analysis, we have developed an R Shiny application called CF-Seq, which works with a compendium of 128 studies and 1,322 individual samples from 13 clinically relevant CF pathogens. The application allows users to filter studies by experimental factors and to view complex differential gene expression analyses at the click of a button. Here we present a series of use cases that demonstrate the application is a useful and efficient tool for new hypothesis generation. (CF-Seq: http://scangeo.dartmouth.edu/CFSeq/ ).

Publication types

  • Dataset

MeSH terms

  • Cystic Fibrosis* / genetics
  • Data Analysis
  • Humans
  • RNA-Seq
  • Sequence Analysis, RNA*
  • Software