From ArrayExpress to BioStudies

Nucleic Acids Res. 2021 Jan 8;49(D1):D1502-D1506. doi: 10.1093/nar/gkaa1062.

Abstract

ArrayExpress (https://www.ebi.ac.uk/arrayexpress) is an archive of functional genomics data at EMBL-EBI, established in 2002, initially as an archive for publication-related microarray data and was later extended to accept sequencing-based data. Over the last decade an increasing share of biological experiments involve multiple technologies assaying different biological modalities, such as epigenetics, and RNA and protein expression, and thus the BioStudies database (https://www.ebi.ac.uk/biostudies) was established to deal with such multimodal data. Its central concept is a study, which typically is associated with a publication. BioStudies stores metadata describing the study, provides links to the relevant databases, such as European Nucleotide Archive (ENA), as well as hosts the types of data for which specialized databases do not exist. With BioStudies now fully functional, we are able to further harmonize the archival data infrastructure at EMBL-EBI, and ArrayExpress is being migrated to BioStudies. In future, all functional genomics data will be archived at BioStudies. The process will be seamless for the users, who will continue to submit data using the online tool Annotare and will be able to query and download data largely in the same manner as before. Nevertheless, some technical aspects, particularly programmatic access, will change. This update guides the users through these changes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cell Line
  • DNA Methylation
  • Databases, Genetic*
  • Epigenesis, Genetic*
  • Gene Expression Profiling
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing / statistics & numerical data*
  • Humans
  • Internet
  • Metadata
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Organ Specificity
  • Plants / genetics
  • Single-Cell Analysis
  • Software