Identification of Culprit Genes for Different Diseases by Analyzing Microarray Data

Methods Mol Biol. 2024:2719:167-180. doi: 10.1007/978-1-0716-3461-5_10.

Abstract

The identification of disease-causing genes is the first and most important step toward understanding the biological mechanisms underlying a disease. Microarray analysis is one such powerful method that is widely used to identify genes that are expressed differently in two or more conditions (disease vs. normal). Because of its large library of statistical R packages and user-friendly interface, the R programming language provides a platform for microarray analysis. In this chapter, we will go over how to identify disease-causing culprit genes from the raw microarray data, using various packages of R programming. The pipeline overviews the steps in microarray analysis, such as data pre-processing, normalization, and statistical analysis using visualization techniques such as heatmaps, box plots, and so on. To better understand the function of the altered genes, gene ontology and pathway analysis are performed.

Keywords: Differentially expressed genes; Gene ontology; Microarray analysis; Pathway analysis; R-programming.

MeSH terms

  • Gene Expression Profiling* / methods
  • Gene Library
  • Oligonucleotide Array Sequence Analysis / methods
  • Programming Languages
  • Software*