Classification and Clustering on Microarray Data for Gene Functional Prediction Using R

Methods Mol Biol. 2016:1375:41-54. doi: 10.1007/7651_2015_240.

Abstract

Gene expression data (from microarrays and RNA sequencing), as well as other kinds of genomic data, can be obtained from publicly available genomic databases. Here, we explain how to apply multivariate clustering and classification methods to gene expression data in order to predict the participation of gene products in a specific functional category of interest. These methods have become very popular and are implemented in freely available software. Given the availability of both the data and the methods, every biological study should apply them to gain knowledge about the organism studied and the functional category of interest. Special emphasis is placed on nonlinear kernel classification methods.

Keywords: Classification; Clustering; Functional prediction; Microarrays; Multivariate data analysis.

MeSH terms

  • Algorithms
  • Cluster Analysis*
  • Computational Biology / methods*
  • Databases, Genetic
  • Gene Expression Profiling / methods*
  • Genomics / methods*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Support Vector Machine