Gene expression profiling of colorectal tumors and normal mucosa by microarrays meta-analysis using prediction analysis of microarray, artificial neural network, classification, and regression trees

Dis Markers. 2014:2014:634123. doi: 10.1155/2014/634123. Epub 2014 May 19.

Abstract

Background: Microarray technology shows great potential but previous studies were limited by small number of samples in the colorectal cancer (CRC) research. The aims of this study are to investigate gene expression profile of CRCs by pooling cDNA microarrays using PAM, ANN, and decision trees (CART and C5.0).

Methods: Pooled 16 datasets contained 88 normal mucosal tissues and 1186 CRCs. PAM was performed to identify significant expressed genes in CRCs and models of PAM, ANN, CART, and C5.0 were constructed for screening candidate genes via ranking gene order of significances.

Results: The first screening identified 55 genes. The test accuracy of each model was over 0.97 averagely. Less than eight genes achieve excellent classification accuracy. Combining the results of four models, we found the top eight differential genes in CRCs; suppressor genes, CA7, SPIB, GUCA2B, AQP8, IL6R and CWH43; oncogenes, SPP1 and TCN1. Genes of higher significances showed lower variation in rank ordering by different methods.

Conclusion: We adopted a two-tier genetic screen, which not only reduced the number of candidate genes but also yielded good accuracy (nearly 100%). This method can be applied to future studies. Among the top eight genes, CA7, TCN1, and CWH43 have not been reported to be related to CRC.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Colon / metabolism*
  • Colon / pathology
  • Colorectal Neoplasms / genetics
  • Colorectal Neoplasms / metabolism*
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Gene Ontology
  • Humans
  • Intestinal Mucosa / metabolism*
  • Neural Networks, Computer
  • Oligonucleotide Array Sequence Analysis
  • Regression Analysis
  • Transcriptome*