Integrated statistical analysis of cDNA microarray and NIR spectroscopic data applied to a hemp dataset

J Bioinform Comput Biol. 2005 Aug;3(4):891-913. doi: 10.1142/s0219720005001363.

Abstract

Both cDNA microarray and spectroscopic data provide indirect information about the chemical compounds present in the biological tissue under consideration. In this paper simple univariate and bivariate measures are used to investigate correlations between both types of high dimensional analyses. A large dataset of 42 hemp samples on which 3456 cDNA clones and 351 NIR wavelengths have been measured, was analyzed using graphical representations. For this purpose we propose clustered correlation and clustered discrimination images. Large, tissue-related differences are seen to dominate the cDNA-NIR correlation structure but smaller, more difficult to detect, variety-related differences can be found at specific cDNA clone/NIR wavelength combinations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cannabis / genetics
  • Cannabis / metabolism*
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Discriminant Analysis
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Plant / physiology
  • Models, Biological
  • Oligonucleotide Array Sequence Analysis / methods*
  • Plant Proteins / genetics
  • Plant Proteins / metabolism*
  • Spectroscopy, Fourier Transform Infrared / methods*
  • Statistics as Topic
  • Systems Integration

Substances

  • Plant Proteins