Diagnosis using clinical/pathological and molecular information

Stat Methods Med Res. 2016 Dec;25(6):2878-2894. doi: 10.1177/0962280214534410. Epub 2014 May 11.

Abstract

In diagnosis and classification diseases multiple outcomes, both molecular and clinical/pathological are routinely gathered on patients. In recent years, many approaches have been suggested for integrating gene expression (continuous data) with clinical/pathological data (usually categorical and ordinal data). This new area of research integrates both clinical and genomic data in order to improve our knowledge about diseases, and to capture the information which is lost in independent clinical or genomic studies. The related metric scaling distance is a not well-known, but very valuable distance to integrate clinical/pathological and molecular information. In this article, we present the use of the related metric scaling distance in biomedical research. We describe how this distance works, and we also explain why it may sometimes be preferred. We discuss the choice of the related metric scaling distance and compare it with other proximity measures to include both clinical and genetic information. Furthermore, we comment the choice of the related metric scaling distance when classical clustering or discriminant analysis based on distances are performed and compare the results with more complex cluster or discriminant procedures specially constructed for integrating clinical and molecular information. The use of the related metric scaling distance is illustrated on simulated experimental and four real data sets, a heart disease, and three cancer studies. The results present the flexibility and availability of this distance which gives competitive results.

Keywords: clinical data; clinical/genomic integration data; diseases; distance matrix; gene expression; metric scaling; pathological information; related metric scaling distance.

MeSH terms

  • Biomedical Research
  • Cluster Analysis
  • Discriminant Analysis
  • Gene Expression Profiling
  • Heart Diseases / diagnosis*
  • Heart Diseases / genetics*
  • Heart Diseases / pathology
  • Humans
  • Neoplasms / diagnosis*
  • Neoplasms / genetics*
  • Neoplasms / pathology