Support vector regression applied to the determination of the developmental age of a Drosophila embryo from its segmentation gene expression patterns

Bioinformatics. 2002:18 Suppl 1:S87-95. doi: 10.1093/bioinformatics/18.suppl_1.s87.

Abstract

Motivation: In this paper we address the problem of the determination of developmental age of an embryo from its segmentation gene expression patterns in Drosophila.

Results: By applying support vector regression we have developed a fast method for automated staging of an embryo on the basis of its gene expression pattern. Support vector regression is a statistical method for creating regression functions of arbitrary type from a set of training data. The training set is composed of embryos for which the precise developmental age was determined by measuring the degree of membrane invagination. Testing the quality of regression on the training set showed good prediction accuracy. The optimal regression function was then used for the prediction of the gene expression based age of embryos in which the precise age has not been measured by membrane morphology. Moreover, we show that the same accuracy of prediction can be achieved when the dimensionality of the feature vector was reduced by applying factor analysis. The data reduction allowed us to avoid over-fitting and to increase the efficiency of the algorithm.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.
  • Validation Study

MeSH terms

  • Aging / genetics*
  • Algorithms
  • Animals
  • Artificial Intelligence*
  • Cluster Analysis
  • Computing Methodologies
  • Drosophila / embryology*
  • Drosophila / genetics*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Developmental / physiology*
  • Genes, Insect / genetics*
  • Pattern Recognition, Automated
  • Regression Analysis