A Short Review on Minimum Description Length: An Application to Dimension Reduction in PCA

Entropy (Basel). 2022 Feb 13;24(2):269. doi: 10.3390/e24020269.

Abstract

The minimun description length (MDL) is a powerful criterion for model selection that is gaining increasing interest from both theorists and practicioners. It allows for automatic selection of the best model for representing data without having a priori information about them. It simply uses both data and model complexity, selecting the model that provides the least coding length among a predefined set of models. In this paper, we briefly review the basic ideas underlying the MDL criterion and its applications in different fields, with particular reference to the dimension reduction problem. As an example, the role of MDL in the selection of the best principal components in the well known PCA is investigated.

Keywords: classification; dimension reduction; features extraction; minimum description length; principal component analysis.

Publication types

  • Review