The generalization complexity measure for continuous input data

Iván Gómez; Sergio A Cannas; Omar Osenda; José M Jerez; Leonardo Franco

doi:10.1155/2014/815156

The generalization complexity measure for continuous input data

ScientificWorldJournal. 2014:2014:815156. doi: 10.1155/2014/815156. Epub 2014 Apr 10.

Authors

Iván Gómez¹, Sergio A Cannas², Omar Osenda², José M Jerez¹, Leonardo Franco¹

Affiliations

¹ Departamento de Lenguajes y Ciencias de la Computación, Universidad de Málaga, 29071 Málaga, Spain.
² Facultad de Matemática, Astronomía y Física, Universidad Nacional de Córdoba, 5000 Córdoba, Argentina.

Abstract

We introduce in this work an extension for the generalization complexity measure to continuous input data. The measure, originally defined in Boolean space, quantifies the complexity of data in relationship to the prediction accuracy that can be expected when using a supervised classifier like a neural network, SVM, and so forth. We first extend the original measure for its use with continuous functions to later on, using an approach based on the use of the set of Walsh functions, consider the case of having a finite number of data points (inputs/outputs pairs), that is, usually the practical case. Using a set of trigonometric functions a model that gives a relationship between the size of the hidden layer of a neural network and the complexity is constructed. Finally, we demonstrate the application of the introduced complexity measure, by using the generated model, to the problem of estimating an adequate neural network architecture for real-world data sets.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Humans
Models, Theoretical*