Topological learning in multiclass data sets

Christopher Griffin; Trevor Karn; Benjamin Apple

doi:10.1103/PhysRevE.109.024131

Phys Rev E. 2024 Feb;109(2-1):024131. doi: 10.1103/PhysRevE.109.024131.

Authors

Christopher Griffin¹, Trevor Karn², Benjamin Apple³

Affiliations

¹ Applied Research Laboratory, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
² School of Mathematics, University of Minnesota, Minneapolis, Minnesota 55455, USA.
³ Naval Surface Warfare Center Carderock, Bethesda, Maryland 20817, USA.

PMID: 38491638

Abstract

We specialize techniques from topological data analysis to the problem of characterizing the topological complexity (as defined in the body of the paper) of a multiclass data set. As a by-product, a topological classifier is defined that uses an open subcovering of the data set. This subcovering can be used to construct a simplicial complex whose topological features (e.g., Betti numbers) provide information about the classification problem. We use these topological constructs to study the impact of topological complexity on learning in feedforward deep neural networks (DNNs). We hypothesize that topological complexity is negatively correlated with the ability of a fully connected feedforward deep neural network to learn to classify data correctly. We evaluate our topological classification algorithm on multiple constructed and open-source data sets. We also validate our hypothesis regarding the relationship between topological complexity and learning in DNNs on multiple data sets.