Mathematical foundations based statistical modeling of software source code for software system evolution

Math Biosci Eng. 2022 Feb 9;19(4):3701-3719. doi: 10.3934/mbe.2022170.

Abstract

Source code is the heart of the software systems; it holds a wealth of knowledge that can be tapped for intelligent software systems and leverage the possibilities of reuse of the software. In this work, exploration revolves around making use of the pattern hidden in various software development processes and artifacts. This module is part of the smart requirements management system that is intended to be built. This system will have multiple modules to make the software requirements management phase more secure from vulnerabilities. Some of the critical challenges bothering the software development community are discussed. The background of Machine Learning approaches and their application in software development practices are explored. Some of the work done around modeling the source code and approaches used for vulnerabilities understanding in software systems are reviewed. Program representation is explored to understand some of the principles that would help in understanding the subject well. Further deeper dive into source code modeling possibilities are explored. Machine learning best practices are explored inline with the software source code modeling.

Keywords: Bidirection encoders transformers; knowledge graphs; requirements management; software development; transfer learning.

MeSH terms

  • Machine Learning*
  • Software*