Learned pseudo-random number generator: WGAN-GP for generating statistically robust random numbers

PLoS One. 2023 Jun 14;18(6):e0287025. doi: 10.1371/journal.pone.0287025. eCollection 2023.

Abstract

Pseudo-random number generators (PRNGs) are software algorithms that generate a sequence of numbers approximating the properties of truly random numbers. They are critical components in many information systems that require unpredictable and nonarbitrary behavior, such as parameter configuration in machine learning, gaming, cryptography, and simulation. A PRNG is commonly validated with a statistical test suite, such as NIST SP 800-22rev1a (the NIST test suite), to evaluate its robustness and the randomness of the numbers it produces. In this paper, we propose a Wasserstein distance-based generative adversarial network (WGAN) approach to generating PRNGs that fully satisfy the NIST test suite. In this approach, the existing Mersenne Twister (MT) PRNG is learned directly, without implementing any of its mathematics in program code. We remove the dropout layers from the conventional WGAN network so that the generator learns random numbers distributed over the entire feature space; the effectively unlimited supply of training data suppresses the overfitting that dropout layers would otherwise guard against. We evaluate our learned pseudo-random number generator (LPRNG) experimentally, adopting as seed numbers cosine-function-based sequences that have poor randomness properties according to the NIST test suite. The experimental results show that our LPRNG successfully converted the sequence of seed numbers into random numbers that fully satisfy the NIST test suite. This study opens the way for the "democratization" of PRNGs through the end-to-end learning of conventional PRNGs, meaning that PRNGs can be built without deep mathematical know-how. Such tailor-made PRNGs will effectively enhance the unpredictability and nonarbitrariness of a wide range of information systems, even if the seed numbers are revealed by reverse engineering. The experimental results also show that overfitting appeared after about 450,000 training iterations, suggesting that there is an upper limit to how long a fixed-size neural network can be usefully trained, even when learning from unlimited data.
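The following is a minimal sketch, not the authors' implementation, of the approach the abstract describes: a dropout-free WGAN with gradient penalty whose generator maps cosine-based seed vectors to pseudo-random outputs and whose critic compares them with samples from an existing PRNG. The layer sizes, seed construction, and training hyperparameters are illustrative assumptions.

    import torch
    import torch.nn as nn

    class Generator(nn.Module):
        def __init__(self, seed_dim=64, out_dim=64):
            super().__init__()
            # No dropout layers: fresh training batches can be drawn from the
            # target PRNG indefinitely, which suppresses overfitting.
            self.net = nn.Sequential(
                nn.Linear(seed_dim, 256), nn.ReLU(),
                nn.Linear(256, 256), nn.ReLU(),
                nn.Linear(256, out_dim), nn.Sigmoid(),  # outputs in [0, 1]
            )

        def forward(self, z):
            return self.net(z)

    class Critic(nn.Module):
        def __init__(self, in_dim=64):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(in_dim, 256), nn.ReLU(),
                nn.Linear(256, 256), nn.ReLU(),
                nn.Linear(256, 1),  # unbounded Wasserstein score
            )

        def forward(self, x):
            return self.net(x)

    def cosine_seeds(batch, dim, start=0):
        """Deterministic cosine-based seed vectors: a weak, non-random source."""
        n = torch.arange(start, start + batch * dim, dtype=torch.float32)
        return 0.5 * (torch.cos(n).reshape(batch, dim) + 1.0)  # rescaled to [0, 1]

    def gradient_penalty(critic, real, fake, lam=10.0):
        """Standard WGAN-GP penalty on interpolates between real and fake batches."""
        eps = torch.rand(real.size(0), 1)
        interp = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
        grads, = torch.autograd.grad(critic(interp).sum(), interp, create_graph=True)
        return lam * ((grads.norm(2, dim=1) - 1.0) ** 2).mean()

    # One critic/generator update; in practice this runs over many fresh batches.
    G, D = Generator(), Critic()
    opt_g = torch.optim.Adam(G.parameters(), lr=1e-4)
    opt_d = torch.optim.Adam(D.parameters(), lr=1e-4)

    real = torch.rand(128, 64)  # stand-in for samples from the target PRNG (e.g. MT)
    fake = G(cosine_seeds(128, 64)).detach()
    loss_d = D(fake).mean() - D(real).mean() + gradient_penalty(D, real, fake)
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    loss_g = -D(G(cosine_seeds(128, 64, start=128 * 64))).mean()
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()

In the paper, the generator's outputs would then be binarized and evaluated against the NIST SP 800-22 battery; that evaluation step is omitted from this sketch.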

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Engineering*
  • Machine Learning
  • Neural Networks, Computer

Grants and funding

The authors received no specific funding for this work.