Correlation analysis of materials properties by machine learning: illustrated with stacking fault energy from first-principles calculations in dilute fcc-based alloys

J Phys Condens Matter. 2021 Jun 14;33(29). doi: 10.1088/1361-648X/ac0195.

Abstract

Advances in machine learning (ML), especially in the cooperation between ML predictions, density functional theory (DFT) based first-principles calculations, and experimental verification are emerging as a key part of a new paradigm to understand fundamentals, verify, analyze, and predict data, and design and discover materials. Taking stacking fault energy (γSFE) as an example, we perform a correlation analysis ofγSFEin dilute Al-, Ni-, and Pt-based alloys by descriptors and ML algorithms. TheseγSFEvalues were predicted by DFT-based alias shear deformation approach, and up to 49 elemental descriptors and 21 regression algorithms were examined. The present work indicates that (i) the variation ofγSFEaffected by alloying elements can be quantified through 14 elemental attributes based on their statistical significances to decrease the mean absolute error (MAE) in ML predictions, and in particular, the number of p valence electrons, a descriptor second only to the covalent radius in importance to model performance, is unexpected; (ii) the alloys with elements close to Ni and Co in the periodic table possess higherγSFEvalues; (iii) the top four outliers of DFT predictions ofγSFEare for the alloys of Al23La, Pt23Au, Ni23Co, and Al23Be based on the analyses of statistical differences between DFT and ML predictions; and (iv) the best ML model to predictγSFEis produced by Gaussian process regression with an average MAE < 8 mJ m-2. Beyond detailed analysis of the Al-, Ni-, and Pt-based alloys, we also predict theγSFEvalues using the present ML models in other fcc-based dilute alloys (i.e., Cu, Ag, Au, Rh, Pd, and Ir) with the expected MAE < 17 mJ m-2and observe similar effects of alloying elements onγSFEas those in Pt23X or Ni23X.

Keywords: dilute fcc-based alloys; first-principles calculations; machine learning; stacking fault energy.