Sampling via the aggregation value for data-driven manufacturing

Xu Liu; Gengxiang Chen; Yingguang Li; Lu Chen; Qinglu Meng; Charyar Mehdi-Souzani

doi:10.1093/nsr/nwac201

Natl Sci Rev. 2022 Sep 24;9(11):nwac201. doi: 10.1093/nsr/nwac201. eCollection 2022 Nov.

Authors

Xu Liu¹, Gengxiang Chen², Yingguang Li², Lu Chen², Qinglu Meng², Charyar Mehdi-Souzani³

Affiliations

¹ School of Mechanical and Power Engineering, Nanjing Tech University, Nanjing 211816, China.
² College of Mechanical & Electrical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China.
³ University Research Laboratory in Automated Production, École normale supérieure Paris-Saclay, Université Paris-Saclay, Université Sorbonne Paris Nord, Gif-Sur-Yvette 91190, France.

Abstract

Data-driven modelling has shown promising potential in many industrial applications, but the expensive and time-consuming labelling of experimental and simulation data restricts its further development. Preparing a smaller but more informative dataset to reduce labelling effort has therefore become a vital research problem. Although existing techniques can assess the value of individual data samples, how to represent the value of a sample set remains an open problem. In this research, the aggregation value is defined as a novel representation of the value of a sample set, in which the invisible redundant information is modelled as the overlaps of neighbouring values. The sampling problem is hence converted to the maximisation of a submodular function over the aggregation value. A comprehensive analysis of several manufacturing datasets demonstrates that the proposed method provides sample sets with superior and stable performance compared with state-of-the-art methods. The results also indicate its potential to reduce labelling effort in more data-scarce scenarios.

Keywords: data sampling; data value; data-driven modelling; intelligent manufacturing.
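
The abstract casts sampling as the maximisation of a submodular set function in which redundancy appears as overlap between neighbouring samples. The sketch below illustrates that general idea only. It does not reproduce the paper's aggregation value, whose definition is not given in the abstract. As a stand-in it uses weighted coverage, a standard monotone submodular function, with a plain greedy selection loop. The names `coverage_value` and `greedy_select`, the toy points, the radius and the unit weights are all hypothetical choices made for illustration.

```python
from typing import Callable, List, Sequence, Set


def coverage_value(selected: Set[int],
                   neighbours: Sequence[Set[int]],
                   weights: Sequence[float]) -> float:
    """Stand-in set value (NOT the paper's aggregation value).

    Returns the total weight of all points covered by the selected samples.
    A point covered by several selected samples is counted once, so
    overlapping neighbourhoods contribute no extra value.
    """
    covered: Set[int] = set()
    for i in selected:
        covered |= neighbours[i]
    return sum(weights[j] for j in covered)


def greedy_select(n: int, budget: int,
                  set_value: Callable[[Set[int]], float]) -> List[int]:
    """Greedily add the sample with the largest marginal gain in set value."""
    selected: Set[int] = set()
    order: List[int] = []
    for _ in range(min(budget, n)):
        base = set_value(selected)
        best, best_gain = None, 0.0
        for i in range(n):
            if i in selected:
                continue
            gain = set_value(selected | {i}) - base
            # Ties are broken in favour of the lowest index.
            if best is None or gain > best_gain:
                best, best_gain = i, gain
        selected.add(best)
        order.append(best)
    return order


# Hypothetical toy data: six 1-D points forming two clusters and one outlier.
points = [0.0, 0.1, 0.2, 5.0, 5.1, 9.0]
radius = 0.25  # assumed neighbourhood radius
weights = [1.0] * len(points)  # assumed equal individual values
neighbours = [
    {j for j, q in enumerate(points) if abs(p - q) <= radius}
    for p in points
]

chosen = greedy_select(
    len(points), 3, lambda s: coverage_value(s, neighbours, weights)
)
total = coverage_value(set(chosen), neighbours, weights)
print(chosen, total)  # -> [0, 3, 5] 6.0
```

With a budget of three, the greedy loop picks one sample from each cluster plus the outlier, because a second sample from an already covered cluster has zero marginal gain. For monotone submodular functions under a cardinality constraint, this greedy strategy is known to achieve at least a (1 - 1/e) fraction of the optimal value.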