Data-Driven Insights through Industrial Retrofitting: An Anonymized Dataset with Machine Learning Use Cases

Sensors (Basel). 2023 Jul 1;23(13):6078. doi: 10.3390/s23136078.

Abstract

Small and medium-sized enterprises (SMEs) often encounter practical challenges and limitations when extracting valuable insights from the data of retrofitted or brownfield equipment. The existing literature fails to reflect the full reality and potential of data-driven analysis in current SME environments. In this paper, we provide an anonymized dataset obtained from two medium-sized companies leveraging a non-invasive and scalable data-collection procedure. The dataset comprises mainly power consumption machine data collected over a period of 7 months and 1 year from two medium-sized companies. Using this dataset, we demonstrate how machine learning (ML) techniques can enable SMEs to extract useful information even in the short term, even from a small variety of data types. We develop several ML models to address various tasks, such as power consumption forecasting, item classification, next machine state prediction, and item production count forecasting. By providing this anonymized dataset and showcasing its application through various ML use cases, our paper aims to provide practical insights for SMEs seeking to leverage ML techniques with their limited data resources. The findings contribute to a better understanding of how ML can be effectively utilized in extracting actionable insights from limited datasets, offering valuable implications for SMEs in practical settings.

Keywords: benchmark dataset; industrial IoT; industry 4.0; machine learning; retrofit.

MeSH terms

  • Data Collection
  • Industry*
  • Machine Learning*

Grants and funding

This work has been partially funded by Programme Erasmus+, Knowledge Alliances, application no. 621639-EPP-1-2020-1-IT-EPPKA2-KA, PLANET4: Practical Learning of Artificial iNtelligence on the Edge for indusTry 4.0. This research is supported by the Ministry of University and Research (MUR) as part of the PON 2014–2020 “Research and Innovation” resources—Green/Innovation Action—DM MUR 1061/2022.