A synthetic energy dataset for non-intrusive load monitoring in households

Sci Data. 2020 Apr 2;7(1):108. doi: 10.1038/s41597-020-0434-6.

Abstract

Research on smart grid technologies is expected to result in effective climate change mitigation. Non-Intrusive Load Monitoring (NILM) is seen as a key technique for enabling innovative smart-grid services. By breaking down the energy consumption of households and industrial facilities into its components, NILM techniques provide information on present appliances and can be applied to perform diagnostics. As with related Machine Learning problems, research and development requires a sufficient amount of data to train and validate new approaches. As a viable alternative to collecting datasets in buildings during expensive and time-consuming measurement campaigns, the idea of generating synthetic datasets for NILM gain momentum recently. With SynD, we present a synthetic energy dataset with focus on residential buildings. We release 180 days of synthetic power data on aggregate level (i.e. mains) and individual appliances. SynD is the result of a custom simulation process that relies on power traces of real household appliances. In addition, we present several case studies that demonstrate similarity of our dataset and four real-world energy datasets.