Deep-Reinforcement-Learning-Based IoT Sensor Data Cleaning Framework for Enhanced Data Analytics

Alaelddin F Y Mohammed; Salman Md Sultan; Joohyung Lee; Sunhwan Lim

doi:10.3390/s23041791

Deep-Reinforcement-Learning-Based IoT Sensor Data Cleaning Framework for Enhanced Data Analytics

Sensors (Basel). 2023 Feb 5;23(4):1791. doi: 10.3390/s23041791.

Authors

Alaelddin F Y Mohammed¹, Salman Md Sultan², Joohyung Lee¹, Sunhwan Lim³

Affiliations

¹ School of Computing, Gachon University, Seongnam 13120, Republic of Korea.
² European IT Solutions Institute, Dhaka 1216, Bangladesh.
³ Autonomous IoT Research Section, ETRI, Electronics and Telecommunications Research Institute, Daejeon 34129, Republic of Korea.

Abstract

The Internet of things (IoT) combines different sources of collected data which are processed and analyzed to support smart city applications. Machine learning and deep learning algorithms play a vital role in edge intelligence by minimizing the amount of irrelevant data collected from multiple sources to facilitate these smart city applications. However, the data collected by IoT sensors can often be noisy, redundant, and even empty, which can negatively impact the performance of these algorithms. To address this issue, it is essential to develop effective methods for detecting and eliminating irrelevant data to improve the performance of intelligent IoT applications. One approach to achieving this goal is using data cleaning techniques, which can help identify and remove noisy, redundant, or empty data from the collected sensor data. This paper proposes a deep reinforcement learning (deep RL) framework for IoT sensor data cleaning. The proposed system utilizes a deep Q-network (DQN) agent to classify sensor data into three categories: empty, garbage, and normal. The DQN agent receives input from three received signal strength (RSS) values, indicating the current and two previous sensor data points, and receives reward feedback based on its predicted actions. Our experiments demonstrate that the proposed system outperforms a common time-series-based fully connected neural network (FCDQN) solution, with an accuracy of around 96% after the exploration mode. The use of deep RL for IoT sensor data cleaning is significant because it has the potential to improve the performance of intelligent IoT applications by eliminating irrelevant and harmful data.

Keywords: DQN; IoT; data cleaning; edge intelligence.

Grants and funding

Institute of Information \& communications Technology Planning \& Evaluation (IITP) grant/Korea government (MSIT)(2022-0-01032, Development of Collective Collaboration Intelligence Framework for Internet of Autonomous Things)