Enhanced Neural Network-Based Univariate Time-Series Forecasting Model for Big Data

Big Data. 2024 Apr;12(2):83-99. doi: 10.1089/big.2022.0155. Epub 2023 Feb 24.

Abstract

Big data is a combination of large structured, semistructured, and unstructured data sets collected from various sources, and it must be processed before it can be used in analytical applications. Anomalies or inconsistencies in big data are data points that are in some way unusual and do not fit the general pattern, and their presence is considered one of the major problems of big data. The data trust method (DTM) is a technique that identifies anomalous or untrustworthy data and replaces it using interpolation. This article discusses DTM as a preprocessing approach for neural network (NN)-based univariate time series (UTS) forecasting algorithms for big data. In this work, DTM combines a statistical-based method for detecting untrustworthy data with a statistical-based method for replacing it, and it is used to improve the forecast quality of UTS. An enhanced NN model for big data is proposed that incorporates DTMs into the NN-based UTS forecasting model. The coefficient of variation of the root mean squared error (CVRMSE) is used as the main indicator for choosing the best UTS data for model development. The results show that the proposed method is effective: it improves the prediction process by identifying and replacing untrustworthy big data.
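The preprocessing idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say which statistical rule detects untrustworthy points or which interpolation scheme replaces them, so the z-score rule, its threshold, and the linear interpolation below are assumptions made only for illustration, and all function names are hypothetical. The CVRMSE helper uses the common definition (RMSE divided by the mean of the observed values), which may differ in detail from the one used in the article.

```python
from math import sqrt
from statistics import mean, stdev


def detect_untrustworthy(series, z_threshold=2.0):
    """Flag points whose absolute z-score exceeds z_threshold.

    Assumption: a z-score rule stands in for the article's
    statistical detection method, which the abstract does not specify.
    """
    mu = mean(series)
    sigma = stdev(series)
    if sigma == 0:
        return [False] * len(series)
    return [abs(x - mu) / sigma > z_threshold for x in series]


def replace_by_interpolation(series, flags):
    """Replace flagged points by linear interpolation between the
    nearest trusted neighbors (assumed interpolation scheme)."""
    trusted = [i for i, flagged in enumerate(flags) if not flagged]
    if not trusted:
        raise ValueError("no trusted points to interpolate from")
    cleaned = list(series)
    for i, flagged in enumerate(flags):
        if not flagged:
            continue
        left = max((j for j in trusted if j < i), default=None)
        right = min((j for j in trusted if j > i), default=None)
        if left is None:
            # No trusted point before i: copy the next trusted value.
            cleaned[i] = series[right]
        elif right is None:
            # No trusted point after i: copy the previous trusted value.
            cleaned[i] = series[left]
        else:
            weight = (i - left) / (right - left)
            cleaned[i] = series[left] + weight * (series[right] - series[left])
    return cleaned


def cv_rmse(actual, predicted):
    """RMSE divided by the mean of the observed values."""
    rmse = sqrt(mean((a - p) ** 2 for a, p in zip(actual, predicted)))
    return rmse / mean(actual)


# Hypothetical series with one obvious spike at index 3.
raw = [10, 11, 12, 100, 14, 15, 16]
flags = detect_untrustworthy(raw, z_threshold=2.0)
cleaned = replace_by_interpolation(raw, flags)
```

In a pipeline of this kind, the cleaned series would be passed to the forecasting model, and `cv_rmse` could be used to compare forecasts built from the raw and cleaned data. Note that with a sample standard deviation, very short series cannot produce large z-scores, so the threshold has to be chosen with the series length in mind.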

Keywords: health care data; layer recurrent neural network; nonlinear autoregressive neural network; statistical measure-based data trust method.

MeSH terms

  • Algorithms
  • Big Data*
  • Forecasting
  • Neural Networks, Computer*
  • Time Factors