MIND: A Multi-Source Data Fusion Scheme for Intrusion Detection in Networks

Naveed Anjum; Zohaib Latif; Choonhwa Lee; Ijaz Ali Shoukat; Umer Iqbal

doi:10.3390/s21144941

MIND: A Multi-Source Data Fusion Scheme for Intrusion Detection in Networks

Sensors (Basel). 2021 Jul 20;21(14):4941. doi: 10.3390/s21144941.

Authors

Naveed Anjum¹, Zohaib Latif², Choonhwa Lee², Ijaz Ali Shoukat¹, Umer Iqbal¹

Affiliations

¹ Department of Computing, Riphah International University, Faisalabad 38000, Pakistan.
² Department of Computer Science, Hanyang University, Seoul 04763, Korea.

Abstract

In recent years, there is an exponential explosion of data generation, collection, and processing in computer networks. With this expansion of data, network attacks have also become a congenital problem in complex networks. The resource utilization, complexity, and false alarm rates are major challenges in current Network Intrusion Detection Systems (NIDS). The data fusion technique is an emerging technology that merges data from multiple sources to form more certain, precise, informative, and accurate data. Moreover, most of the earlier intrusion detection models suffer from overfitting problems and lack optimal detection of intrusions. In this paper, we propose a multi-source data fusion scheme for intrusion detection in networks (MIND) , where data fusion is performed by the horizontal emergence of two datasets. For this purpose, the Hadoop MapReduce tool such as, Hive is used. In addition, a machine learning ensemble classifier is used for the fused dataset with fewer parameters. Finally, the proposed model is evaluated with a 10-fold-cross validation technique. The experiments show that the average accuracy, detection rate, false positive rate, true positive rate, and F-measure are 99.80%, 99.80%, 0.29%, 99.85%, and 99.82% respectively. Moreover, the results indicate that the proposed model is significantly effective in intrusion detection compared to other state-of-the-art methods.

Keywords: anomaly detection; data fusion; ensemble learning; machine learning; network intrusion detection systems.

MeSH terms

Algorithms*
Machine Learning*

Grants and funding

(No.2020R1A2B5B01001758)/National Research Foundation of Korea (NRF) grant 487 funded by the Korea government (MSIT)