TY - JOUR
T1 - HIGH PERFORMANCE NMF BASED INTRUSION DETECTION SYSTEM FOR BIG DATA IOT TRAFFIC
AU - Touzene, Abderezak
AU - Farsi, Ahmed Al
AU - Zeidi, Nasser Al
N1 - Publisher Copyright:
© (2024), (Academy and Industry Research Collaboration Center (AIRCC)). All rights reserved.
PY - 2024/3/29
Y1 - 2024/3/29
N2 - With the emergence of smart devices and the Internet of Things (IoT), millions of users connected to the network produce massive network traffic datasets. These vast datasets of network traffic, Big Data are challenging to store, deal with and analyse using a single computer. In this paper we developed parallel implementation using a High Performance Computer (HPC) for the Non-Negative Matrix Factorization technique as an engine for an Intrusion Detection System (HPC-NMF-IDS). The large IoT traffic datasets of order of millions samples are distributed evenly on all the computing cores for both storage and speedup purpose. The distribution of computing tasks involved in the Matrix Factorization takes into account the reduction of the communication cost between the computing cores. The experiments we conducted on the proposed HPC-IDS-NMF give better results than the traditional ML-based intrusion detection systems. We could train the HPC model with datasets of one million samples in only 31 seconds instead of the 40 minutes using one processor), that is a speed up of 87 times. Moreover, we have got an excellent detection accuracy rate of 98% for KDD dataset.
AB - With the emergence of smart devices and the Internet of Things (IoT), millions of users connected to the network produce massive network traffic datasets. These vast datasets of network traffic, Big Data are challenging to store, deal with and analyse using a single computer. In this paper we developed parallel implementation using a High Performance Computer (HPC) for the Non-Negative Matrix Factorization technique as an engine for an Intrusion Detection System (HPC-NMF-IDS). The large IoT traffic datasets of order of millions samples are distributed evenly on all the computing cores for both storage and speedup purpose. The distribution of computing tasks involved in the Matrix Factorization takes into account the reduction of the communication cost between the computing cores. The experiments we conducted on the proposed HPC-IDS-NMF give better results than the traditional ML-based intrusion detection systems. We could train the HPC model with datasets of one million samples in only 31 seconds instead of the 40 minutes using one processor), that is a speed up of 87 times. Moreover, we have got an excellent detection accuracy rate of 98% for KDD dataset.
KW - Dimensionality Reduction
KW - High Performance Computing
KW - Intrusion Detection Systems
KW - IoT traffic
KW - Machine Learning
UR - http://www.scopus.com/inward/record.url?scp=85190824757&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190824757&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/874bec83-26ed-394b-a477-f2ee1bf82889/
U2 - 10.5121/ijcnc.2024.16203
DO - 10.5121/ijcnc.2024.16203
M3 - Article
AN - SCOPUS:85190824757
SN - 0975-2293
VL - 16
SP - 43
EP - 58
JO - International Journal of Computer Networks and Communications
JF - International Journal of Computer Networks and Communications
IS - 2
ER -