SSCI - Explainable artificial intelligence for improving a session-based malware traffic classification with deep learning

In network security, applying deep learning methods to detect network traffic anomalies has achieved great results with various network traffic representations. A possible representation is the transformation of raw network communication to images to extract valuable information from the unmanageable amount of network traffic by applying representation learning. However, since deep learning models can result in black boxes for users, it is interesting to understand what valuable information is learned from network communication converted into images. This paper elaborates on that question using explainable artificial intelligence (XAI) methods to identify network packets that most influence the prediction and verify that packets in a malware communication containing malicious payloads have a higher influence on the prediction. We inspect the Grad-CAM and visualize the Integrated Gradients of the Xception and VGG-19 model and investigate the attention heat maps of our Vision Transformer (ViT) model. In addition, we present a novel transformation of sessions to a new image representation to expand the informativeness of network communication. For multiclass classification, our best model Xception achieves an accuracy of $97.95%$, whereas, for binary classification, Xception and VGG-19 achieve well above $99.50%$. Our ViT model achieves a significantly lower performance with $95.86%$ for multiclass and $99.36%$ for binary classification. In particular, computing centers could benefit by examining their inbound and outbound traffic to detect malicious behaviors ahead of time.

All trained models and data sets are available on heiBOX

Getting Started

Set up Conda environment:

conda create --name ssci python=3.10

Install requirements.txt

pip install -r requirements.txt

Start training:

python train_binary.py

Troubleshooting

For any issues related to CUDA, check out Install TensorFlow with pip guide.

for a in /sys/bus/pci/devices/*; do echo 0 | sudo tee -a $a/numa_node; done # https://github.com/tensorflow/tensorflow/issues/42738


export XLA_FLAGS=--xla_gpu_cuda_data_dir=/usr/lib/cuda

Other related issue.

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
examples		examples
explaining		explaining
figures		figures
training_logs		training_logs
.gitignore		.gitignore
README.md		README.md
config.py		config.py
datasets.py		datasets.py
evaluate.py		evaluate.py
job_cnn.slurm		job_cnn.slurm
job_vit.slurm		job_vit.slurm
metrics.py		metrics.py
models.py		models.py
new_model.py		new_model.py
plots.py		plots.py
requirements.txt		requirements.txt
run_models.sh		run_models.sh
train_cnn.py		train_cnn.py
train_vit.py		train_vit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SSCI - Explainable artificial intelligence for improving a session-based malware traffic classification with deep learning

Getting Started

Troubleshooting

About

Languages

EMCL-Research-ITSecLab/ssci23-xai-network-traffic

Folders and files

Latest commit

History

Repository files navigation

SSCI - Explainable artificial intelligence for improving a session-based malware traffic classification with deep learning

Getting Started

Troubleshooting

About

Topics

Resources

Stars

Watchers

Forks

Languages