A collection of Jupyter notebooks with examples of usage of the cesnet-datazoo
and cesnet-models
packages.
🐸 CESNET DataZoo
🧠 CESNET Models
The following notebooks are available:
explore_data.ipynb
- Simple initialization of a dataset class to explore available features.example_evaluation.ipynb
- Training of a LightGBM classifier and its evaluation on a per-week and per-day basis.reproduce_tls.ipynb
- Use a pre-trained model from thecesnet-models
package to reproduce the results of the "Fine-grained TLS services classification with reject option" paper.reproduce_quic.ipynb
- Use a pre-trained model from thecesnet-models
package to reproduce the results of the "Encrypted traffic classification: the QUIC case" paper.example_train_nn.ipynb
- Training of a neural network from scratch. Thecesnet-datazoo
package provides a dataset, which is split into the train, validation, and test sets. Thecesnet-models
package provides the neural network architecture and data transformations.month_evaluation_cesnet_tls_year22.ipynb
- Training and per-month evaluation of a LightGBM model using the CESNET-TLS-Year22 dataset.
The dependencies are installed in the first cell of each notebook. Alternatively, the requirements.txt
file is also provided. PyTorch with CUDA 11.8 support should be installed with the following command (more info here):
python -m pip install torch>=1.10 --index-url https://download.pytorch.org/whl/cu124