This repository implements three different algorithms for static list scheduling of Directed Acyclic Graphs (DAGs). The primary objective of task scheduling is to assign tasks to the available processors and define the order of the execution of tasks to minimize the overall completion time. Both the tasks as well as the available processors may be heterogenous. The preemptive scheduling problem is a well known NP complete problem and the heterogeneity of processors only adds another layer of complexity to finding the optimal solution. Static scheduling makes the assumption that information such as the execution and communication times between tasks in known beforehand.
The following algorithms have been implemented in this repository:
- Heterogenous Earliest FInish Time (HEFT): A well known DAG scheduling algorithm developed by H. Topcuoglu , M. Wu, et al. (2002) in this paper
- RandomHEFT: This algorithm is proposed by S. AlEbrahim and I. Ahmad (2017) in this paper. They do not name the algorithm as RandomHEFT, however, I use this name to refer to the algorithm proposed by them because of the key additions done to the vanilla HEFT algorithm.
- Improved Predict Earliest Finish Time (IPEFT): This algorithm is proposed by N.Zhou et al. (2017) in this paper as an improvement to the commonly used PEFT algorithm.
The work in this repository was done to primarily compare the performance of the three different algorithms.
Constructing new example DAGs requires the DAGGEN github repository. The code assumes that daggen.c
is inside folder /daggen-master/
.
Required python packages: pandas, numpy and multiprocessing.
To run the HEFT algorithm, provide the DAG definition as a .dot
file:
$ python heft.py -i test.dot
For randomHEFT:
$ python randomHEFT.py -i test.dot
For IPEFT:
$ python ipeft.py -i test.dot
generate_dags.py
: Used to generate new DAGs using DAGGEN with different parameters. The no. of tasks (n), FAT, density, regularity and jump can be set inside this file. All generated DAGs are saved inside the /dag folder with name convention: n_fat_density_regularity_jump.dotheft.py
: HEFT Scheduleripeft.py
: IPEFT SchedulerrandomHEFT.py
: randomHEFT Schedulerread_dag.py
: Parser of dot file generated by DAGGEN. Takes.dot
file and outputs an array[no. of tasks, no. of processors, communication matrix, adjacency matrix]
. Also accepts parameters of CCR, p and beta (heterogeneity of processors)main_parallel.py
: Main program that runs threads (equal to the number of cores on PC) where each thread solves a single .dot DAG file with the three algorithms for various values of CCR, p and beta. Saves the results into a pickle file called 'data.pkl' every 57 DAGs.Result Visualization.ipynb
: Jupyter notebook which provides some results on the performance of the three algorithm for different types of DAGs.Report.pdf
: Contains a well documented report on the comparison of the three different algorithms.
Below are some results showing the percentage improvement in Total Schedule Length when using the randomHEFT algorithm over the HEFT algorithm. Finally, below is the same percentage improvement for the IPEFT algorithm over the HEFT algorithm.