snake-reinforced My university project on reinforcement learning. Policy optimization. Video of best run