DLRL Coursework

Deep Learning

For the Deep Learning sub-module, we were tasked with building a generative model that uses the CIFAR10 (32x32) and STL10 (96x96) image datasets to generate pictures of a pegasus.

Abstract from my report:

This paper proposes using a GLO model to generate images that look like a Pegasus. This model uses a basic generator architecture along with the Laplacian loss function, an informative loss equation which helps at a range of image resolutions. Then, we map each image to a point on the n-dimensional spherical latent space, acting as our probability distribution. Using this spherical distribution, we examine the latent space between horses and birds with the intention of generating a smooth interpolation of the two images.

To view the full paper please visit: https://julianwyatt.co.uk/DLRL

Comments:

I quickly learned, however, that this model could not produce high-quality samples because the space between image latents was not meaningful. For example, compare the high-quality reconstructions with the low-quality interpolations below:

Image	Info
	Best Image Reconstruction
	Best Image Interpolation
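
The interpolation on a spherical latent space described in the abstract can be sketched as follows. This is a minimal illustration, not code taken from pegasus-code.ipynb: the helper names `normalize` and `slerp` are hypothetical, and it assumes latents are plain lists of floats that get projected onto the unit sphere before interpolating along the great circle between them.

```python
import math


def normalize(v):
    """Project a latent vector onto the unit sphere (hypothetical helper)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot project the zero vector onto the sphere")
    return [x / norm for x in v]


def slerp(z0, z1, t):
    """Spherical linear interpolation between two latents, with t in [0, 1]."""
    z0, z1 = normalize(z0), normalize(z1)
    # Clamp the dot product so rounding error cannot push acos out of its domain.
    dot = max(-1.0, min(1.0, sum(a * b for a, b in zip(z0, z1))))
    omega = math.acos(dot)  # angle between the two latents
    if omega < 1e-8:
        return z0  # the latents coincide, so there is nothing to interpolate
    if math.pi - omega < 1e-8:
        raise ValueError("antipodal latents have no unique great-circle path")
    s = math.sin(omega)
    w0 = math.sin((1.0 - t) * omega) / s
    w1 = math.sin(t * omega) / s
    return [w0 * a + w1 * b for a, b in zip(z0, z1)]


# Example: walk from a (hypothetical) horse latent to a bird latent in 5 steps.
horse_latent = [2.0, 0.0, 0.0]
bird_latent = [0.0, 3.0, 0.0]
path = [slerp(horse_latent, bird_latent, i / 4) for i in range(5)]
```

Every point on `path` stays on the unit sphere, which a straight-line blend of the two latents would not guarantee. Staying on the sphere does not by itself make the intermediate latents decode to meaningful images, which is consistent with the poor interpolations noted above.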

Reinforcement Learning

For the Reinforcement Learning sub-module, we were tasked with developing an RL agent to play Atari Gravitar. Gravitar is complex, even for humans: the player must manoeuvre against gravity, visiting planets and collecting fuel, all while shooting other spaceships. The sparse, long-term reward structure makes the game notoriously difficult for agents to learn. I therefore looked towards sequential models such as LSTMs, as used in recent papers such as R2D2. Please see https://www.youtube.com/embed/oTNwsHRPOeM for my most intuitive gameplay sequence.
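
Recurrent agents in the style of R2D2 are trained on sequences of consecutive transitions rather than on isolated ones, so that the LSTM state can carry information across time steps. The sketch below shows one way an episode could be cut into overlapping fixed-length sequences for such a replay buffer. It is an illustration only and is not taken from gravitar-code.ipynb: the function name `split_into_sequences` and the parameters `seq_len` and `overlap` are assumptions.

```python
def split_into_sequences(episode, seq_len, overlap):
    """Cut one episode into overlapping windows of at most seq_len transitions.

    `episode` is a list of transitions (any objects). Windows never cross the
    end of the episode, so the final window may be shorter than seq_len.
    """
    if seq_len <= 0 or not 0 <= overlap < seq_len:
        raise ValueError("require seq_len > 0 and 0 <= overlap < seq_len")
    stride = seq_len - overlap
    sequences = []
    start = 0
    while start < len(episode):
        sequences.append(episode[start:start + seq_len])
        if start + seq_len >= len(episode):
            break  # this window already reaches the end of the episode
        start += stride
    return sequences


# Example: a 10-step episode, windows of 4 steps overlapping by 2.
episode = list(range(10))
windows = split_into_sequences(episode, seq_len=4, overlap=2)
```

Overlapping windows mean that a rare reward appears in more than one training sequence, which may help when rewards are as sparse as they are in Gravitar.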

