diff --git a/README.md b/README.md index d17ea52..a15595f 100644 --- a/README.md +++ b/README.md @@ -92,7 +92,7 @@ We train and compare KAN-GPT with an equivalent MLP-GPT model on the Tiny Shakes | Metrics | | | |---------|---------|---------| -| | | | +| ![results_loss](media/results_loss.png) | ![results_cross_entropy](media/results_cross_entropy.png) | ![results_perplexity](media/results_perplexity.png) | ## TODOs diff --git a/docs/media/results_cross_entropy.png b/docs/media/results_cross_entropy.png new file mode 100644 index 0000000..5ad70d3 Binary files /dev/null and b/docs/media/results_cross_entropy.png differ diff --git a/docs/media/results_loss.png b/docs/media/results_loss.png new file mode 100644 index 0000000..1351e1f Binary files /dev/null and b/docs/media/results_loss.png differ diff --git a/docs/media/results_perplexity.png b/docs/media/results_perplexity.png new file mode 100644 index 0000000..4e9d1fc Binary files /dev/null and b/docs/media/results_perplexity.png differ diff --git a/docs/results.md b/docs/results.md new file mode 100644 index 0000000..90b2d28 --- /dev/null +++ b/docs/results.md @@ -0,0 +1,10 @@ +# Results + +We train and compare KAN-GPT with an equivalent MLP-GPT model on the Tiny Shakespeare dataset. We observe that the KAN-GPT performs slightly better than the MLP-GPT. We are looking into further experiments to dive deeper. The results are shown below: + + +## Metrics + +| Metrics | | | +|---------|---------|---------| +| ![results_loss](media/results_loss.png) | ![results_cross_entropy](media/results_cross_entropy.png) | ![results_perplexity](media/results_perplexity.png) | diff --git a/mkdocs.yml b/mkdocs.yml index d0a1544..2284937 100644 --- a/mkdocs.yml +++ b/mkdocs.yml @@ -1,2 +1,5 @@ site_name: kan_gpt theme: readthedocs +nav: + - 'index.md' + - 'results.md'