Hangman with RL

This repository contains the code for a quick fun little project I coded trying to solve the game of hangman using Reinforcement Learning

Dataset

For the dataset I used a file downloaded from here which should contain all words in English (I've not checked)

State

In order to simplicy the problem, since the hangman is based on such dataset, I just went and check the longest word, and considered the shorter ones with additional padding at the end.

At that point, the number of possible letters is fixed, the length of the words is fixed, so the final state it's just the concatenation of those 2 infos + how many lives are left to the agent.

Action

For the action, since the set of possible letters is fixed, I just outputed a distribution over list, masked out the ones already tried, and re-normalized

Algorithm

For the learning I used PPO implemented from scratch using TD(0)

Result

The learning is successful and rewards can be checked at the end of the ipynb notebook

Possible improvement

Consider the prior of humans in the sampling of the words (estimated from a big corpora of text?)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
README.md		README.md
dataset.txt		dataset.txt
hangman.ipynb		hangman.ipynb
hangman.py		hangman.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hangman with RL

Dataset

State

Action

Algorithm

Result

Possible improvement

About

Releases

Packages

Languages

AlbertoSinigaglia/hangman-reinforcement-learning

Folders and files

Latest commit

History

Repository files navigation

Hangman with RL

Dataset

State

Action

Algorithm

Result

Possible improvement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages