This is an ETL pipeline to pull Turkish football events (red cards-goals etc.) from Turkish Football Federation website.
An example event document from MongoDB:
I used python to pull, transform and load data. Warehouse is MongoDB. All the components are running as docker containers.
- Docker and Docker Compose v1.27.0 or later.
- AWS account.
- AWS CLI installed and configured.
- git.
A Makefile exists with common commands. These are executed in the running container.
make up # starts all the containers
make ci # runs formatting, lint check, type check and python test
The remaining configs are available in the .env.dist file.
In production instances will run as containers. Thus, for MongoDB port 27017 must be available.
You can spin down your local instance with:
make down
This project inspired from Designing a Data Project to Impress Hiring Managers.