- Setting up the whole pipeline, which includes:
  - Setting up DVC to version the data and the model
  - Setting up a GitLab pipeline to automate execution
- Workflow
  - Training everything once and committing everything necessary
  - Replacing the ML model with a new one and training it
  - Replacing the data with more data and running it through the pipeline
- Going further ...
The project uses pipenv. To install the right Python version and the dependencies, run
$ pipenv install
For reference, the dependencies are:
- dvc
- dvc[s3], as we use AWS S3 for storage.
Then run
$ pipenv shell
to enter the interactive environment.
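
As a quick sanity check after the installation steps above, the sketch below verifies that the tools are reachable from the current shell. The helper name `check_tool` is hypothetical and not part of this project; the sketch only assumes a POSIX shell and that `dvc` lands on the PATH once the pipenv environment is active.

```shell
# check_tool is a hypothetical helper (not part of the project):
# it reports whether a command can be found on PATH.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "ok: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Run this inside the environment opened by `pipenv shell`.
check_tool pipenv || true

# Only ask DVC for its version if it is actually installed.
if check_tool dvc; then
    dvc version
fi
```

If `dvc` is reported as missing, the most likely cause is that the command was run outside the pipenv environment.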