ML Model Deployment Options

This repo demonstrates an end-to-end, architecturally correct solution for deploying a TensorFlow (Keras) model in two ways: on a backend behind a queue, and entirely in the frontend with TensorFlow.js.

It accompanies my Medium article (find it at https://medium.com/@tomgrek).

An example deployment can be found at https://exploitip.com.

The repo contains:

  1. An example backend Python application for queuing machine learning jobs received from a web app, and also serving up their progress and results to said web app.

  2. An example worker, also in Python and also running on the backend, that loads an RNN model exported from TensorFlow and processes jobs from the queue in order. (A minimal sketch of how this worker and the web app from item 1 fit together appears just after this list.)

  3. The model (stored as model.json and group1-shard1of1), exported from TensorFlow in a TensorFlow.js-compatible format. It's a simple model based entirely on Google's TensorFlow RNN Nietzsche text generation example, and it can be run at both the front end and the back end! (Note I barely trained the model - two or three epochs only - since training it well is not the point of this repo. If you are looking for excellent Nietzsche-like text generation, look away!)

  4. A sample front end web application showing how to run the model at the front end, and how to interact with the back end.

  5. A Jupyter notebook showing how the model and character mappings were created, trained, and exported. (A condensed sketch of the export step also appears below.)
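
For orientation, here is a minimal sketch of how items 1 and 2 fit together. It assumes MLQ's documented basics (an MLQ(namespace, host, port, db) constructor, post() for enqueuing, create_listener() for registering a worker function, and a progress/result lookup); the route names, model path, and helper names are illustrative only, not the repo's actual ones. See server/ and worker/ for the real code.

```python
# Minimal sketch only -- see server/ and worker/ for the real implementations.
# MLQ calls are assumed from its docs; routes, paths, and names are made up here.

# --- app.py-style web server: accepts jobs and reports their progress/results ---
from flask import Flask, jsonify, request
from mlq.queue import MLQ

app = Flask(__name__)
mlq = MLQ('nietzsche_demo', 'localhost', 6379, 0)  # namespace, Redis host, port, db

@app.route('/api/generate', methods=['POST'])
def enqueue_job():
    seed = request.json.get('seed', '')
    job_id = mlq.post({'seed': seed})  # push the job onto the Redis-backed queue
    return jsonify({'job_id': job_id})

@app.route('/api/status/<job_id>')
def job_status(job_id):
    # Assumed progress/result lookup; check server/ for the exact MLQ call the repo uses.
    return jsonify(mlq.get_progress(job_id))

# --- worker.py-style consumer: loads the RNN once, then processes queued jobs in order ---
from tensorflow import keras

model = keras.models.load_model('model.h5')  # hypothetical path to the trained Keras model

def process_job(msg, *args):
    # msg is the dict posted by the web server. The real worker runs the RNN sampling
    # loop here; a placeholder result keeps this sketch self-contained.
    return {'generated': 'sampled text for seed: ' + msg['seed']}

worker_mlq = MLQ('nietzsche_demo', 'localhost', 6379, 0)
worker_mlq.create_listener(process_job)  # register the consumer; see worker/ for how it's kept running
```

The key design point is that the frontend only ever talks to the /api routes; the model itself stays behind the queue, so slow jobs never block the web server.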
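
And a condensed, hypothetical version of the export step from item 5's notebook, just to show where model.json and group1-shard1of1 come from. Layer sizes, paths, and the training itself are illustrative, not the notebook's actual values.

```python
import tensorflowjs as tfjs
from tensorflow import keras

# Tiny character-level RNN in the spirit of the Nietzsche example; sizes are illustrative.
VOCAB_SIZE = 60  # number of distinct characters in the corpus (assumed)
SEQ_LEN = 40     # characters of context fed to the RNN (assumed)

model = keras.Sequential([
    keras.layers.LSTM(128, input_shape=(SEQ_LEN, VOCAB_SIZE)),
    keras.layers.Dense(VOCAB_SIZE, activation='softmax'),
])
model.compile(loss='categorical_crossentropy', optimizer='adam')

# ... train for a few epochs on one-hot encoded Nietzsche text ...

# Export in the TensorFlow.js layers format: writes model.json plus weight shards
# such as group1-shard1of1, which the frontend can load directly.
tfjs.converters.save_keras_model(model, 'exported_model/')
```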

Requirements

Backend

  • A running Redis instance
  • pip install mlq (MLQ is a queuing system for ML jobs)
  • Python 3.6+

Frontend

  • yarn
  • yarn install from the root directory
  • yarn watch lets you develop the frontend locally

Then

  • ./deploy.sh will build and upload to a server such as an AWS t2.micro or a Linode.
  • python worker.py to run a worker instance. Run as many of these as your server's resources/cores/processes/threads allow; they all need to be able to connect to the same Redis instance.
  • FLASK_APP=app.py python -m flask run --host=127.0.0.1 --port=5001 to run the server.
  • Configure nginx to serve up the static assets and proxy_pass /api to localhost:5001 (i.e., the server). A sample server block follows this list.
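
For that last step, a minimal nginx server block might look like the following; the domain and asset path are assumptions, and your deployment may differ.

```nginx
server {
    listen 80;
    server_name example.com;            # assumed domain

    root /var/www/ml-deployment-demo;   # assumed path to the built frontend assets
    index index.html;

    location /api {
        proxy_pass http://127.0.0.1:5001;  # forward /api requests to the Flask server
    }
}
```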

Hopefully you know this but

If you're SSH'd into a server and setting all this up, and your SSH connection drops or you log out, the Python apps will stop running. Look into screen or tmux as simple ways to keep them running.

Notes

This is a bare-bones, minimal example. See the README files in server/ and worker/ for further details. For a production-hardened deployment you would probably want extra resiliency, better logging, stronger security and so on.