Self-hostable service that converts an ONNX model to a TensorRT engine.
Server deployment is mandatory for running this app. The fullstack app does not need to be deployed.
- Create a Modal account. Low usage should fit within the $30 monthly free tier.
- Create two Cloudflare R2 buckets (one for ONNX models, one for TensorRT engines) and an access key for them.
- Create a Modal secret:
ONNX_BUCKET=
TRT_BUCKET=
S3_ENDPOINT=
S3_KEY_ID=
S3_KEY=
REGION_NAME=auto
- Create a .env file in the project root for the Modal backend deployment:
APP_NAME=
MODAL_WORKSPACE=
- Install the Python dependencies and deploy:
poetry install
pnpm run deploy:modal
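
Modal exposes a secret's keys as environment variables inside the deployed function, so the values from the Modal secret step can be checked before relying on them. The sketch below is not the project's actual code. It assumes the backend talks to R2 through boto3's S3 client, and the helper names `r2_client_kwargs` and `make_client` are hypothetical. It maps the secret's variable names onto the keyword arguments `boto3.client` accepts and fails early if any value is missing.

```python
import os

# Variable names as listed in the Modal secret step of this README.
REQUIRED_VARS = ("ONNX_BUCKET", "TRT_BUCKET", "S3_ENDPOINT", "S3_KEY_ID", "S3_KEY")


def r2_client_kwargs(env=None):
    """Build keyword arguments for boto3.client("s3", ...) from the secret's values.

    Hypothetical helper: raises KeyError naming every missing or empty variable.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise KeyError("missing secret values: " + ", ".join(missing))
    return {
        "endpoint_url": env["S3_ENDPOINT"],
        "aws_access_key_id": env["S3_KEY_ID"],
        "aws_secret_access_key": env["S3_KEY"],
        # The README sets REGION_NAME=auto; fall back to that if it is unset.
        "region_name": env.get("REGION_NAME") or "auto",
    }


def make_client(env=None):
    """Create an S3 client pointed at R2 (assumes boto3 is installed)."""
    import boto3  # imported lazily so the check above works without boto3

    return boto3.client("s3", **r2_client_kwargs(env))
```

Calling `make_client().head_bucket(Bucket=os.environ["ONNX_BUCKET"])` (and the same for `TRT_BUCKET`) is one way to confirm that the endpoint, key, and bucket names agree before deploying.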