FaST-GShare-Autoscaler is a serverless implementation built based on the FaST-GShare. This platform introduces a new FaSTFunc
CRD (Custom Resource Definition) along with its corresponding Operator Controller. FaSTFunc enables FaaS-level control over FaSTPods within FaST-GShare. Users only need to define the container image required for deep model inference and deploy it. FaST-GShare-Autoscaler will automatically scale the model instances based on varying and real-time user workloads.
- go version v1.22.0+
- docker version 17.03+.
- kubectl version v1.11.3+.
- Access to a Kubernetes v1.11.3+ cluster.
Build and push your image to the location specified by IMG
:
make docker-build docker-push IMG=<some-registry>/fast-gshare-autoscaler:tag
ex.
make docker-build docker-push IMG=docker.io/kontonpuku666/fast-gshare-autoscaler:test
build and update the container in K8S environment
make docker-clean docker-build docker-push IMG=docker.io/kontonpuku666/fast-gshare-autoscaler:test
NOTE: This image ought to be published in the personal registry you specified. And it is required to have access to pull the image from the working environment. Make sure you have the proper permission to the registry if the above commands don’t work.
Install the CRDs into the cluster:
make install
Deploy the Manager to the cluster with the image specified by IMG
:
make deploy IMG=<some-registry>/fast-gshare-autoscaler:tag
ex.
make deploy IMG=docker.io/kontonpuku666/fast-gshare-autoscaler:test
NOTE: If you encounter RBAC errors, you may need to grant yourself cluster-admin privileges or be logged in as admin.
Create a FaSTFunc
kubectl apply -f config/samples/sample.yaml
NOTE: Ensure that the samples has default values to test it out.
Delete the instances (CRs) from the cluster:
kubectl delete -k config/samples/
Delete the APIs(CRDs) from the cluster:
make uninstall
UnDeploy the controller from the cluster:
make undeploy
Following are the steps to build the installer and distribute this project to users.
- Build the installer for the image built and published in the registry:
make build-installer IMG=<some-registry>/fast-gshare-autoscaler:tag
NOTE: The makefile target mentioned above generates an 'install.yaml' file in the dist directory. This file contains all the resources built with Kustomize, which are necessary to install this project without its dependencies.
- Using the installer
Users can just run kubectl apply -f to install the project, i.e.:
kubectl apply -f https://raw.githubusercontent.com/<org>/fast-gshare-autoscaler/<tag or branch>/dist/install.yaml
Copyright 2024 FaST-GShare Authors, KontonGu (Jianfeng Gu), et. al. @Techinical University of Munich, CAPS Cloud Team
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.