[Helm chart] Adding support for multi hosts and backends #7832

sledress · 2025-11-03T13:42:31Z

Description

This PR adds support for multiple hosts in the Helm ingress template (templates/ingress.yaml), allowing SkyPilot to route traffic for several backend services (e.g. SkyPilot API, MLflow, Grafana) through a shared HTTPS Load Balancer and common TLS certificate.

🔧 Implementation details

Updated templates/ingress.yaml to replace the single ingress.host key with a new ingress.hosts list.

Introduced nested loops to iterate over multiple hosts and paths:

hosts:
  - host: skypilot.domain.com
    paths:
      - path: /
        serviceName: skypilot-api-service
        servicePort: 80
      - path: /grafana
        serviceName: skypilot-grafana
        servicePort: 80
  - host: mlflow.domain.com
    paths:
      - path: /
        serviceName: mlflow-tracking
        servicePort: 5000

Retained the original single-host logic (commented out at the end of the file) to show backward compatibility.
No logic was modified outside of templates/ingress.yaml.

🧾 Example rendered ingress

spec:
  rules:
  - host: skypilot.domain.com
    http:
      paths:
      - pathType: Prefix
        path: /
        backend:
          service:
            name: skypilot-api-service
            port:
              number: 80
      - pathType: Prefix
        path: /grafana
        backend:
          service:
            name: skypilot-grafana
            port:
              number: 80
  - host: mlflow.domain.com
    http:
      paths:
      - pathType: Prefix
        path: /
        backend:
          service:
            name: mlflow-tracking
            port:
              number: 5000

🧩 Values example (to document in `values.yaml`)

# Adding support for multi-host ingress routing
hosts:
  - host: skypilot.domain.com
    paths:
      - path: /
        serviceName: skypilot-api-service
        servicePort: 80

This enables multi-service ingress deployments while keeping compatibility with existing single-host setups.

Describe the tests ran

✅ Environment:

GKE internal load balancer (private IP) with pre-shared TLS cert (mlops-gke-gcp)
NGINX ingress (K3s on-prem)
OAuth2 proxy with HTTPS enabled
Multiple backend services under distinct hostnames

✅ Scenarios tested:

Single-host (legacy) deployment — ✅ no regression
Multi-host ingress with 3 services — ✅ OK
TLS certificate reuse (wildcard cert) — ✅ OK
Service path routing correctness — ✅ OK
No change to other Helm chart components — ✅ OK

Checklist

Tested (run the relevant ones):

Code formatting: install pre-commit or run bash format.sh
Manual validation on GKE and K3s environments
All smoke tests: /smoke-test or pytest tests/test_smoke.py
Backward compatibility: /quicktest-core or pytest tests/smoke_tests/test_backward_compat.py

concretevitamin · 2025-11-03T15:45:07Z

This is awesome @sledress! Will review soon.

kevinmingtarja · 2025-11-03T19:55:46Z

Hi @sledress! Thanks for this PR, I think this would indeed be a common use case as teams using SkyPilot scale up.

However, I think we can achieve the same goal without modifying the Helm chart, by using multiple Ingress resources with the same ingress controller, i.e. by setting ingressClassName to the same ingress controller, so that they automatically share the same L7 load balancer and TLS cert with host-based routing. I am thinking we could do something like:

Deploy SkyPilot normally with existing values:

ingress:
  enabled: true
  host: skypilot.domain.com
  # ... other settings

Create separate Ingress resources for other services (MLflow, etc.)

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: mlflow-ingress
  namespace: skypilot # Or another namespace
spec:
  ingressClassName: nginx  # Same controller as the default from our Helm chart (.Values.ingress.ingressClassName)
  rules:
  - host: mlflow.domain.com
    http:
      paths:
      - path: /
        pathType: Prefix
        backend:
          service:
            name: mlflow-tracking
            port:
              number: 5000

Could you try and see whether this setup works for you?

If so, would you be interested in contributing a section explaining this pattern to our docs instead? I think this would help a lot of people in the community, as it should be a pretty common pattern!

sledress · 2025-11-04T12:48:04Z

Hi @concretevitamin
I'll prepare this test and keep you posted.
For sure, I'd love updating the doc.

sledress and others added 6 commits November 3, 2025 14:19

Adding support for multiple hosts and backend services

46a20db

Merge branch 'master' of https://github.com/devteam418/skypilot

75744e9

Adding backward compatibility / Values schema check updated

dcb0107

values.schema.json regenerated

6e55290

Fixed defaults

32a02a3

values.schema.json regenerated

7e009ea

Fixing bad serviceName retrofit

88eaf8a

Michaelvll requested review from kevinmingtarja and rohansonecha November 3, 2025 17:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Helm chart] Adding support for multi hosts and backends #7832

[Helm chart] Adding support for multi hosts and backends #7832

Uh oh!

sledress commented Nov 3, 2025

Uh oh!

concretevitamin commented Nov 3, 2025

Uh oh!

kevinmingtarja commented Nov 3, 2025 •

edited

Loading

Uh oh!

sledress commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Helm chart] Adding support for multi hosts and backends #7832

Are you sure you want to change the base?

[Helm chart] Adding support for multi hosts and backends #7832

Uh oh!

Conversation

sledress commented Nov 3, 2025

Description

🔧 Implementation details

🧾 Example rendered ingress

🧩 Values example (to document in values.yaml)

Describe the tests ran

Checklist

Uh oh!

concretevitamin commented Nov 3, 2025

Uh oh!

kevinmingtarja commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sledress commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

🧩 Values example (to document in `values.yaml`)

kevinmingtarja commented Nov 3, 2025 •

edited

Loading