Skip to content

Conversation

@benbz
Copy link
Member

@benbz benbz commented Nov 14, 2025

k3d integrates a LB, metrics-server and a registry so we can simplify our config a fair bit.

It also appears faster to startup, at the cost of needing to retry

resp = <Response [429 Too Many Requests]>

    @staticmethod
    def raise_for_status(resp):
        try:
            resp.raise_for_status()
        except httpx.HTTPError as e:
>           raise transform_exception(e)
E           lightkube.core.exceptions.ApiError: storage is (re)initializing

@benbz benbz requested a review from a team as a code owner November 14, 2025 16:19
@github-actions
Copy link

github-actions bot commented Nov 14, 2025

dyff of changes in rendered templates of CI manifests

Full contents of manifests and dyffs are available in https://github.com/element-hq/ess-helm/actions/runs/19371712297/artifacts/4570957734

pytest-matrix-rtc-synapse-wellknown-values.yaml
@@ Ingress/ess-ci/release-name-matrix-rtc - metadata @@
- annotations:
-   nginx.ingress.kubernetes.io/proxy-buffering: off
-   nginx.ingress.kubernetes.io/proxy-read-timeout: 120
-   nginx.ingress.kubernetes.io/proxy-send-timeout: 120



@@ Ingress/ess-ci/release-name-synapse - metadata @@
- annotations:
-   nginx.ingress.kubernetes.io/proxy-body-size: 100M



@@ Ingress/ess-ci/release-name-well-known - spec.rules.0.http.paths.0.pathType @@
- ImplementationSpecific
+ Prefix

@benbz benbz force-pushed the bbz/k3d branch 5 times, most recently from d963803 to 4fb4d24 Compare November 14, 2025 16:49
if: ${{ failure() }}
shell: bash
run: |
kind export logs --name ess-helm ./ess-helm-logs
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We are loosing kubernetes system logs and pod logs with this move to k3d. It's quite useful to understand what happened in CI. We should have an equivalent with k3d before merging this I believe.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yup I've run into the lack of this whilst debugging the CI failures. Oustanding things are

  • Pod logs in CI
  • Wait for 443/TCP to be available locally before ingress_ready returns
  • Fix the flakiness with the pod networking tests which is now much frequent
  • Fix whatever is causing HAProxy failures?
  • Possibly do things with DockerHub in the k3d config itself

@benbz benbz marked this pull request as draft November 17, 2025 09:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants