
fix: requeue if failed to trigger an un-instrument rollout #2406

Merged
merged 4 commits into from
Feb 8, 2025

Conversation

RonFed
Collaborator

@RonFed RonFed commented Feb 7, 2025

  • Reduce flakiness in the sources e2e test by waiting 1 second between rollouts. The flakiness arises because we now use a timestamp annotation to trigger a rollout. The timestamp has a resolution of seconds, so if the test progresses through 2 steps in less than a second, one of the rollouts is never triggered.
  • If a rollout fails when we want to un-instrument a workload, requeue the request so that it is retried.

@RonFed RonFed marked this pull request as ready for review February 8, 2025 09:29
kubectl rollout status deployment -l app=inventory
kubectl rollout status deployment -l app=pricing
kubectl rollout status deployment -l app=membership
content: ./wait_for_rollout.sh
Collaborator

At this point, we don't yet know whether the rollout has started, right?
Can that lead to race conditions?

@RonFed RonFed merged commit fb4fa7f into odigos-io:main Feb 8, 2025
45 checks passed
2 participants