forked from kserve/kserve
RHOAIENG-34472: [WIP] GIE v1 Migration #948
Closed
KillianGolds wants to merge 166 commits into opendatahub-io:release-v0.15 from KillianGolds:RHOAIENG-34472-refactor
166 commits
5c6b34f
(llmisvc): migrate to v1 InferencePool with v1alpha2 failover
KillianGolds c9df501
Resolve pre-commit linting errors
KillianGolds 444671a
Downgrade gateway-api-inference-extension to v1.0.0
KillianGolds 70cc0cb
upgrade controller-gen to v0.17.2
KillianGolds cc7f4b1
Apply fork workaround for GIE v1.0.0 validation bug and regenerate ma…
KillianGolds 0429ef2
Add GIE v1 InferencePool CRD for integration tests
KillianGolds b861c9b
Fix openapi_generated.go formatting
KillianGolds 483a080
Add GIE v1alpha2 InferencePool CRD for integration tests
KillianGolds 29aec70
fix(llmisvc): add required matchLabels to scheduler config
KillianGolds 886d72b
fix(llmisvc): add required port field to endpointPickerRef
KillianGolds 059202b
fix(llmisvc): fix integration tests for GIE v1 migration
KillianGolds 308fba0
chore: regenerate openapi_generated.go
KillianGolds 6ea6db2
fix(llmisvc): fix GIE v1 InferencePool test helpers
KillianGolds 91623f8
fix(llmisvc): use int64 for unstructured port numbers
KillianGolds aa093ae
fix(llmisvc): restore config merge logic and update tests for GIE v1
KillianGolds acfd40c
fix(llmisvc): add GIE v1 scheme registration and InferencePool watch
KillianGolds e491892
fix(llmisvc): add watches for v1alpha2 InferencePool and InferenceModel
KillianGolds 2dfbf5b
chore: upgrade to Kubernetes v0.34, Gateway API v1.4, and KEDA v2.18
KillianGolds a83d643
fix: update Dockerfiles to use Go 1.24.7
KillianGolds 22e7100
fix: allow Go 1.24.6 to build code requiring 1.24.7
KillianGolds 9b8f9d1
fix: use GOTOOLCHAIN=auto to handle Go version mismatch
KillianGolds d220b1e
add GOTOOLCHAIN=auto to all Dockerfiles for Go version compatibility
KillianGolds dc9d572
fix(build): disable go-licenses checks in localmodel Dockerfiles
KillianGolds 156dfaa
fix(rbac): add GIE v1 inferencemodels and inferenceobjectives permiss…
KillianGolds 659b827
fix(e2e): upgrade Gateway API Inference Extension to v1.0.0
KillianGolds 4c85a7b
fix(llmisvc): implement dual-pool fallback in InferencePool readiness…
KillianGolds 987c43c
fix(llmisvc): auto-inject dual InferencePool backend refs for schedul…
KillianGolds 6aa70d2
Make Precommit failure
KillianGolds b1f5abc
fix(llmisvc): use separate pool names for v1 and v1alpha2 backend refs
KillianGolds 798c993
fix(llmisvc): check Gateway Controller support before migrating to v1
KillianGolds cc7d54c
fix(llmisvc): check ResolvedRefs condition in InferencePool readiness…
KillianGolds 141ed63
fix(llmisvc): defer InferencePool readiness evaluation until after HT…
KillianGolds 08dc6e6
fix(ci): install GIE controller in OpenShift E2E for InferencePool su…
KillianGolds 8a0fc38
fix(ci): enable InferencePool support in Istio
KillianGolds 016a8dd
fix(llmisvc): add retry logic for HTTPRoute update conflicts
KillianGolds 012676a
fix(ci): use openshift-default GatewayClass name for E2E tests
KillianGolds ffff4ba
chore(deps): switch to upstream gateway-api with merged validation fix
KillianGolds b79845e
fix(llmisvc): add retry logic for finalizer operations and fix async …
KillianGolds 2be3766
chore: update generated files from precommit hook
KillianGolds 07a191b
Alternative
pierDipi 717575c
(llmisvc): migrate to v1 InferencePool with v1alpha2 failover
KillianGolds 7b048d8
Resolve pre-commit linting errors
KillianGolds a47c80f
Downgrade gateway-api-inference-extension to v1.0.0
KillianGolds 79e6c27
upgrade controller-gen to v0.17.2
KillianGolds 8c16e7c
Apply fork workaround for GIE v1.0.0 validation bug and regenerate ma…
KillianGolds 86aeed4
Add GIE v1 InferencePool CRD for integration tests
KillianGolds 4b4844b
Fix openapi_generated.go formatting
KillianGolds 02b0abe
Add GIE v1alpha2 InferencePool CRD for integration tests
KillianGolds 06565e5
fix(llmisvc): add required matchLabels to scheduler config
KillianGolds a1d8e51
fix(llmisvc): add required port field to endpointPickerRef
KillianGolds 68c55f0
fix(llmisvc): fix integration tests for GIE v1 migration
KillianGolds c8353e1
chore: regenerate openapi_generated.go
KillianGolds 7f0bebf
fix(llmisvc): fix GIE v1 InferencePool test helpers
KillianGolds 3eb5d45
fix(llmisvc): use int64 for unstructured port numbers
KillianGolds 4ffa44a
fix(llmisvc): restore config merge logic and update tests for GIE v1
KillianGolds b382549
fix(llmisvc): add GIE v1 scheme registration and InferencePool watch
KillianGolds 920b8eb
fix(llmisvc): add watches for v1alpha2 InferencePool and InferenceModel
KillianGolds 3bed0b4
chore: upgrade to Kubernetes v0.34, Gateway API v1.4, and KEDA v2.18
KillianGolds 3581f80
fix: update Dockerfiles to use Go 1.24.7
KillianGolds 60b5a4f
fix: allow Go 1.24.6 to build code requiring 1.24.7
KillianGolds 7e2f7c8
fix: use GOTOOLCHAIN=auto to handle Go version mismatch
KillianGolds 65536cd
add GOTOOLCHAIN=auto to all Dockerfiles for Go version compatibility
KillianGolds 73ce255
fix(build): disable go-licenses checks in localmodel Dockerfiles
KillianGolds ff187b6
fix(rbac): add GIE v1 inferencemodels and inferenceobjectives permiss…
KillianGolds e416655
fix(e2e): upgrade Gateway API Inference Extension to v1.0.0
KillianGolds 3e2bbcd
fix(llmisvc): implement dual-pool fallback in InferencePool readiness…
KillianGolds ef835ac
fix(llmisvc): auto-inject dual InferencePool backend refs for schedul…
KillianGolds 9d04f60
Make Precommit failure
KillianGolds 8f5a127
fix(llmisvc): use separate pool names for v1 and v1alpha2 backend refs
KillianGolds b63f4ec
fix(llmisvc): check Gateway Controller support before migrating to v1
KillianGolds f41abf5
fix(llmisvc): check ResolvedRefs condition in InferencePool readiness…
KillianGolds 5786f15
fix(llmisvc): defer InferencePool readiness evaluation until after HT…
KillianGolds 97449c2
fix(ci): install GIE controller in OpenShift E2E for InferencePool su…
KillianGolds 2a8caa2
fix(ci): enable InferencePool support in Istio
KillianGolds bd989f7
fix(llmisvc): add retry logic for HTTPRoute update conflicts
KillianGolds 4108db0
fix(ci): use openshift-default GatewayClass name for E2E tests
KillianGolds 38bd1a9
chore(deps): switch to upstream gateway-api with merged validation fix
KillianGolds 254ea6c
fix(llmisvc): add retry logic for finalizer operations and fix async …
KillianGolds 7567270
chore: update generated files from precommit hook
KillianGolds ecb1a22
fix(llmisvc): trim HTTPRoute to single rule for GIE v1 migration
KillianGolds abb40d0
fix(typo)
KillianGolds b7caa8f
fix(llmisvc): initialize Config field in test reconciler
KillianGolds 8ca895c
fix(typo): Update other typo
KillianGolds 3b4c9ad
fix(llmisvc): add required fields to InferencePool status for GIE v1 …
KillianGolds f673d7b
fix(llmisvc): correct Service backend detection in extractRoutePath
KillianGolds 2baf173
test(llmisvc): make auth tests conditional on RHCL availability
KillianGolds dc6c47c
fix(llmisvc): make InferencePool config compatible across KServe vers…
KillianGolds bb9a3aa
chore: run precommit fixes and codegen
KillianGolds 578871d
fix(llmisvc): enable v1alpha2 fallback in dual-pool strategy
KillianGolds 6e09281
fix(test): correct rhcl_available fixture usage in auth tests
KillianGolds 2e0f069
refactor(test): remove duplicate backend ref builder functions
KillianGolds 2f227a8
chore: run precommit
KillianGolds b68114a
fix(llmisvc): watch v1 InferencePool status changes for migration
KillianGolds d358c02
Merge branch 'RHOAIENG-34472-dev' into maybe-simpler
pierDipi 68731fa
v1alpha2
pierDipi ef7e713
chore(clean): commit go.sum go mod tidy cleanup
KillianGolds 6edd063
Fix envtests
pierDipi d8d4978
v1alpha2 llmisvcconfig conversion
pierDipi b3945dc
Out of band migration
pierDipi 61fc2ed
Use v1alpha1 for EndpointPickerConfig
pierDipi ea80b64
Revert config/default/manager_image_patch.yaml
pierDipi ed30c9c
Revert config/overlays/odh/params.env
pierDipi dcdeee7
Revert test/scripts/gh-actions/run-e2e-tests.sh
pierDipi 9943c6c
EndpointPickerConfig v1alpha1
pierDipi 8281a3b
Merge branch 'release-v0.15' into RHOAIENG-34472-refactor
KillianGolds bd28818
Test e2e both v1alpha1 and v1alpha2 versions
pierDipi a7b9c01
Add version marker
pierDipi c8cde0d
Merge branch 'release-v0.15' into RHOAIENG-34472-refactor
KillianGolds b2f6aa7
Address PR feedback: revert DEV.md and add v1 InferencePool RBAC
KillianGolds 04ce251
fix(test): hardcode v1alpha1 objects to fix validation tests
KillianGolds a24cf66
chore: regenerate code from make precommit
KillianGolds 37879e3
fix: disambiguate v1alpha1 and v1alpha2 webhook names
KillianGolds 291d381
fix: separate webhook configurations for v1alpha1 and v1alpha2
KillianGolds ab27617
fix: register v1alpha1 validators in test environment
KillianGolds 2459fc3
test: remove redundant TestODHModelControllerCompatibility
KillianGolds abb5b14
fix: repair malformed Python SDK tests for v1alpha1 and v1alpha2
KillianGolds ea877bc
fix: add valid URIs to LLMModelSpec tests
KillianGolds f3f4560
test: reduce CPU workload requests in e2e tests
KillianGolds f6239e5
fix: reduce e2e test parallelism to prevent HTTP 409 errors
KillianGolds 67fb13a
fix: wait for LLMInferenceService deletion to complete
KillianGolds 1a697b3
RHOAIENG-34916: CA Cert signing and validation (#957)
pierDipi f28fa4a
chore: add Python SDK for KServe types and fix v1alpha2 test reference
KillianGolds d126e8d
Merge upstream release-v0.15 into RHOAIENG-34472-refactor
KillianGolds b86d230
fix: add KServe v1alpha1 model imports to Python SDK
KillianGolds a1705b3
fix: remove orphaned Python SDK files and fix Makefile URL quoting
KillianGolds 13cbaa0
Merge remote-tracking branch 'upstream/release-v0.15' into RHOAIENG-3…
KillianGolds 89009cf
fix: address PR feedback on webhook names and comments
KillianGolds 250bea9
Merge latest changes from upstream/release-v0.15
KillianGolds 96236bd
fix: remove risky RetryOnConflict wrapper from updateIfNeeded
KillianGolds 6e36ead
chore: remove unnecessary comment about Gateway API v1.4 port assignment
KillianGolds 82a4b5a
fix: remove pytest.skip for missing RHCL - tests should fail not skip
KillianGolds 6ddc957
test: use API version in service names to avoid conflicts
KillianGolds 9729850
fix: deploy Kuadrant before KServe in setup script
KillianGolds 2e99def
deps: upgrade GIE to v1.1.0 and remove replace directive
KillianGolds 0045ec7
docs: add comments explaining go-licenses incompatibility
KillianGolds 0907702
chore: regenerate CRDs with GIE v1.1.0 validation updates
KillianGolds e4fb2da
fix: preserve resourceVersion when updating resources
KillianGolds 6398aa3
fix: use template variables for test resource names to support API ve…
KillianGolds 7b079ca
fix: include API version in LLMInferenceServiceConfig names to preven…
KillianGolds 24f21be
fix: use api_version fixture parameter in test_case fixture
KillianGolds be67414
fix: use base service name for InferencePool references
KillianGolds d955510
test: make HTTPRoutes dynamic with API version isolation
KillianGolds 9915747
fix: disable authentication on test Gateway resources
KillianGolds 30065f5
fix: delete Kuadrant AuthPolicies for Gateways with auth disabled
KillianGolds 3d096da
fix(e2e): disable auth on LLMInferenceService resources for router tests
KillianGolds a3adfa1
fix: default endpointPickerRef.port for GIE v1 compatibility
KillianGolds 3c8060b
Merge upstream/release-v0.15 into RHOAIENG-34472-refactor
KillianGolds 85e05dc
chore: update go.sum after upstream merge
KillianGolds 1fa6228
fix: set conversion webhook namespace to opendatahub in ODH overlay
KillianGolds 17caef9
Merge remote-tracking branch 'upstream/release-v0.15' into RHOAIENG-3…
KillianGolds 7605cc7
chore: precommit
KillianGolds 201cf88
fix(ci): use apply instead of create for Istio deployment
KillianGolds 7e2b68d
fix: update MinIO image to resolve CI test failures
KillianGolds 0458d29
perf: reduce webhook load during dual InferencePool creation
KillianGolds 8de247f
fix: increase liveness probe timeouts for CPU workloads in e2e tests
KillianGolds 00397a2
fix: resolve nil pointer errors in Go template configurations
KillianGolds 3fdfc9a
fix: prevent Kubernetes 63-character naming violations with SafeChild…
KillianGolds b01c995
fix: improve HTTPRoute lifecycle management for controller-managed an…
KillianGolds 7f38b4b
fix: correct conversion webhook namespace and service name
KillianGolds 51811ef
fix: load kube config in rhcl_available fixture
KillianGolds 4a18f10
fix: only disable auth for non-auth tests in fixture
KillianGolds e3899d6
Merge remote-tracking branch 'upstream/release-v0.15' into RHOAIENG-3…
KillianGolds 6f9d72c
chore: regenerate consolidated CRD with updated webhook service name
KillianGolds 46b7d63
fix: increase SafeChildName suffix reservation to 20 chars for multi-…
KillianGolds c6e240a
test: update test resource lookups to use SafeChildName
KillianGolds eabd009
Merge remote-tracking branch 'upstream/release-v0.15' into RHOAIENG-3…
KillianGolds
Maybe this can go in a separate PR, but I also wonder if we can fix it differently and leverage faster test cycles?