feat(cache): KEP-2655: Support provisioning of cache with Kubeflow SDK #112

akshaychitneni · 2025-09-30T16:07:54Z

What this PR does / why we need it:
Adds a cache initializer to the SDK.

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged):

Fixes kubeflow/trainer#2866

Checklist:

Docs included if any changes are user facing

google-oss-prow · 2025-09-30T16:08:01Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign astefanutti for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coveralls · 2025-09-30T16:09:45Z

Pull Request Test Coverage Report for Build 18136229986

Details

1 of 11 (9.09%) changed or added relevant lines in 1 file are covered.
1 unchanged line in 1 file lost coverage.
Overall coverage decreased (-1.2%) to 70.546%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
kubeflow/trainer/utils/utils.py	1	11	9.09%

Files with Coverage Reduction	New Missed Lines	%
kubeflow/trainer/utils/utils.py	1	62.15%

Totals
Change from base Build 17979146135:	-1.2%
Covered Lines:	297
Relevant Lines:	421

💛 - Coveralls

andreyvelich · 2025-10-06T14:42:42Z

kubeflow/trainer/types/types.py

+    metadata_loc: str
+    schema_name: str
+    table_name: str
+    env: Optional[dict[str, str]] = None


Do we want to be explicit about which optional values users can set, rather than giving them an arbitrary env configuration in DataCacheInitializer?
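To make the question concrete, here is a minimal sketch of the explicit alternative. The three required fields come from the diff above; `worker_cpu` and `worker_mem` are hypothetical option names chosen only for illustration, not fields confirmed by this PR.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DataCacheInitializer:
    # Required fields, as shown in the diff above.
    metadata_loc: str
    schema_name: str
    table_name: str
    # Hypothetical explicit options that would replace the free-form `env` dict.
    worker_cpu: Optional[str] = None
    worker_mem: Optional[str] = None


# Known options are accepted; anything else fails at construction time
# instead of being passed through silently as an environment variable.
cache = DataCacheInitializer(
    metadata_loc="s3://bucket/metadata.json",
    schema_name="db",
    table_name="tbl",
    worker_cpu="2",
)
```

With explicit fields, a misspelled option raises a `TypeError` immediately, whereas a typo in an `env` key would only surface (if at all) inside the initializer container.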

andreyvelich · 2025-10-06T14:43:47Z

kubeflow/trainer/utils/utils.py

-    if not isinstance(dataset, types.HuggingFaceDatasetInitializer):
-        return None
+    if isinstance(dataset, types.HuggingFaceDatasetInitializer):
+        # TODO (andreyvelich): Support more parameters.


You can probably remove this TODO for now.

Suggested change

# TODO (andreyvelich): Support more parameters.

andreyvelich · 2025-10-06T14:46:43Z

kubeflow/trainer/utils/utils.py

+            ),
+        )
+        return dataset_initializer
+    elif isinstance(dataset, types.DataCacheInitializer):


You can probably make this more generic, since the env names match the dataclass field names:

    envs = []
    for f in fields(dataset):
        name = f.name.upper()
        value = getattr(dataset, f.name)
        envs.append(models.IoK8sApiCoreV1EnvVar(name=name, value=str(value)))
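The loop suggested above can be tried out in isolation with the sketch below. `EnvVar` is a stand-in for `models.IoK8sApiCoreV1EnvVar` so that the snippet runs without the generated Kubernetes models, and `build_envs` is a hypothetical helper name. Skipping `None` values is an added assumption, so that unset optional fields do not become the literal string "None".

```python
from dataclasses import dataclass, fields


@dataclass
class EnvVar:
    """Stand-in for models.IoK8sApiCoreV1EnvVar (assumption: name and value only)."""
    name: str
    value: str


@dataclass
class DataCacheInitializer:
    """Trimmed copy of the fields shown in the diffs in this review."""
    cluster_size: int
    metadata_loc: str
    schema_name: str
    table_name: str


def build_envs(dataset) -> list[EnvVar]:
    """Map every dataclass field to an env var named after the upper-cased field."""
    envs = []
    for f in fields(dataset):
        value = getattr(dataset, f.name)
        if value is None:
            # Assumption: unset optional fields are omitted rather than stringified.
            continue
        envs.append(EnvVar(name=f.name.upper(), value=str(value)))
    return envs


envs = build_envs(
    DataCacheInitializer(
        cluster_size=3,
        metadata_loc="s3://bucket/metadata.json",
        schema_name="db",
        table_name="tbl",
    )
)
```

One caveat with the generic approach: a non-scalar field such as the `env` dict from the earlier diff would be stringified as a whole, so it would need special handling or removal.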

andreyvelich · 2025-10-06T14:52:42Z

kubeflow/trainer/types/types.py

+    cluster_size: int
+    metadata_loc: str
+    schema_name: str
+    table_name: str


Would it be possible to use storage_uri to set the location, or does that have limitations?

e.g.

cache://<CATALOG_NAME>/<DATABASE_NAME>/<TABLE_NAME>

I can imagine that metadata location can be set separately.

As discussed, let's use the URI format cache://<DATABASE_NAME>/<TABLE_NAME> and set the metadata location separately via metadata_loc.
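A minimal sketch of how such a URI could be split into its two parts, using only the standard library. `parse_cache_uri` is a hypothetical helper name, and the validation rules are assumptions rather than the behavior implemented in this PR.

```python
from urllib.parse import urlparse


def parse_cache_uri(storage_uri: str) -> tuple[str, str]:
    """Split cache://<DATABASE_NAME>/<TABLE_NAME> into (database, table)."""
    parsed = urlparse(storage_uri)
    if parsed.scheme != "cache":
        raise ValueError(f"expected a cache:// URI, got: {storage_uri!r}")
    database = parsed.netloc
    table = parsed.path.strip("/")
    # Assumption: exactly one path segment is allowed after the database name.
    if not database or not table or "/" in table:
        raise ValueError(
            f"expected cache://<DATABASE_NAME>/<TABLE_NAME>, got: {storage_uri!r}"
        )
    return database, table


database, table = parse_cache_uri("cache://sales/orders")
```

Here `database` and `table` would feed the schema and table name settings, while `metadata_loc` stays a separate field on the initializer.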

andreyvelich · 2025-10-08T15:19:15Z

/milestone v0.2

Signed-off-by: Akshay Chitneni <[email protected]>

google-oss-prow bot requested review from kramaranya and szaher September 30, 2025 16:08

google-oss-prow bot added the size/L label Sep 30, 2025

akshaychitneni force-pushed the cache branch from fe6eb07 to e1b410a Compare October 1, 2025 17:05

andreyvelich reviewed Oct 6, 2025

google-oss-prow bot added this to the v0.2 milestone Oct 8, 2025

feat(cache): KEP-2655: Adding cache initializer

502286a

Signed-off-by: Akshay Chitneni <[email protected]>

akshaychitneni force-pushed the cache branch from e1b410a to 502286a Compare October 8, 2025 19:42
