Skip to content

Conversation

@ericm-db
Copy link
Contributor

@ericm-db ericm-db commented Dec 3, 2025

What changes were proposed in this pull request?

We want to add .name() to the DataStreamReader to enable naming streaming sources, which will allow users to add, remove and reorder sources. This API is currently package-private and not user-accessible.

Why are the changes needed?

Currently, if a user tries to make any change to their streaming source in their query, the query will fail as the way we compare sources (via ordinal in the logical plan) are incompatible. Keying by name as opposed to ordinal is far less brittle

Does this PR introduce any user-facing change?

No

How was this patch tested?

Unit tests

Was this patch authored or co-authored using generative AI tooling?

No

@ericm-db ericm-db changed the title [WIP] Adding .name() to DataStreamReader to enable naming streaming sources [SPARK-54584] Adding .name() to DataStreamReader to enable naming streaming sources Dec 3, 2025
@HyukjinKwon HyukjinKwon changed the title [SPARK-54584] Adding .name() to DataStreamReader to enable naming streaming sources [SPARK-54584][SS] Adding .name() to DataStreamReader to enable naming streaming sources Dec 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant