Skip to content

Conversation

@gengliangwang
Copy link
Member

What changes were proposed in this pull request?

Introduce a new Spark config spark.log.legacyTaskNameMdc.enabled:
When true, the MDC key mdc.taskName will be set in the logs, which is consistent with the behavior of Spark 3.1 to Spark 3.5 releases. When false, the logging framework will use task_name as the MDC key for consistency with other new MDC keys.

Why are the changes needed?

As discussed in #46386 (comment), we should add a configuration and migration guide about the change in the MDC key of Task Name.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Manual test

Was this patch authored or co-authored using generative AI tooling?

No

@dongjoon-hyun
Copy link
Member

To @mridulm , are you okay with the default value of spark.log.legacyTaskNameMdc.enabled?

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm fine for both true or false of the default value.

Please determine the default according to @mridulm 's perspective because he is the effective user of MDC who will be affected by this change.

@dongjoon-hyun
Copy link
Member

To @mridulm , let me merge this first because @cloud-fan will cut Today.
We can change the default value later before 4.0.0.

@dongjoon-hyun
Copy link
Member

Merged to master. Thank you, @gengliangwang !

@gengliangwang
Copy link
Member Author

Thanks a lot, @dongjoon-hyun !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants