Fix zero percentage replacement in get_binned_data Function and update parameter name#1278
Fix zero percentage replacement in get_binned_data Function and update parameter name#1278boemer00 wants to merge 3 commits intoevidentlyai:mainfrom boemer00:min_non_zero
Conversation
Enhance
|
|
The linter check has been failing for a while, and now the branch also has merge conflicts with main. Given the current state of the repo, resolving both issues would likely be more work than starting fresh. If the changes are still relevant, feel free to rebase onto the latest main and open a new PR. Happy to review it then! |
|
Hey @boemer00, thank you for the original work here. Since this branch had diverged and had some linter and merge issues, I’ve opened a fresh PR based on your implementation and rebased it on the latest main. New PR: #1660 Hello, @emeli-dral! Happy to continue iteration there! |
This pull request introduces changes that address the issue:
The fixed value for feel_zeroes in get_binned_data may lead to deviation in some case. #334Dynamic Fill Value Calculation: The fill value used to replace zero percentages is now calculated dynamically based on the actual data, rather than using a fixed value.
Ensuring Correct Fill Value: The fill value is guaranteed to be smaller than the minimum non-zero percentage in both the reference and current datasets. This adjustment ensures that the data distribution remains accurate.
Maintaining Data Distribution: These changes help maintain the correct distribution of data, which is crucial for accurate statistical tests, including Kullback-Leibler divergence drift score calculations.
Parameter Name Update: The parameter name has been updated from "feel_zeroes" to "fill_zeroes" for clarity and consistency.
Hope it helps
boemer00