fix: prevent multiple credible filters to override spark plan #766
General question about this PR: since Spark DataFrames are immutable, we could enforce directly on the dataset that `.df` cannot be set outside of the constructor(s).
Currently the `.df` property is mutable, which could cause cases similar to the one mentioned here.
I think this topic is open for discussion.
We should definitely enforce one way, not both, in our code.
As discussed, this was not causing any issues in the pipeline, but it could easily cause problems.
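One way the suggestion above could look is sketched below. It is an illustration only: `ReadOnlyDataset` and `with_df` are hypothetical names, not gentropy's actual `Dataset` API, and a plain list stands in for a Spark DataFrame so the snippet runs without Spark.

```python
class ReadOnlyDataset:
    """Hypothetical dataset wrapper whose `df` can only be set in the constructor."""

    def __init__(self, df):
        self._df = df

    @property
    def df(self):
        # No setter is defined, so `dataset.df = ...` raises AttributeError.
        return self._df

    def with_df(self, df):
        # Transformations return a new wrapper instead of mutating this one.
        return type(self)(df)


dataset = ReadOnlyDataset([1, 2, 3])  # a list stands in for a Spark DataFrame

try:
    dataset.df = [1]
    assignment_blocked = False
except AttributeError:
    assignment_blocked = True

filtered = dataset.with_df([x for x in dataset.df if x > 1])
```

With this shape, any method that changes the data has to go through the constructor, so the original object never changes under the caller.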
…/opentargets/gentropy into do_fix_credible_set_filter_issue
This reverts commit a358781.
The wandb version was bumped. Was that on purpose? Judging by the commit names, I guess not :)
There is still some change in the LGTM
I aligned the poetry version. One line changed in the lock, and I'm not sure what caused it. It might even be linked to machine architecture. We can discuss it; I don't think it should be a blocker. I'm merging now... this has been more painful than it should have been.
Because `filter_credible_set` modified the object it was called on, two subsequent calls on the same object were both applied to the same dataframe, so the second filter ran on the output of the first. That is not the expected behaviour, as two calls on the same object could return different results.

This PR creates a new StudyLocus object every time `filter_credible_set` is called, so the original object is left unchanged.

Tests were passing because they were creating a new dataframe for each test.
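The difference between the two behaviours can be sketched without Spark. The classes, the `is95`/`is99` flags and the string-based `flag` argument below are hypothetical stand-ins chosen for illustration, not gentropy's actual `StudyLocus` implementation or signature.

```python
from dataclasses import dataclass

# Toy credible set: "b" is only in the 99% set, "c" is in neither.
ROWS = (
    {"variant": "a", "is95": True, "is99": True},
    {"variant": "b", "is95": False, "is99": True},
    {"variant": "c", "is95": False, "is99": False},
)


class MutatingLocus:
    """Stand-in for the old behaviour: the filter overwrites the object's data."""

    def __init__(self, rows):
        self.rows = list(rows)

    def filter_credible_set(self, flag):
        self.rows = [r for r in self.rows if r[flag]]
        return self


@dataclass(frozen=True)
class CopyingLocus:
    """Stand-in for the new behaviour: the filter returns a new object."""

    rows: tuple

    def filter_credible_set(self, flag):
        return CopyingLocus(tuple(r for r in self.rows if r[flag]))


def variants(locus):
    return [r["variant"] for r in locus.rows]


mutating = MutatingLocus(ROWS)
mut_95 = variants(mutating.filter_credible_set("is95"))
mut_99 = variants(mutating.filter_credible_set("is99"))  # runs on already-filtered rows

copying = CopyingLocus(ROWS)
copy_95 = variants(copying.filter_credible_set("is95"))
copy_99 = variants(copying.filter_credible_set("is99"))  # runs on the original rows

print(mut_95, mut_99)    # ['a'] ['a']
print(copy_95, copy_99)  # ['a'] ['a', 'b']
```

In the mutating version the 99% filter silently loses variant "b" because the 95% filter already removed it. A test that builds a fresh object for every call never exercises the second call, which is consistent with the tests passing before the fix.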