
Allow do interventions to reference intervened variable #219

Merged
merged 1 commit into pymc-devs:main on Aug 17, 2023

Conversation

ricardoV94

This is now possible:

import pymc as pm
from pymc_experimental.model_transform.conditioning import do

with pm.Model() as m:
    x = pm.Normal("x")

new_m = do(m, {x: x + 100})
assert pm.draw(new_m["do_x"] > 50)

This kind of replacement was already fine when targeting Deterministics (or other RVs), as in the following:

with pm.Model() as m:
    x = pm.Normal("x")
    det = pm.Deterministic("det", x)

new_m = do(m, {det: x + 100})
assert pm.draw(new_m["det"] > 50)

with pm.Model() as m:
    x = pm.Normal("x")
    y = pm.Normal("y", x)

new_m = do(m, {y: x + 100})
assert pm.draw(new_m["y"] > 50)

ricardoV94 added the enhancements (New feature or request) label on Jul 25, 2023
@juanitorduz left a comment

I do not know the complete details of the do-operator implementation, but this change makes sense from reading the code and tests :)

ricardoV94 commented Aug 2, 2023

Thanks for reviewing @juanitorduz

@twiecki wanna give a thumbs up/down?

twiecki previously approved these changes Aug 2, 2023

twiecki commented Aug 2, 2023

Big 👍

lucianopaz previously approved these changes Aug 2, 2023

@lucianopaz left a comment

Looks good to me. Were you getting duplicate names errors before? I wouldn't have expected that though. Did the error come at the point where the new model would be recompiled from the FunctionGraph?

ricardoV94 commented Aug 3, 2023

Looks good to me. Were you getting duplicate names errors before? I wouldn't have expected that though. Did the error come at the point where the new model would be recompiled from the FunctionGraph?

The error arises when we convert the fgraph back to a Model and try to register multiple variables (such as an RV and a Deterministic) with the same name. This happens here because we borrow the name of the variable being intervened on for the deterministic that represents the intervention.

Does that make sense?
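
For illustration, here's a minimal sketch of that clash (a hypothetical example, not code from this PR): registering two variables under one name in the same model raises an error in PyMC.

import pymc as pm

with pm.Model() as m:
    x = pm.Normal("x")
    # A second variable named "x" cannot be registered; this is the
    # same clash hit when converting the fgraph back to a Model if
    # the intervention deterministic reuses the intervened name.
    pm.Deterministic("x", x + 100)  # raises ValueError (duplicate name)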

ricardoV94 dismissed stale reviews from lucianopaz and twiecki via 8c32d44 on August 3, 2023
ricardoV94 force-pushed the allow_do_self_reference branch from 9a8aa7b to 8c32d44 on August 3, 2023
@lucianopaz left a comment

LGTM

ricardoV94 force-pushed the allow_do_self_reference branch from 8c32d44 to 14d4f2b on August 3, 2023
@drbenvincent left a comment

Just a check of what new_m = do(m, {x: x + 100}) does. This might help inform extra info for the do docstring.

My intuition of what this would do, under the principle of least surprise, would be:

  • Replace the RV x with a new RV but which has the +100 modifier.
  • Cut the incoming edges to x because that's a core part of what do does.

Any particular ideas about possible use-cases?

Is this just constrained to some kind of deterministic modification? What if someone comes along and does new_m = do(m, {x: x + z}), where z is another RV? If that is possible/permissible, then it might be worth adding a test.

ricardoV94 commented Aug 6, 2023

  • Replace the RV x with a new RV but which has the +100 modifier.
  • Cut the incoming edges to x because that's a core part of what do does.

No, that's not what happens. It keeps x and adds a deterministic x + 100 downstream. Every variable that depended on x now depends on this deterministic. Paths to x are preserved (there's no other way; what would you use as inputs for x otherwise?).

The PyMC do allows random interventions, not just constant ones.

I don't know about use cases, but it seems like a more powerful mechanism.
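
For example, here's a hedged sketch of a purely random intervention (an illustrative model; it assumes do accepts an arbitrary random expression as the intervention value, as the existing tests exercise):

import pymc as pm
from pymc_experimental.model_transform.conditioning import do

with pm.Model() as m:
    x = pm.Normal("x")
    y = pm.Normal("y", x)

# y is replaced by a draw from a shifted distribution, while x and
# the paths into it are preserved, as described above.
new_m = do(m, {y: pm.Normal.dist(100, 1)})
assert pm.draw(new_m["y"] > 50)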

ricardoV94 commented Aug 6, 2023

Is this just constrained to some kind of deterministic modification? What if someone comes along and does new_m = do(m, {x: x + z}) where z is another RV. If that is possible/permissible, then might be worth adding a test.

That's possible, and under the hood it is exactly the same as the test that was introduced. There are already tests for do interventions that reference other RVs: https://github.com/pymc-devs/pymc-experimental/blob/14d4f2bca8a838be3efa0176f1b0385d4c7e27f3/pymc_experimental/tests/model_transform/test_conditioning.py

What's new in this PR is not that the interventions can contain random variables, but that they can reference the original variable that is being intervened on.
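
A hedged sketch of the case asked about above, where the intervention references both the intervened variable and another RV (the "do_x" name follows the first example in the PR description):

import pymc as pm
from pymc_experimental.model_transform.conditioning import do

with pm.Model() as m:
    z = pm.Normal("z")
    x = pm.Normal("x")

# Self-referencing intervention that also pulls in another RV.
new_m = do(m, {x: x + z + 100})
assert pm.draw(new_m["do_x"] > 50)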

@drbenvincent

  • Replace the RV x with a new RV but which has the +100 modifier.
  • Cut the incoming edges to x because that's a core part of what do does.

No, that's not what happens. It keeps x and adds a deterministic x + 100 downstream. Every variable that depended on x now depends on this deterministic. Paths to x are preserved (there's no other way; what would you use as inputs for x otherwise?).

The PyMC do allows random interventions, not just constant ones.

So I think this might throw people, at least initially. The do-operator (from Pearl) is pretty well-defined, and it seems that this goes beyond that to do more generic graph surgery. I don't think I have a strong philosophical objection, but cutting incoming edges to the target node is a pretty major component of the do operator. So I guess we either need to make sure the docstrings are really clear about that, or have a different operator name for this particular manipulation of the graph.

@drbenvincent

What's new in this PR is not that the interventions can contain random variables, but that they can reference the original variable that is being intervened on.

Same point as above. The PyMC do operator is going beyond Pearl's do operator concept. Either that can be seen as more powerful, or it could be seen as confusing.

@ricardoV94

We could call the current one "replace", and "do" would call replace but only allow constants/shared variables?

CC @lucianopaz

@drbenvincent

Like dispatching? That would still involve the user calling do and not deleting incoming edges?

How about the user calls this new function when they want to do that, and if they try to achieve it with do they get an informative error message pointing them to the new function instead?

Could be worth jumping on a quick call to talk this through perhaps?

ricardoV94 commented Aug 10, 2023

Like dispatching? That would still involve the user calling do and not deleting incoming edges?

How about the user calls this new function when they want to do that, and if they try to achieve it with do they get an informative error message pointing them to the new function instead?

I mean we rename the current function do -> replace.
We then implement a new do function that calls the more flexible replace under the hood, but restricts the replacements to constants or other expressions that don't depend on random variables (when it fails, we can mention the option of using replace directly, as you suggested).

Either way I would do that in a separate PR from this one. WDYT?
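
To make that concrete, a hypothetical sketch of the restricted do (here replace stands for the proposed rename of the current function; none of this exists yet):

from pytensor.graph.basic import ancestors
from pytensor.tensor.random.op import RandomVariable

def do(model, interventions):
    # Hypothetical restricted do: reject intervention values that
    # depend on any random variable and point users at `replace`.
    for var, value in interventions.items():
        if getattr(value, "owner", None) is not None and any(
            a.owner is not None and isinstance(a.owner.op, RandomVariable)
            for a in ancestors([value])
        ):
            raise ValueError(
                f"Intervention on {var} depends on random variables; "
                "use `replace` for stochastic interventions."
            )
    return replace(model, interventions)  # the current, flexible do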

ricardoV94 commented Aug 10, 2023

I just did a random quick search for "Pearl stochastic do interventions" and this showed up first: https://ojs.aaai.org/index.php/AAAI/article/view/6567 (pdf can be found on Google scholar)

I don't know anything about the subject, but if this sort of stuff is interesting/valid, then the current "supercharged" do would be apt.

I imagine stuff like do({y: pm.math.switch(x > 0, 0, y)}) could be interesting (i.e., the intervention is conditional on data/posterior values of another variable). Maybe one group gets treatment and the other does not, based on some observed/inferred criteria.
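
A hedged sketch of that conditional intervention (the names and treatment rule are illustrative; it relies on the self-reference to y that this PR enables):

import pymc as pm
from pymc_experimental.model_transform.conditioning import do

with pm.Model() as m:
    x = pm.Normal("x")       # e.g., an eligibility score
    y = pm.Normal("y", x)    # outcome in the unintervened model

# Force y to 0 only where x > 0; elsewhere y keeps its original
# value, so the intervention is conditional on x.
new_m = do(m, {y: pm.math.switch(x > 0, 0, y)})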

@drbenvincent

That paper is pretty neat. I don't have the luxury of going through it in detail, but I guess it satisfies me that it's kosher to have a single do-operator which implements different kinds of interventions. The general concept of a stochastic do-operator isn't 100% novel to me, but I did perceive it as a categorically different thing. I'm happier now that it's not.

So I'm happy to withdraw any objection of this happening under the do-operator.

If you wanted to have different functions that are called behind the scenes (perhaps that improves testability?) then feel free. But from an API/user perspective I'm good with this. We obviously just need to document it clearly, with examples in pymc-examples for example ;)


@drbenvincent

Though they do talk about $\sigma$-calculus as distinct from do-calculus. But they don't propose a $\sigma$ operator.

@ricardoV94

Thanks @drbenvincent. I am curious whether we can write up a compelling example that showcases these forms of interventions.

ricardoV94 force-pushed the allow_do_self_reference branch from 14d4f2b to 98e13c9 on August 16, 2023
ricardoV94 merged commit 15c88e8 into pymc-devs:main on Aug 17, 2023
ricardoV94 deleted the allow_do_self_reference branch on September 21, 2023