Linear dataset functionality and parameters #1045

Open
priamai opened this issue Oct 11, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

priamai commented Oct 11, 2023

Example from here:

https://www.pywhy.org/dowhy/v0.9/example_notebooks/dowhy_estimation_methods.html

image

Are those 1000 samples individual units, e.g. patients?
Can we also generate multiple samples per unit? For example, a treatment tested over many days to measure the response.
I would then like to be able to say: 10 units x 10 days = 100 samples.

Why are the ATE, ATT, and ATC identical?

image

Since the W0 treatment is continuous, how does the system discriminate between the treated and the untreated?

How can we constrain the generation? For example, I want treatments and outcomes only in the positive range between 0 and 100.

priamai added the enhancement label Oct 11, 2023
amit-sharma (Member) commented

Are those 1000 samples individual units, e.g. patients?

Yes

Can we also generate multiple samples per unit? For example, a treatment tested over many days to measure the response. I would then like to be able to say: 10 units x 10 days = 100 samples.

This is not supported. It would be great if you could contribute such a dataset simulator.
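As a starting point, here is a minimal sketch of what such a simulator could look like. It is not part of DoWhy: `simulate_panel`, its parameters, and the coefficients are all hypothetical, and it only uses the Python standard library. Each unit gets one confounder value that stays fixed over time, and a binary treatment and an outcome are drawn for every day.

```python
import random


def simulate_panel(num_units=10, num_days=10, beta=10.0, seed=0):
    """Hypothetical helper (not a DoWhy function): repeated samples per unit."""
    rng = random.Random(seed)
    rows = []
    for unit in range(num_units):
        # Unit-level confounder, fixed across days (e.g. a patient's baseline).
        w0 = rng.gauss(0.0, 1.0)
        for day in range(num_days):
            # Daily binary treatment; a higher w0 makes treatment more likely.
            v0 = 1 if w0 + rng.gauss(0.0, 1.0) > 0 else 0
            # Linear outcome with a constant treatment effect beta.
            y = beta * v0 + 2.0 * w0 + rng.gauss(0.0, 1.0)
            rows.append({"unit": unit, "day": day, "W0": w0, "v0": v0, "y": y})
    return rows


rows = simulate_panel()
print(len(rows))  # 10 units x 10 days = 100 rows
```

Note that rows belonging to the same unit are not independent, so an estimator that assumes independent samples may understate the uncertainty on data like this.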

Why are the ATE, ATT, and ATC identical?

This is because the true effect is linear, so it is homogeneous across the entire population. It does not matter whether you compute the causal effect on everyone, only on the treated, or only on the untreated: it is the same effect.
With a different simulated dataset, for example one with heterogeneous effects, these quantities will differ.
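To make this concrete, here is a small standalone simulation (plain Python, not DoWhy code; the coefficients are made up for illustration). Since the data are simulated, each unit's true effect is known, so the three averages can be computed directly. With a constant effect they coincide exactly. When the effect depends on the confounder, which also drives treatment assignment, they separate.

```python
import random
from statistics import mean

rng = random.Random(0)
BETA = 10.0  # constant effect, as in a linear data-generating process

units = []
for _ in range(1000):
    w0 = rng.gauss(0.0, 1.0)                       # confounder
    v0 = 1 if w0 + rng.gauss(0.0, 1.0) > 0 else 0  # treatment depends on w0
    units.append((w0, v0))


def effects(individual_effect):
    """Return (ATE, ATT, ATC) computed from known per-unit effects."""
    everyone = [individual_effect(w) for w, _ in units]
    treated = [individual_effect(w) for w, v in units if v == 1]
    control = [individual_effect(w) for w, v in units if v == 0]
    return mean(everyone), mean(treated), mean(control)


homogeneous = effects(lambda w: BETA)              # same effect for every unit
heterogeneous = effects(lambda w: BETA + 5.0 * w)  # effect varies with w0

print(homogeneous)    # (10.0, 10.0, 10.0)
print(heterogeneous)  # ATT > ATE > ATC, because treated units have higher w0
```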

Since the W0 treatment is continuous, how does the system discriminate between the treated and the untreated?

The treatment is v0; W0 is a confounder. For linear treatments, the user has to specify the "treatment" and "control" values (usually 1 and 0, respectively).
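To illustrate what those two values mean for a continuous treatment, here is a sketch in plain Python. It is not DoWhy's implementation, and the data-generating coefficients are assumptions. It fits a linear regression of the outcome on v0 and W0 and reports the effect of moving v0 from a chosen control value to a chosen treatment value. In DoWhy itself, to my understanding, the corresponding arguments are `control_value` and `treatment_value` on `CausalModel.estimate_effect`.

```python
import random
from statistics import mean

rng = random.Random(0)
BETA = 10.0  # true effect of a one-unit increase in v0 (assumed here)

w0, v0, y = [], [], []
for _ in range(2000):
    w = rng.gauss(0.0, 1.0)            # confounder
    v = 0.5 * w + rng.gauss(0.0, 1.0)  # continuous treatment, influenced by w
    w0.append(w)
    v0.append(v)
    y.append(BETA * v + 2.0 * w + rng.gauss(0.0, 1.0))


def center(xs):
    m = mean(xs)
    return [x - m for x in xs]


def dot(a, b):
    return sum(a_i * b_i for a_i, b_i in zip(a, b))


# OLS of y on v0 and w0 (adjusting for the confounder), solved through the
# 2x2 normal equations on centered variables.
vc, wc, yc = center(v0), center(w0), center(y)
svv, sww, svw = dot(vc, vc), dot(wc, wc), dot(vc, wc)
svy, swy = dot(vc, yc), dot(wc, yc)
slope_v = (svy * sww - swy * svw) / (svv * sww - svw ** 2)


def effect(control_value, treatment_value):
    """Effect of moving the treatment from control_value to treatment_value."""
    return slope_v * (treatment_value - control_value)


print(effect(0, 1))  # close to BETA
print(effect(0, 5))  # five times the one-unit effect
```

So there is no split into "treated" and "untreated" units for a continuous treatment. The estimate is a contrast between two treatment levels that you pick.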

How can we constrain the generation? For example, I want treatments and outcomes only in the positive range between 0 and 100.

This is not possible with the current function. You would need to add a new function or modify this one.
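One possible workaround, sketched below, is to post-process a generated dataset by linearly rescaling the treatment and outcome columns into [0, 100]. This is only an illustration in plain Python with made-up data: `rescale` is a hypothetical helper, not a DoWhy function. A linear rescaling keeps the relationship linear, but the true effect has to be converted into the new units.

```python
import random


def rescale(values, low=0.0, high=100.0):
    """Hypothetical helper: min-max rescale values linearly into [low, high]."""
    lo, hi = min(values), max(values)
    scale = (high - low) / (hi - lo)
    # Clamp to guard against floating-point rounding at the endpoints.
    scaled = [min(high, max(low, low + (x - lo) * scale)) for x in values]
    return scaled, scale


rng = random.Random(0)
BETA = 10.0  # true effect in the original units (assumed for this sketch)
v0 = [rng.gauss(0.0, 1.0) for _ in range(1000)]
y = [BETA * v + rng.gauss(0.0, 1.0) for v in v0]

v0_scaled, v_scale = rescale(v0)
y_scaled, y_scale = rescale(y)

# In the rescaled data, one unit of rescaled treatment changes the rescaled
# outcome by BETA * y_scale / v_scale instead of BETA.
beta_scaled = BETA * y_scale / v_scale

print(min(v0_scaled), max(v0_scaled))
print(min(y_scaled), max(y_scaled))
print(beta_scaled)
```

Clipping or rejecting out-of-range samples would also enforce the range, but that changes the distribution and can break the linear relationship, so the true effect would no longer be a single known number.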
