Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Guidance for custom datasets #4

Open
nudro opened this issue Mar 18, 2021 · 1 comment
Open

Guidance for custom datasets #4

nudro opened this issue Mar 18, 2021 · 1 comment

Comments

@nudro
Copy link

nudro commented Mar 18, 2021

Thanks for releasing deepscm. I really enjoyed reading the paper, and appreciate the code. I see under deepscam/datasets there are a few scripts for the mnist and medical datasets. Do you have any guidance on the best way to prepare a custom dataset to train?

@pawni
Copy link
Collaborator

pawni commented Apr 6, 2021

Hi there and thanks for your interest in the paper and code.

Sorry for only responding now -- somehow I didn't get notified about this issue. Could you share some more information about what exactly your looking for?

In general, we have done little data preprocessing and the main pointers that I can think of straight away is to add transformations to the code that do the normalisation for you (see https://github.com/biomedia-mira/deepscm/blob/master/deepscm/experiments/morphomnist/base_experiment.py#L233-L237). Apart from this you will need some idea of how to model the causal graph.

You can find an example of integration of a custom dataset here as well: https://github.com/jcreinhold/counterfactualms

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants