
LightningDeprecationWarning: DataModule.setup has already been called #9943

Closed
Renthal opened this issue Oct 15, 2021 · 22 comments
Labels: bug (Something isn't working), deprecation (Includes a deprecation)

Comments

@Renthal commented Oct 15, 2021

As of v1.4, properties such as has_setup_fit have been deprecated in DataModule and are set to be removed in v1.6.
However, the docs on latest still say "If you need information from the dataset to build your model, then run prepare_data() and setup() manually"
and show the code:

dm = MNISTDataModule()
dm.prepare_data()
dm.setup(stage="fit")

model = Model(num_classes=dm.num_classes, width=dm.width, vocab=dm.vocab)
trainer.fit(model, dm)

dm.setup(stage="test")
trainer.test(datamodule=dm)

This leads to the following warning being raised:

LightningDeprecationWarning: DataModule.setup has already been called, so it will not be called again. In v1.6 this behavior will change to always call DataModule.setup.

What is the recommended way to do this without ending up calling setup() twice in v1.6?

@Programmer-RD-AI (Contributor)

Hi,
Can you send the code that you are using? I will check the warning.

With best regards,
Ranuga

@Programmer-RD-AI (Contributor)

Hi, can you send MNISTDataModule()?

With best regards,
Ranuga

@Renthal (Author) commented Oct 15, 2021

This is literally the code shown in the docs (see link above), and the module is described there.
Nevertheless, here is an MWE to reproduce (but then again, it is straight out of the docs):

import os
from typing import Optional

import torch
from pytorch_lightning import LightningModule, Trainer, LightningDataModule
from torch.utils.data import DataLoader, Dataset


class RandomDataset(Dataset):

    def __init__(self, size, length):
        self.len = length
        self.data = torch.randn(length, size)

    def __getitem__(self, index):
        return self.data[index]

    def __len__(self):
        return self.len


class RandomDataModule(LightningDataModule):

    def setup(self, stage: Optional[str] = None, **kwargs):
        if stage == 'fit' or stage is None:
            self.dataset_size = 32

        if stage == 'test' or stage is None:
            pass

    def train_dataloader(self):
        return DataLoader(RandomDataset(self.dataset_size, 64), batch_size=2)

    def test_dataloader(self):
        return DataLoader(RandomDataset(self.dataset_size, 64), batch_size=2)


class BoringModel(LightningModule):

    def __init__(self, hidden_neurons: int):
        super().__init__()
        self.layer = torch.nn.Linear(hidden_neurons, 2)
        self.save_hyperparameters()

    def forward(self, x):
        return self.layer(x)

    def training_step(self, batch, batch_idx):
        loss = self(batch).sum()
        self.log("train_loss", loss)
        return {"loss": loss}

    def test_step(self, batch, batch_idx):
        loss = self(batch).sum()
        self.log("test_loss", loss)

    def configure_optimizers(self):
        return torch.optim.SGD(self.layer.parameters(), lr=0.1)


def run():
    dm = RandomDataModule()
    dm.prepare_data()
    dm.setup(stage="fit")

    model = BoringModel(hidden_neurons=dm.dataset_size)

    trainer = Trainer(
        default_root_dir=os.getcwd(),
        limit_train_batches=10,
        limit_val_batches=10,
        num_sanity_val_steps=0,
        max_epochs=2,
    )
    trainer.fit(model, datamodule=dm)

    dm.setup(stage="test")
    trainer.test(model, datamodule=dm)

if __name__ == '__main__':
    run()

@Programmer-RD-AI (Contributor)

OK, thank you.

With best regards,
Ranuga

@Renthal (Author) commented Oct 15, 2021

Yes, this MWE gets those too, but before those are displayed there is:

LightningDeprecationWarning: DataModule.prepare_data has already been called, so it will not be called again. In v1.6 this behavior will change to always call DataModule.prepare_data.

LightningDeprecationWarning: DataModule.setup has already been called, so it will not be called again. In v1.6 this behavior will change to always call DataModule.setup.

What is your point?

@Programmer-RD-AI (Contributor)

> Yes, this MWE gets those too, but before those are displayed there is:
>
> LightningDeprecationWarning: DataModule.prepare_data has already been called, so it will not be called again. In v1.6 this behavior will change to always call DataModule.prepare_data.
>
> LightningDeprecationWarning: DataModule.setup has already been called, so it will not be called again. In v1.6 this behavior will change to always call DataModule.setup.
>
> What is your point?

I don't understand. What do you mean?

With best regards,
Ranuga

@Renthal (Author) commented Oct 15, 2021

When I posted that, there was another message with some warnings printed, which has now been deleted. I was referring to that. If it was posted by mistake, please ignore my last message.

@Programmer-RD-AI (Contributor)

> When I posted that, there was another message with some warnings printed, which has now been deleted. I was referring to that. If it was posted by mistake, please ignore my last message.

Oh OK, I understand.

I just fixed this warning for now; I will check the other warnings later.

Sorry for the misunderstanding.

With best regards,
Ranuga

@carmocca carmocca added this to the v1.4.x milestone Oct 15, 2021
@carmocca carmocca added bug Something isn't working deprecation Includes a deprecation labels Oct 15, 2021
@carmocca (Contributor)

Related to #9939

@Programmer-RD-AI (Contributor)

Does this pull request help with the error?
#9945

With best regards,
Ranuga

@carmocca (Contributor)

@Programmer-RD-AI Your PR is not updated on master. Additionally, I believe the deprecation only appears in the bug-fix branch, so the fix needs to be applied to it directly, not to master.

The branch is https://github.com/PyTorchLightning/pytorch-lightning/tree/release/1.4.x

@Programmer-RD-AI (Contributor)

Hi, I created a new pull request: #9953

With best regards,
Ranuga

@Programmer-RD-AI (Contributor)

Hi, #9970 is the new PR.

@awaelchli awaelchli modified the milestones: v1.4.x, 1.5.x Nov 3, 2021
@carmocca (Contributor) commented Mar 1, 2022

Hi! The recommendation here is that prepare_data or setup check whether they've been called already before running expensive work, for example, checking whether a directory already exists before downloading inside prepare_data.
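
A minimal sketch of that idea (self.data_dir and download_dataset are hypothetical placeholders, not Lightning API):

import os

def prepare_data(self):
    # Skip the expensive download if the data directory already exists.
    if not os.path.isdir(self.data_dir):
        download_dataset(self.data_dir)  # hypothetical download helper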

You could also set and check your own self.has_setup_fit attribute to know whether it has been called already. For example:

def setup(self, stage=None):
    if not self.has_setup_fit and stage == "fit":
        expensive_work()
        self.has_setup_fit = True

Hope that helps! Feel free to ask any further questions.

@carmocca carmocca closed this as completed Mar 1, 2022
@Renthal (Author) commented Mar 1, 2022

Hasn't has_setup_fit been deprecated since v1.4?

@carmocca (Contributor) commented Mar 1, 2022

What was deprecated is relying on the library setting and checking has_setup_fit automatically because some users needed to run it twice.

However, you can still replicate the behaviour as I described in my previous comment, although you might need to change the variable name to avoid the collision.
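
For example, a minimal sketch with a renamed flag (the attribute name _setup_fit_done and the expensive_work call are just illustrations):

def __init__(self):
    super().__init__()
    self._setup_fit_done = False

def setup(self, stage=None):
    if stage == "fit" and not self._setup_fit_done:
        expensive_work()
        self._setup_fit_done = True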

@chanshing

When I use the solution suggested by @carmocca I get:

LightningDeprecationWarning: DataModule property `has_setup_fit` was deprecated in v1.4 and will be removed in v1.6.

I'm using 1.5.10.

@ananthsub (Contributor) commented Mar 9, 2022

@chanshing you can do this within your data module to ensure your setup logic runs only once per stage, no matter how many times the Trainer calls setup:

from typing import Dict, Optional

def __init__(self, ...) -> None:
    super().__init__()
    self._already_called: Dict[str, bool] = {}
    for stage in ("fit", "validate", "test", "predict"):
        self._already_called[stage] = False

def setup(self, stage: Optional[str] = None) -> None:
    # Skip if setup already ran for this stage.
    if stage and self._already_called[stage]:
        return
    # do your logic here
    self._already_called[stage] = True

@mahieyin-rahmun

I also ran into this issue, since my model depends on the dimensions of the data to be initialized. I followed the documentation but ended up with the aforementioned warning. It seems counter-intuitive to deprecate this property when the workaround is literally doing the same thing but with a different name.

@carmocca (Contributor)

We propose a different name to avoid the name collision, as it would trigger the same deprecation warning you are trying to hide.

Once the deprecation path is removed, you will be able to use has_setup_fit if that's your preference going forward.

@Renthal (Author) commented Mar 15, 2022

I think the main point was: why is it being deprecated if the expectation is that we do the exact same thing ourselves?

@ananthsub (Contributor)

> I think the main point was: why is it being deprecated if the expectation is that we do the exact same thing ourselves?

See the issue which deprecated these: #7301
These properties were deprecated in order to support users who do need these methods to run every time fit / validate / etc. is called. Before, this was not possible, as they were hardcoded to run only once by the Trainer.
