Introduce ServableModuleValidator Callback #13614

tchaton · 2022-07-12T09:10:56Z

What does this PR do?

Context: Improve model mobility from training to serving. MLOps Engineers would argue that a production model shouldn't be trained and resources wasted if the model can't be served to provide value to customers. They would argue that the model should be unit tested as soon as possible to validate its conformity for its production usage. This PR investigates the addition of serving functionalities in PyTorch Lightning.

As a user, you would need to add the SanityServing callback to your Trainer and make your model subclass ServableModule.

The ServableModule requires 3 hooks to be implemented in order to fully describe how the model behaves when being served.

configure_payload: Returns an example of a payload object: Lightning can provide example payloads for images, text, videos, etc..
configure_serialization: Provide serialization / deserialization methods. Lightning can provide common data types.
serve_step: The logic to be performed when you have received a request.

from typing import Dict
import torch
from pytorch_lightning import seed_everything, Trainer
from pytorch_lightning.serve import ServableModuleValidator, ServableModule
from pytorch_lightning.demos.boring_classes import BoringModel

class ServableBoringModel(BoringModel, ServableModule):
    def configure_payload(self) -> ...:
        return {"body": {"x": list(range(32))}}

    def configure_serialization(self):
        class Tensor:
            @staticmethod
            def deserialize(x):
                return torch.tensor(x).float()

            @staticmethod
            def serialize(x):
                return x.numpy().tolist()

        return {"x": Tensor.deserialize}, {"output": Tensor.serialize}

    def serve_step(self, x: torch.Tensor) -> Dict[str, torch.Tensor]:
        return {"output": self.forward(x)}

trainer = Trainer(max_epochs=1, limit_train_batches=2, limit_val_batches=0, callbacks=[ServableModuleValidator()])
trainer.fit(ServableBoringModel())

The Ideal API would rely only on the type to apply the deserialization and serialization and tensor shape could be added:

from pydantic import BaseModel
from typing import TypedDict

class Tensor(BaseModel):
    @staticmethod
    def deserialize(x):
        return torch.tensor(x).float()

    @staticmethod
    def serialize(x):
        return x.numpy().tolist()

class OutputTensor(TypedDict):
    output: Tensor[2]

class ServableBoringModel(BoringModel, ServableModule):
    def configure_payload(self) -> ...:
        return {"body": {"x": list(range(32))}}

    def serve_step(self, x: Tensor[32]) -> OutputTensor:
        return {"output": self.forward(x)}

callback = ServableModuleValidator(
       optimization="trace|script|onxx|tensor_rt|...",
       server='fastapi|torch_serve|ml_server|sagemaker|triton"
)
trainer = Trainer(max_epochs=1, limit_train_batches=2, limit_val_batches=0, callbacks=[callback])
trainer.fit(ServableBoringModel())

Extra functionalities for sanity serving

Performance Testing. Latency and RPS
Apply serving check at start and end of quantization and pruning.
etc...

This would be even more impactful once https://github.com/pytorch/torchdynamo is available in PyTorch and models and their transforms served in pure python are further optimized.

Does your PR introduce any breaking changes? If yes, please list them.

Before submitting

Was this discussed/approved via a GitHub issue? (not for typos and docs)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you list all the breaking changes introduced by this pull request?
Did you update the CHANGELOG? (not for typos, docs, test updates, or minor internal changes/refactors)

PR review

Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet-list:

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

cc @Borda @tchaton @justusschock @awaelchli @carmocca @ananthsub @ninginthecloud @jjenniferdai @rohitgr7 @akihironitta

src/pytorch_lightning/callbacks/sanity_serving.py

tests/tests_pytorch/callbacks/test_sanity_serving.py

src/pytorch_lightning/callbacks/sanity_serving.py

zippeurfou · 2022-07-12T12:05:30Z

I am not sure I am a fan of the sanityServe to be a callback.
Here are a few reasons:

Serving and model training are two different things (separation of concern). You have multiple scenario where they are not tied as much as this (eg. serving as model composition, serving when you depend on different party)
Serving look at different performance metrics (eg. RPS, uptime...), you do care about theses benchmark. I am not sure tying it up to training step helps (this is not a now but how do I load test here? Or how would we help the users in the future do load testing?) which brings me to the next point.
Hardware / auto-scaling.. is important there. I am not sure we allow good separation there by putting it in a callback.

I am bringing this up because I think it's important SanityServe provides value otherwise no one will use it. However, to provide value we should think about what can it provide that you would not have in the Serve module.

zippeurfou · 2022-07-12T13:02:56Z

Spoke with @tchaton async and we synced on it.

tests/tests_pytorch/serve/test_servable_module_validator.py

src/pytorch_lightning/serve/servable_module_validator.py

Co-authored-by: Carlos Mocholí <[email protected]>

…/lightning into investigate_serving

examples/pl_servable_module/production.py

Co-authored-by: Carlos Mocholí <[email protected]>

adriangonz

This looks great @tchaton !

Following up from our brief discussion in Slack, just added a couple comments below. It would be great to hear your thoughts!

src/pytorch_lightning/serve/servable_module_validator.py

williamFalcon · 2022-07-14T01:24:51Z

the premise here is that every model needs to be served which is not true.

for example a model to fold proteins does not need “serving”… this is a model that runs once in a while to generate a new protein sequence for a lab to synthesize.

so, no, we can’t “force” every model to be “production ready.”.

however, if a team opts for forcing their particular research code to always be production ready, they should have a mechanism to enforce that behavior and can opt in.

src/pytorch_lightning/serve/servable_module.py

src/pytorch_lightning/serve/servable_module_validator.py

src/pytorch_lightning/serve/servable_module.py

tests/tests_pytorch/serve/test_servable_module_validator.py

src/pytorch_lightning/serve/servable_module.py

justusschock · 2022-07-14T13:13:20Z

@williamFalcon that is why this was designed to be a combination of an optional mixin + a callback instead of directly added to the lightning module and the trainer. So this is opt-in as it is currently implemented :)

Meaning that all the hooks for the class class ServableBoringModel(BoringModel, ServableModule): are coming from ServableModule, which is optional to use and the validation of these hooks only happens when adding the ServableModuleValidator as a callback :)

…/lightning into investigate_serving

awaelchli

unblock

…/lightning into investigate_serving

tchaton added 2 commits July 12, 2022 10:10

wip

2dc3c5f

wip

c951ab0

tchaton changed the title ~~Investigate Serving Callack~~ Investigate Sanity Serving Callback Jul 12, 2022

carmocca reviewed Jul 12, 2022

View reviewed changes

tchaton added 3 commits July 12, 2022 12:46

wip

478b31f

wip

c3a89f3

wip

ba1044d

tchaton marked this pull request as ready for review July 12, 2022 11:51

tchaton requested review from williamFalcon, Borda, SeanNaren, awaelchli, justusschock, kaushikb11 and rohitgr7 as code owners July 12, 2022 11:51

wip

a0857d2

tchaton requested a review from carmocca July 12, 2022 12:00

tchaton changed the title ~~Investigate Sanity Serving Callback~~ Investigate ServableModuleValidator Callback Jul 12, 2022

tchaton added 2 commits July 12, 2022 14:00

wip

aca6a16

wip

a26e888

carmocca reviewed Jul 12, 2022

View reviewed changes

tchaton and others added 7 commits July 12, 2022 14:22

Update tests/tests_pytorch/serve/test_servable_module_validator.py

5f239f2

Co-authored-by: Carlos Mocholí <[email protected]>

Update tests/tests_pytorch/serve/test_servable_module_validator.py

998c59d

Co-authored-by: Carlos Mocholí <[email protected]>

Update src/pytorch_lightning/serve/servable_module_validator.py

f383f43

Co-authored-by: Carlos Mocholí <[email protected]>

Update src/pytorch_lightning/serve/servable_module_validator.py

6e75a47

Co-authored-by: Carlos Mocholí <[email protected]>

Update src/pytorch_lightning/serve/servable_module_validator.py

5aaf663

Co-authored-by: Carlos Mocholí <[email protected]>

Merge branch 'master' into investigate_serving

5fbdacb

Typing improvements

85dc8af

tchaton added 6 commits July 13, 2022 14:35

update

c51282e

update

85de47c

update

25e99bf

Merge branch 'master' into investigate_serving

bc23928

update

56972b2

Merge branch 'investigate_serving' of https://github.com/Lightning-AI…

2656ac4

…/lightning into investigate_serving

carmocca reviewed Jul 13, 2022

View reviewed changes

examples/pl_servable_module/production.py Outdated Show resolved Hide resolved

Update examples/pl_servable_module/production.py

ad1c3a6

Co-authored-by: Carlos Mocholí <[email protected]>

adriangonz reviewed Jul 13, 2022

View reviewed changes

src/pytorch_lightning/serve/servable_module_validator.py Show resolved Hide resolved

src/pytorch_lightning/serve/servable_module_validator.py Show resolved Hide resolved

rohitgr7 approved these changes Jul 14, 2022

View reviewed changes

rohitgr7 reviewed Jul 14, 2022

View reviewed changes

src/pytorch_lightning/serve/servable_module.py Show resolved Hide resolved

Borda requested review from lantiga and justusschock July 14, 2022 22:47

justusschock approved these changes Jul 15, 2022

View reviewed changes

tchaton self-assigned this Jul 15, 2022

tchaton added 4 commits July 15, 2022 12:36

update

09187a3

Merge branch 'master' into investigate_serving

11ccd8d

Merge branch 'investigate_serving' of https://github.com/Lightning-AI…

0bf8531

…/lightning into investigate_serving

Merge branch 'master' into investigate_serving

0caa82a

awaelchli approved these changes Jul 15, 2022

View reviewed changes

carmocca added this to the app:0.6 milestone Jul 15, 2022

tchaton modified the milestones: app:0.6, pl:1.7 Jul 15, 2022

tchaton added 3 commits July 15, 2022 14:58

update

f8a3cf9

Merge branch 'master' into investigate_serving

49bf323

Merge branch 'investigate_serving' of https://github.com/Lightning-AI…

84b1674

…/lightning into investigate_serving

lexierule merged commit 5e26840 into master Jul 15, 2022

lexierule deleted the investigate_serving branch July 15, 2022 15:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce ServableModuleValidator Callback #13614

Introduce ServableModuleValidator Callback #13614

tchaton commented Jul 12, 2022 •

edited by github-actions bot

Loading

zippeurfou commented Jul 12, 2022

zippeurfou commented Jul 12, 2022

adriangonz left a comment

williamFalcon commented Jul 14, 2022 •

edited

Loading

justusschock commented Jul 14, 2022 •

edited

Loading

awaelchli left a comment

Introduce ServableModuleValidator Callback #13614

Introduce ServableModuleValidator Callback #13614

Conversation

tchaton commented Jul 12, 2022 • edited by github-actions bot Loading

What does this PR do?

Does your PR introduce any breaking changes? If yes, please list them.

Before submitting

PR review

Did you have fun?

zippeurfou commented Jul 12, 2022

zippeurfou commented Jul 12, 2022

adriangonz left a comment

Choose a reason for hiding this comment

williamFalcon commented Jul 14, 2022 • edited Loading

justusschock commented Jul 14, 2022 • edited Loading

awaelchli left a comment

Choose a reason for hiding this comment

tchaton commented Jul 12, 2022 •

edited by github-actions bot

Loading

williamFalcon commented Jul 14, 2022 •

edited

Loading

justusschock commented Jul 14, 2022 •

edited

Loading