Update evaluation and inference code to handle other precisions and models #179

coryMosaicML · 2024-10-22T04:41:31Z

Currently inference and eval code assumes amp_fp16 at inference which is no longer correct for some models. This PR makes this somewhat configurable, and fixes some precision bugs during inference associated with the the t5 text encoder.

Changes:

Update clean_fid_eval.py and evaluate.py to run using the image dataset, which allows the same dataset config to be used regardless of the model type. Note this is a config breaking change.
Add support for multiple remotes in the image dataset
Add option to specify precision in generate_geneval_images.py
Add a dtype option to the generic ModelInference class
Disable autocast in precomputed_text_latent_diffusion.py for the computation of text embeddings at inference, and also fix a minor sequence length bug.

corystephenson-db added 8 commits October 14, 2024 20:45

Add configurable dtypes for inference model

eaa393d

Configurable precision

4737452

No autocast when encoding text

a23db51

Switch to using a dataset instead of a dataloader

a6ae808

Image dataset can take sequence of remotes too

2b4af55

SDXL conditioning is now a flag

fc95831

Switch precomputed latents model to use dpmsolver++ for inference

68e6a5f

Formatting

ecf1c2f

coryMosaicML merged commit ba8ca02 into mosaicml:main Nov 14, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update evaluation and inference code to handle other precisions and models #179

Update evaluation and inference code to handle other precisions and models #179

coryMosaicML commented Oct 22, 2024

Update evaluation and inference code to handle other precisions and models #179

Update evaluation and inference code to handle other precisions and models #179

Conversation

coryMosaicML commented Oct 22, 2024