TCD Scheduler + LoRA IPAdapter SDXL #76
Conversation
```python
self.graph = None

def infer(self, feed_dict, stream, use_cuda_graph=False):
    # Filter inputs to only those the engine actually exposes to avoid binding errors
```
Not 100% sure about this
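The filtering described in the new comment can be sketched as a standalone helper (a hypothetical illustration, not the actual implementation — the function name and arguments are assumptions):

```python
def filter_feed_dict(feed_dict, engine_input_names):
    """Keep only tensors the engine actually exposes as input bindings,
    so extra or stale entries in feed_dict cannot trigger binding errors."""
    return {name: tensor for name, tensor in feed_dict.items()
            if name in engine_input_names}
```

Dropping unknown keys silently is a design choice; logging the dropped names once would make mismatches easier to debug.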
```diff
 diffusers==0.35.0
-transformers==4.56.0
+transformers==4.55.4
 peft==0.18.0
```
These dep versions changed for Windows support.
```diff
 # Create prefix (from wrapper.py lines 1005-1013)
-prefix = f"{base_name}--lcm_lora-{use_lcm_lora}--tiny_vae-{use_tiny_vae}--min_batch-{min_batch_size}--max_batch-{max_batch_size}"
+prefix = f"{base_name}--tiny_vae-{use_tiny_vae}--min_batch-{min_batch_size}--max_batch-{max_batch_size}"
```
This will cause engines to rebuild, so it's easiest to rename any engines you've already built by removing the lcm_lora-{use_lcm_lora}-- segment from their filenames.
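For anyone with prebuilt engines, that rename could be scripted along these lines (a hedged sketch: the naming pattern is taken from the prefix string above, and the flat directory layout is an assumption):

```python
import re
from pathlib import Path

def strip_lcm_segment(engine_dir):
    """Rename already-built engine files to match the new prefix scheme
    by dropping the 'lcm_lora-{True|False}--' segment from their names."""
    for path in Path(engine_dir).iterdir():
        new_name = re.sub(r"lcm_lora-(?:True|False)--", "", path.name)
        if new_name != path.name:
            path.rename(path.with_name(new_name))
```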
ty for the heads up on this
BuffMcBigHuge left a comment:
I have reviewed the changes and identified a critical regression in src/streamdiffusion/wrapper.py that likely causes the reported quality degradation.
The logic for backwards compatibility of use_lcm_lora checks if lora_dict is not None. However, lora_dict defaults to None. As a result, the LCM LoRA is never loaded for default configurations, leading to the lack of detail and sharpness (as the model runs in LCM mode without the required LoRA).
I also noted a change in src/streamdiffusion/pipeline.py regarding init_noise updates that might affect temporal behavior.
```python
# DEPRECATED: THIS WILL LOAD LCM_LORA IF USE_LCM_LORA IS TRUE
# Validate backwards compatibility LCM LoRA selection using proper model detection
if hasattr(self, 'use_lcm_lora') and self.use_lcm_lora is not None:
    if self.use_lcm_lora and not self.sd_turbo and lora_dict is not None:
```
There is a logic bug here. lora_dict is None by default in __init__. If the user does not provide a lora_dict, this condition lora_dict is not None will be False, and the LCM LoRA will NOT be added to the dictionary (and thus not loaded).
This means users running in default mode (without explicit lora_dict) will run without the LCM LoRA, causing the "lack of detail and sharpness" degradation observed.
It should probably be:

```python
if self.use_lcm_lora and not self.sd_turbo:
    if lora_dict is None:
        lora_dict = {}
    # ...
```
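A self-contained sketch of that corrected path (the default LoRA repo id and the weight of 1.0 are illustrative assumptions, not values from the PR):

```python
def resolve_lora_dict(use_lcm_lora, sd_turbo, lora_dict,
                      lcm_lora_id="latent-consistency/lcm-lora-sdxl"):
    """Ensure the LCM LoRA ends up in lora_dict even when the caller
    passed no lora_dict at all (the default-config case the bug breaks)."""
    if use_lcm_lora and not sd_turbo:
        if lora_dict is None:
            lora_dict = {}
        # Don't override a weight the user already chose for this LoRA.
        lora_dict.setdefault(lcm_lora_id, 1.0)
    return lora_dict
```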
```python
# Build latent batch for CFG
if self.guidance_scale > 1.0 and cfg_mode == "full":
    latent_with_uc = torch.cat([latent_model_input, latent_model_input], dim=0)
elif self.guidance_scale > 1.0 and cfg_mode == "initialize":
```
In the previous implementation, self.init_noise = x_t_latent was set here. Its removal changes the behavior of init_noise for subsequent frames (it remains static/random instead of carrying over the noisy latent). Was this removal intentional? It might affect temporal coherence or noise patterns.
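A toy illustration (plain Python, not StreamDiffusion code) of the behavioral difference being asked about — whether the stored init_noise tracks each frame's noisy latent or stays fixed at its initial value:

```python
import random

def run_frames(carry_over, frames=3, seed=0):
    """Toy model: with carry_over=True, init_noise is updated to each
    frame's noisy latent (the previous behavior); with carry_over=False,
    it keeps its initial random value (the behavior after the removal)."""
    random.seed(seed)
    init_noise = random.random()
    history = []
    for _ in range(frames):
        x_t_latent = 0.5 * init_noise + 0.5 * random.random()  # stand-in update
        if carry_over:
            init_noise = x_t_latent  # previously: self.init_noise = x_t_latent
        history.append(init_noise)
    return history
```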
Overview
This pull request introduces comprehensive support for customizable schedulers and samplers in StreamDiffusion, enabling more flexible and efficient diffusion processes. It builds on the existing pipeline to allow users to specify schedulers (e.g., LCM) and samplers (e.g., normal) via configuration, improving generation quality, speed, and compatibility with advanced features like ControlNet and IPAdapter. Additional enhancements include better LoRA handling to resolve conflicts, TensorRT engine optimizations for robustness, a quiet mode for cleaner logging, and minor UI/dependency updates.
The implementation refactors the core wrapper and pipeline to integrate scheduler/sampler logic dynamically, supports Trajectory Consistency Distillation (TCD) for ControlNet, and ensures backward compatibility with existing setups. This enables experimentation with different diffusion strategies without recompiling engines from scratch.
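As a rough illustration, the scheduler/sampler selection described above might look like this in a config (the keys `scheduler` and `sampler` come from the PR description; the surrounding structure is an assumption):

```python
# Hypothetical config fragment; only the two keys are taken from the PR text.
stream_config = {
    "scheduler": "tcd",   # or "lcm"
    "sampler": "normal",
}
```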
New Features

Scheduler and Sampler Integration:
- Schedulers and samplers can be selected via configuration (e.g., `scheduler: lcm`, `sampler: normal`), affecting timestep scaling and noise prediction.

Enhanced LoRA Handling:
- Engine names encode each LoRA as `--lora-{num}-{hash}`, allowing multiple LoRAs without path conflicts or invalid filenames.
- Deprecates `use_lcm_lora` in favor of `lora_dict` while remaining backwards compatible.

ControlNet Trajectory Consistency Distillation (TCD):

Quiet Mode for Uvicorn:
- Adds a `--quiet` flag to suppress INFO-level uvicorn logs (e.g., access logs), reducing noise during debugging and production runs.
- Configurable via environment variable (`QUIET=True`) or CLI, with logger adjustments in the realtime-img2img demo.

TensorRT Inference Improvements:

Dependencies