[Quantization] Add FP8 support for Wan 2.2 transformer and Qwen Image VAE/text encoder by lishunyang12 · Pull Request #1412 · vllm-project/vllm-omni

lishunyang12 · 2026-02-20T17:24:28Z

Summary

This PR extends FP8 quantization support to two additional model families:

Wan 2.2 transformer — Thread quant_config through all parallel linear layers (same pattern as Z-Image)
Qwen Image VAE & text encoder — Hook-based FP8 weight storage for HF-native layers (nn.Linear, Conv2d, Conv3d)

Subsumes #1414 (FP8 for Qwen Image VAE/encoder).

Wan 2.2 Changes

Wire quant_config from pipelines through all parallel linear layers in the Wan 2.2 video transformer, following the same pattern established by Z-Image (commit b7604ae).

File	Change
`wan2_2_transformer.py`	Add `quant_config` param to 6 classes (`ColumnParallelGELU`, `WanFeedForward`, `WanSelfAttention`, `WanCrossAttention`, `WanTransformerBlock`, `WanTransformer3DModel`) and pass to all `ColumnParallelLinear`, `RowParallelLinear`, `QKVParallelLinear` layers
`pipeline_wan2_2.py`	Extract `quant_config` via `get_vllm_quant_config_for_layers` and pass to both `transformer` and `transformer_2`
`pipeline_wan2_2_i2v.py`	Same wiring for I2V pipeline
`pipeline_wan2_2_ti2v.py`	Same wiring for TI2V pipeline
`text_to_video.py`	Add `--quantization` and `--ignored-layers` args
`image_to_video.py`	Add `--quantization` and `--ignored-layers` args

Not quantized (same as Z-Image pattern): DistributedRMSNorm, Attention, Conv3dLayer, nn.Linear (proj_out), FP32LayerNorm, embedding layers.

Qwen Image VAE/Encoder Changes

Add FP8 weight-only storage for Linear/Conv2d/Conv3d layers in the Qwen Image VAE and text encoder. Weights are stored in float8_e4m3fn with per-tensor scales and dequantized to BF16 before each forward pass — saving ~50% memory for these components.

File	Change
`models/utils.py`	New `apply_fp8_weight_storage()` utility — quantizes weights, registers forward pre/post hooks for dequant
`pipeline_qwen_image.py`	Apply FP8 storage after VAE/text_encoder load, mark params as loaded
`pipeline_qwen_image_edit.py`	Same pattern
`pipeline_qwen_image_edit_plus.py`	Same pattern

Wan 2.2 Test Results

T2V Pipeline (Wan2.2-T2V-A14B-Diffusers)

Environment: 1x GPU, 1280×720, 81 frames, 40 steps, seed=42

Config	Model Memory (GiB)	Generation Time (s)
BF16 (baseline)	64.46	892.8
FP8	38.18	828.0
FP8 + ignored_layers=proj_out	38.18	826.0

I2V Pipeline (Wan2.2-I2V-A14B-Diffusers)

Environment: 1x GPU, auto-resolution, 81 frames, 50 steps, seed=42

Config	Model Memory (GiB)	Generation Time (s)
BF16 (baseline)	64.46	301.1
FP8	38.18	264.6

FP8 reduces model memory by ~26 GiB (~41%) across both pipelines
T2V: ~7% faster, I2V: ~12% faster
Visual quality is comparable

Test plan

Lint/type check passes
Wan 2.2 T2V/I2V with --quantization fp8 works end-to-end
Without --quantization, behavior is identical (quant_config=None is no-op)
Qwen Image VAE/encoder FP8 weight storage works correctly
Pre-commit passing

wan22_fp8_quantized.mp4

wan22_fp8_ignored_layers.mp4

wan22_bf16_baseline.mp4

i2v_bf16_baseline.mp4

i2v_fp8_quantized.mp4

lishunyang12

I will submit test results later.

SamitHuang

this PR is clear. it should be ready to merge after checking the visual quality of quantization.

SamitHuang · 2026-02-23T02:37:09Z

+        type=str,
+        default=None,
+        choices=["fp8"],
+        help="Quantization method for the transformer. "


how about text encoder?

Hii, thanks for review.
The text encoder (UMT5) is not quantized here — same as
what Z-Image does. Only the diffusion transformer layers
get FP8. The text encoder is relatively small compared to
the transformer, so quantizing it has less impact on
memory while potentially hurting prompt embedding quality.
We could add text encoder quantization as a follow-up.

hsliuustc0106 · 2026-02-24T07:06:48Z

@vllm-omni-reviewer

github-actions · 2026-02-24T07:15:19Z

🤖 VLLM-Omni PR Review

Code Review: FP8 Quantization Support for Wan 2.2 Transformer

1. Overview

This PR adds FP8 quantization support for the Wan 2.2 video transformer by threading quant_config through all parallel linear layers. The implementation follows the established pattern from Z-Image (commit b7604ae), making it a well-structured, consistent addition to the codebase.

Overall Assessment: Positive - The changes are clean, consistent, and follow existing patterns. A few minor suggestions for robustness.

2. Code Quality

Strengths

Consistent pattern: The quant_config threading follows the exact same pattern across all 6 classes in the transformer
Good use of TYPE_CHECKING: Properly avoids circular imports with QuantizationConfig
Clear CLI help text: The argument descriptions are informative and include examples

Potential Issues

1. Inconsistent API usage in example scripts (text_to_video.py:167-173, image_to_video.py:151-157):

if args.quantization and ignored_layers:
    quant_kwargs["quantization_config"] = {
        "method": args.quantization,
        "ignored_layers": ignored_layers,
    }
elif args.quantization:
    quant_kwargs["quantization"] = args.quantization

This uses different keys (quantization_config vs quantization) depending on whether ignored_layers is provided. This could lead to confusion or bugs if Omni/OmniDiffusionConfig doesn't handle both cases identically. Consider unifying to always use one format.

2. Variable reference in print statement (image_to_video.py:189-190):

if ignored_layers:
    print(f"  Ignored layers: {ignored_layers}")

This correctly references ignored_layers which is defined at function scope, but the variable is computed before the quant_kwargs logic. This is fine, but the ordering could be clearer.

3. Architecture & Design

Strengths

Clean separation: Quantization config is extracted at the pipeline level and passed down through the model hierarchy
Non-invasive: When quant_config=None, the behavior is identical to before (no-op)
Comprehensive coverage: All three Wan 2.2 pipelines (T2V, I2V, TI2V) are updated consistently

Design Considerations

1. Layer exclusion pattern: The PR correctly notes that DistributedRMSNorm, Attention, Conv3dLayer, nn.Linear (proj_out), FP32LayerNorm, and embedding layers are not quantized. This matches the Z-Image pattern and is appropriate for maintaining numerical stability.

2. Missing proj_out quantization: The final output projection (proj_out) uses nn.Linear instead of RowParallelLinear. This is intentional per the PR description but worth verifying this doesn't create a bottleneck.

4. Security & Safety

No significant security concerns. The changes are purely additive and don't introduce new attack vectors.

Minor consideration: The ignored_layers argument accepts arbitrary strings without validation. Malformed patterns could lead to unexpected behavior, but this is a power-user feature and the risk is acceptable.

5. Testing & Documentation

Test Plan Assessment

The test plan in the PR description is adequate but could be more comprehensive:

Suggested additions:

Verify memory reduction with FP8 enabled (compare GPU memory usage)
Test with various --ignored-layers patterns
Verify output quality/consistency between FP8 and BF16

Documentation

Missing: No documentation updates for the new feature
Suggested: Add a brief section to any existing quantization docs or a docstring in the transformer classes explaining FP8 support

6. Specific Suggestions

`wan2_2_transformer.py`

Line 100-113: Consider adding a docstring to ColumnParallelGELU.__init__ explaining the quant_config parameter:

def __init__(
    self,
    dim_in: int,
    dim_out: int,
    *,
    approximate: str = "tanh",
    bias: bool = True,
    quant_config: "QuantizationConfig | None" = None,
):
    """Column parallel linear with GELU activation.
    
    Args:
        quant_config: Optional quantization config for FP8/other quantization methods.
    """

`text_to_video.py` and `image_to_video.py`

Lines 167-173 / 151-157: Consider unifying the API:

# Option A: Always use quantization_config dict
if args.quantization:
    quant_kwargs["quantization_config"] = {
        "method": args.quantization,
        **({"ignored_layers": ignored_layers} if ignored_layers else {}),
    }

# Option B: Always use separate kwargs
if args.quantization:
    quant_kwargs["quantization"] = args.quantization
    if ignored_layers:
        quant_kwargs["ignored_layers"] = ignored_layers

`pipeline_wan2_2.py`, `pipeline_wan2_2_i2v.py`, `pipeline_wan2_2_ti2v.py`

Line 275 / 218 / 181: The comment is good, but consider adding a log message for visibility:

quant_config = get_vllm_quant_config_for_layers(od_config.quantization_config)
if quant_config is not None:
    logger.info("Enabling quantization for transformer: %s", quant_config)

7. Approval Status

LGTM with suggestions

The PR is well-structured and follows established patterns. The core implementation is correct and consistent across all files. The suggestions above are minor improvements for robustness and maintainability, not blockers.

Recommended before merge:

Verify the quantization vs quantization_config API behavior in Omni class
Consider adding a log message when quantization is enabled
Add basic docstrings for the new quant_config parameters

The PR achieves its stated goal of enabling FP8 quantization for Wan 2.2 transformers with minimal, focused changes.

This review was generated automatically by the VLLM-Omni PR Reviewer Bot
using glm-5.

lishunyang12 · 2026-02-24T08:26:48Z

@hsliuustc0106 @SamitHuang Please help check it as i uploaded the test results. Thx

lishunyang12 · 2026-02-24T08:30:37Z

Regarding the GLM-5's suggestions:

Both quantization and quantization_config are valid — OmniDiffusionConfig.__post_init__ handles both paths and logs a warning on conflicts. The dict form is needed when passing ignored_layers.
There's already a log message at config creation time (__init__.py:85): Creating diffusion quantization config: fp8
The quant_config param follows the same pattern established in Z-Image and vLLM's existing parallel layers, so I'll skip the docstrings to keep the diff minimal.

hsliuustc0106 · 2026-02-24T13:12:48Z

This PR adds FP8 W8A8 quantization support for Wan 2.2 video transformer, enabling significant memory reduction on Ada/Hopper GPUs. The implementation follows the established Z-Image pattern consistently, threading quant_config through all 6 transformer classes and their parallel linear layers. The changes are well-structured, properly scoped (excluding text encoder and normalization layers as expected), and include comprehensive CLI support. The author has provided test results and addressed review feedback thoroughly.

hsliuustc0106 · 2026-02-24T13:12:51Z

 import math
 from collections.abc import Iterable
-from typing import Any
+from typing import TYPE_CHECKING, Any


Good use of TYPE_CHECKING to avoid runtime import overhead while maintaining type hints for QuantizationConfig. This keeps the quantization dependency optional at runtime.

hsliuustc0106 · 2026-02-24T13:12:53Z

 from vllm_omni.diffusion.model_loader.diffusers_loader import DiffusersPipelineLoader
 from vllm_omni.diffusion.models.schedulers import FlowUniPCMultistepScheduler
 from vllm_omni.diffusion.models.wan2_2.wan2_2_transformer import WanTransformer3DModel
+from vllm_omni.diffusion.quantization import get_vllm_quant_config_for_layers


Proper integration with the existing quantization infrastructure. get_vllm_quant_config_for_layers handles the ignored_layers filtering and config validation.

hsliuustc0106 · 2026-02-24T13:12:56Z

        if load_transformer_2:
            transformer_2_config = load_transformer_config(model, "transformer_2", local_files_only)
-            self.transformer_2 = create_transformer_from_config(transformer_2_config)
+            self.transformer_2 = create_transformer_from_config(transformer_2_config, quant_config=quant_config)


Important: both transformer and transformer_2 receive the same quant_config, ensuring consistent quantization across the dual-transformer architecture.

hsliuustc0106 · 2026-02-24T13:12:59Z

        help="Number of GPUs used for tensor parallelism (TP) inside the DiT.",
    )
+    parser.add_argument(
+        "--quantization",


The CLI interface is well-designed with clear help text. The --quantization and --ignored-layers args provide flexibility for users to experiment with different quantization strategies.

hsliuustc0106 · 2026-02-24T13:13:01Z

    # Check if profiling is requested via environment variable
    profiler_enabled = bool(os.getenv("VLLM_TORCH_PROFILER_DIR"))

+    # Build quantization kwargs


The quantization_config dict construction properly handles both the quantization method and ignored_layers, matching the OmniDiffusionConfig expectations.

hsliuustc0106 · 2026-02-24T13:15:07Z



-def create_transformer_from_config(config: dict) -> WanTransformer3DModel:
+def create_transformer_from_config(config: dict, quant_config=None) -> WanTransformer3DModel:


Missing type annotation for quant_config parameter. Should be quant_config: QuantizationConfig | None = None to match the pattern used in the transformer classes and maintain type safety.

Fixed in the latest push. Added TYPE_CHECKING import and QuantizationConfig type annotation to match the pattern in wan2_2_transformer.py:

# Before def create_transformer_from_config(config: dict, quant_config=None) -> WanTransformer3DModel: # After def create_transformer_from_config( config: dict, quant_config: "QuantizationConfig | None" = None ) -> WanTransformer3DModel:

hsliuustc0106 · 2026-02-24T13:15:11Z

examples/offline_inference/text_to_video/text_to_video.py:97

Critical: This PR adds quantization support but provides no test coverage. We need tests to verify:

FP8 quantization actually reduces memory usage
Output quality remains acceptable with quantization
Invalid quantization configs are handled gracefully
The ignored_layers parameter works correctly

Without tests, we can't validate the 'significant memory reduction' claim or prevent regressions.

hsliuustc0106 · 2026-02-24T13:15:13Z

Looking more critically at this PR, there are several concerns that should be addressed:

Missing Test Coverage: This adds a significant feature (FP8 quantization) with zero test coverage. We need tests to validate:

Memory reduction claims (before/after measurements)
Output quality with quantization vs without
Error handling for invalid configs
The ignored_layers functionality

Missing Performance Data: The PR claims "significant memory reduction" but the test results only show latency metrics, not actual memory usage. We need:

Peak memory usage comparison (FP8 vs BF16)
VRAM consumption measurements
Quality metrics (FID/CLIP scores) to ensure quantization doesn't degrade output

Type Safety: The create_transformer_from_config function has quant_config=None without type annotation, breaking the type safety pattern used elsewhere.

Documentation: No documentation added explaining:

When to use FP8 quantization
Expected memory savings
Quality trade-offs
How to use ignored_layers effectively

While the implementation follows the Z-Image pattern correctly, these gaps make it difficult to validate the feature works as intended and prevent future regressions.

lishunyang12 · 2026-02-24T13:21:22Z

examples/offline_inference/text_to_video/text_to_video.py:97

Critical: This PR adds quantization support but provides no test coverage. We need tests to verify:

FP8 quantization actually reduces memory usage

Output quality remains acceptable with quantization

Invalid quantization configs are handled gracefully

The ignored_layers parameter works correctly

Without tests, we can't validate the 'significant memory reduction' claim or prevent regressions.

emmm, i already added the test to validate the memory reduction and show output quality consistency. False negative.

lishunyang12 · 2026-02-24T13:24:11Z

@hsliuustc0106 lets just ignore AI comments as they are not valid. Doc should be provided in a separate PR.

hsliuustc0106

Summary

This PR adds FP8 quantization support to Wan 2.2 transformers by threading quant_config through all parallel linear layers. The implementation follows the established Z-Image pattern and includes comprehensive test results showing ~41% memory reduction with minimal quality impact.

Pros:

Clean, consistent implementation across all 6 transformer classes
Follows established Z-Image pattern (commit b7604ae)
Comprehensive test results with actual memory measurements and video outputs
Proper use of TYPE_CHECKING to avoid circular imports
Good CLI help text with examples

Cons:

Inconsistent API usage in example scripts (two different ways to pass quantization config)
Minor code duplication between text_to_video.py and image_to_video.py

Recommendation: Approve with minor suggestions for API consistency.

hsliuustc0106 · 2026-02-27T01:48:13Z

+        "Example: --ignored-layers 'to_qkv,to_out'",
+    )
    return parser.parse_args()



Issue: Inconsistent API usage

The code uses two different approaches depending on whether ignored_layers is provided:

With ignored_layers: quantization_config dict with method and ignored_layers

Without: Simple quantization string

This could be confusing. Consider unifying to always use the same format:

if args.quantization: quant_kwargs["quantization_config"] = { "method": args.quantization, **(({"ignored_layers": ignored_layers} if ignored_layers else {})) }

Or verify that Omni handles both formats identically.

hsliuustc0106 · 2026-02-27T01:48:13Z

+        "Example: --ignored-layers 'to_qkv,to_out'",
+    )
    return parser.parse_args()



Issue: Same inconsistent API usage

Same concern as in text_to_video.py - consider unifying the quantization config format.

hsliuustc0106 · 2026-02-27T01:48:14Z

@@ -28,6 +28,11 @@
    SequenceParallelOutput,
 )


Good practice: TYPE_CHECKING usage

Nice use of TYPE_CHECKING to avoid circular imports while maintaining type safety.

hsliuustc0106 · 2026-02-27T01:48:14Z

@@ -92,14 +97,23 @@ def forward(self, x: torch.Tensor) -> torch.Tensor:
 class ColumnParallelGELU(nn.Module):
    """Column parallel linear with GELU activation."""


Suggestion: Add docstring

Consider adding a brief docstring explaining the quant_config parameter:

def __init__( self, dim_in: int, dim_out: int, *, approximate: str = "tanh", bias: bool = True, quant_config: "QuantizationConfig | None" = None, ): """Column parallel linear with GELU activation. Args: quant_config: Optional quantization config for FP8/other methods. """

hsliuustc0106 · 2026-02-27T01:48:14Z

@@ -23,10 +23,16 @@
 from vllm_omni.diffusion.model_loader.diffusers_loader import DiffusersPipelineLoader
 from vllm_omni.diffusion.models.schedulers import FlowUniPCMultistepScheduler


Good: Consistent pattern

The quantization config extraction and threading follows the same pattern as Z-Image. This consistency makes the codebase easier to maintain.

hsliuustc0106 · 2026-02-27T01:48:14Z

 from vllm_omni.diffusion.models.schedulers import FlowUniPCMultistepScheduler
 from vllm_omni.diffusion.models.wan2_2.wan2_2_transformer import WanTransformer3DModel
+from vllm_omni.diffusion.quantization import get_vllm_quant_config_for_layers
 from vllm_omni.diffusion.request import OmniDiffusionRequest


Suggestion: Add logging

Consider adding a log message when quantization is enabled for better visibility:

quant_config = get_vllm_quant_config_for_layers(od_config.quantization_config) if quant_config is not None: logger.info("Enabling quantization for Wan 2.2 transformer: %s", quant_config)

hsliuustc0106 · 2026-03-12T12:49:17Z

Hi @lishunyang12 👋

This FP8 quantization PR hasn't been updated for 13 days. Is this still on your radar? Let us know if you need any support.

Thanks!

… VAE/text encoder Signed-off-by: lishunyang <lishunyang12@163.com>

lishunyang12 requested a review from hsliuustc0106 as a code owner February 20, 2026 17:24

lishunyang12 force-pushed the feat/fp8-quant-wan22 branch from 8d0734d to d66e358 Compare February 20, 2026 17:27

lishunyang12 mentioned this pull request Feb 20, 2026

[RFC] Q1 Quantization Support #1057

Closed

lishunyang12 commented Feb 21, 2026

View reviewed changes

SamitHuang reviewed Feb 23, 2026

View reviewed changes

SamitHuang mentioned this pull request Feb 23, 2026

[DO NOT MERGE THIS] [Doc] Add quantization documentation for diffusion models #1418

Closed

hsliuustc0106 reviewed Feb 24, 2026

View reviewed changes

lishunyang12 mentioned this pull request Feb 24, 2026

[Design] FP8 KV Quantization for Diffusion Attention: Design Rationale & Open Questions #1454

Open

hsliuustc0106 mentioned this pull request Feb 26, 2026

add support for MammothModa2 model #336

Merged

5 tasks

hsliuustc0106 reviewed Feb 27, 2026

View reviewed changes

wtomin mentioned this pull request Feb 27, 2026

[RFC]: Continuous Diffusion Model Acceleration Support #1217

Open

1 task

lishunyang12 mentioned this pull request Mar 1, 2026

[Benchmark] Add quantization quality benchmark script (LPIPS) #1575

Closed

4 tasks

Bounty-hunter mentioned this pull request Mar 3, 2026

[RFC]: Wan2.2 Optimization JiusiServe/vllm-omni#151

Closed

6 tasks

david6666666 mentioned this pull request Mar 4, 2026

[RFC]: v0.18.0 diffusion support JiusiServe/vllm-omni#160

Closed

10 tasks

lishunyang12 mentioned this pull request Mar 9, 2026

[RFC]: Unified Quantization Framework for all models/all platforms/all methods #1763

Closed

1 task

lishunyang12 mentioned this pull request Mar 12, 2026

[Core] Unified quantization framework #1764

Merged

lishunyang12 changed the title ~~[Quantization] Add FP8 quantization support for Wan 2.2 transformer~~ [Quantization] Add FP8 support for Wan 2.2 transformer and Qwen Image VAE/text encoder Mar 12, 2026

lishunyang12 mentioned this pull request Mar 12, 2026

[Quantization] Enable FP8 weight storage for Qwen Image VAE and text encoder #1414

Closed

3 tasks

lishunyang12 force-pushed the feat/fp8-quant-wan22 branch from ea18a8d to 71a9035 Compare March 12, 2026 14:55

lishunyang12 mentioned this pull request Mar 12, 2026

[RFC]: Continuous Quantization Support #1854

Open

lishunyang12 force-pushed the feat/fp8-quant-wan22 branch 2 times, most recently from ee61360 to b9edf6b Compare March 13, 2026 14:41

This was referenced Mar 17, 2026

[RFC]: Add FP8 quantization support for Wan 2.2 Transformer #1042

Closed

[RFC]: Extend FP8 Quantization to Text Encoders and VAE in Diffusion Models #1044

Open

[Quantization] Add FP8 support for Wan 2.2 transformer and Qwen Image…

23fed75

… VAE/text encoder Signed-off-by: lishunyang <lishunyang12@163.com>

lishunyang12 force-pushed the feat/fp8-quant-wan22 branch from b9edf6b to 23fed75 Compare March 19, 2026 15:47

lishunyang12 closed this Mar 20, 2026

ArtificialRay mentioned this pull request Apr 19, 2026

[Doc]: Is HunyuanVideo-1.5 really support fp8 dynamic quantization #2912

Open

1 task



		def create_transformer_from_config(config: dict) -> WanTransformer3DModel:
		def create_transformer_from_config(config: dict, quant_config=None) -> WanTransformer3DModel:

		@@ -92,14 +97,23 @@ def forward(self, x: torch.Tensor) -> torch.Tensor:
		class ColumnParallelGELU(nn.Module):
		"""Column parallel linear with GELU activation."""

		@@ -23,10 +23,16 @@
		from vllm_omni.diffusion.model_loader.diffusers_loader import DiffusersPipelineLoader
		from vllm_omni.diffusion.models.schedulers import FlowUniPCMultistepScheduler

Conversation

lishunyang12 commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Wan 2.2 Changes

Qwen Image VAE/Encoder Changes

Wan 2.2 Test Results

T2V Pipeline (Wan2.2-T2V-A14B-Diffusers)

I2V Pipeline (Wan2.2-I2V-A14B-Diffusers)

Test plan

Uh oh!

lishunyang12 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SamitHuang left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lishunyang12 Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hsliuustc0106 commented Feb 24, 2026

Uh oh!

github-actions Bot commented Feb 24, 2026

🤖 VLLM-Omni PR Review

Code Review: FP8 Quantization Support for Wan 2.2 Transformer

1. Overview

2. Code Quality

Strengths

Potential Issues

3. Architecture & Design

Strengths

Design Considerations

4. Security & Safety

5. Testing & Documentation

Test Plan Assessment

Documentation

6. Specific Suggestions

wan2_2_transformer.py

text_to_video.py and image_to_video.py

pipeline_wan2_2.py, pipeline_wan2_2_i2v.py, pipeline_wan2_2_ti2v.py

7. Approval Status

LGTM with suggestions

Uh oh!

lishunyang12 commented Feb 24, 2026

Uh oh!

lishunyang12 commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hsliuustc0106 commented Feb 24, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hsliuustc0106 commented Feb 24, 2026

Uh oh!

hsliuustc0106 commented Feb 24, 2026

Uh oh!

lishunyang12 commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lishunyang12 commented Feb 24, 2026

Uh oh!

hsliuustc0106 left a comment

Choose a reason for hiding this comment

Summary

lishunyang12 commented Feb 20, 2026 •

edited

Loading

lishunyang12 left a comment •

edited

Loading

lishunyang12 Feb 24, 2026 •

edited

Loading

`wan2_2_transformer.py`

`text_to_video.py` and `image_to_video.py`

`pipeline_wan2_2.py`, `pipeline_wan2_2_i2v.py`, `pipeline_wan2_2_ti2v.py`

lishunyang12 commented Feb 24, 2026 •

edited

Loading

lishunyang12 commented Feb 24, 2026 •

edited

Loading