revert: "perf: optimise DeferredComposite (#2150)" by alandtse · Pull Request #2232 · community-shaders/skyrim-community-shaders

alandtse · 2026-04-29T10:12:57Z

This reverts commit 7f64e55.

closes #2223

Summary by CodeRabbit

Performance & Optimization
- Refactored deferred rendering pipeline to use compute-based processing for improved efficiency.
- Optimized normal encoding with enhanced precision handling.
Refactor
- Updated shader architecture to streamline rendering pass implementation and reduce state management overhead.

This reverts commit 7f64e55.

coderabbitai · 2026-04-29T10:13:09Z

📝 Walkthrough

Walkthrough

This PR converts the deferred composite pipeline from a graphics-based approach (vertex/pixel shaders) to a compute shader architecture, refactors normal encoding from sqrt-based to octahedral schemes, updates dependent systems to reference the new normal-roughness render target, and removes obsolete state management utilities.

Changes

Cohort / File(s)	Summary
Normal Encoding Refactor `package/Shaders/Common/GBuffer.hlsli`, `package/Shaders/Tests/TestGBuffer.hlsl`	Replaces sqrt-based normal encoding with octahedral encoding via new `OctWrap` and updated `EncodeNormal`/`DecodeNormal`; adds `EncodeNormalVanilla` as alternate path; converts types from `float` to `half`; updates test suite with new test functions and half-precision tolerance adjustments.
Deferred Composite Pipeline Conversion `package/Shaders/DeferredCompositeCS.hlsl`, `package/Shaders/DeferredCompositeVS.hlsl`, `src/Deferred.cpp`, `src/Deferred.h`	Converts deferred composite from graphics pipeline (VS/PS, render targets, fixed-function states) to compute shader with `[numthreads(8,8,1)]` dispatch; reorganizes resource bindings to SRVs and UAVs; integrates stereo handling via `StereoOptModeTexture`; eliminates vertex shader entirely; updates shader caching and rendering calls.
Normal Texture Binding Updates `src/Features/ScreenSpaceGI.cpp`, `src/Features/SubsurfaceScattering.cpp`	Redirects normal/roughness texture SRV bindings from `globals::deferred->normalRoughnessRT` to `rts[NORMALROUGHNESS]` render-target index across SSGI and SSS compute passes.
Documentation & Cleanup `src/Features/VRStereoOptimizations.h`, `src/Utils/D3DStateBackup.h`	Updates Doxygen comment to reference compute shader instead of pixel shader; removes `D3DStateBackup` state-snapshot utility struct no longer needed by compute pipeline.

Sequence Diagram(s)

sequenceDiagram
    participant Deferred as Deferred::Render
    participant Dispatch as GPU Dispatch
    participant ComputeShader as DeferredCompositeCS
    participant SRVs as Input SRVs<br/>(Albedo, Normal, Depth, etc.)
    participant UAVs as Output UAVs<br/>(Main, Normals, Motion)

    Deferred->>Deferred: Bind SRVs (Albedo, NormalRoughness, Masks, Depth, etc.)
    Deferred->>Deferred: Bind StereoOptModeTexture (optional)
    Deferred->>Deferred: Bind UAVs (Main, NormalMask, MotionVectors)
    Deferred->>Dispatch: Dispatch(screen width/8, screen height/8, 1)
    Dispatch->>ComputeShader: Launch threads (dispatchID)
    ComputeShader->>SRVs: Sample per-pixel data (albedo, normal, depth)
    alt Depth == 1.0
        ComputeShader->>ComputeShader: Generate sky motion vectors
    else Standard
        ComputeShader->>ComputeShader: Compute standard lighting
    end
    ComputeShader->>ComputeShader: Decode octahedral normal
    ComputeShader->>ComputeShader: Sample SSGI, reflections, sky
    ComputeShader->>UAVs: Write Main (final color)
    ComputeShader->>UAVs: Write NormalTAAMaskSpecular
    ComputeShader->>UAVs: Write MotionVectors
    Deferred->>Deferred: Unbind compute resources

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

Sssr #2156 — Modifies DeferredCompositeCS.hlsl shader interface and SSRT/specular bindings alongside this compute-conversion refactor.
perf: optimise DeferredComposite #2150 — Directly conflicts/modifies the same normal encoding functions (Encode/Decode) in GBuffer.hlsli with an alternative analytic mapping.
perf: optimise ssgi normal #2189 — Alters ScreenSpaceGI normal texture handling in tandem with normal-roughness render-target changes.

Suggested reviewers

davo0411

Poem

🐰 From pixel shades we hop away,
To compute threads that light the day,
Octahedral normals encode so neat,
Dispatch by dispatch, our render's complete!
No vertices now—just UAVs write,
A shader reborn in parallel light! ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 9.09% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title accurately describes a reversion of commit `7f64e55` from PR `#2150`, matching the PR's stated objective to revert that specific optimization.
Linked Issues check	✅ Passed	The PR reverts changes from `#2150` to fix broken VR reflections (`#2223`). The code changes undo the compute-shader-based deferred compositing and restore prior implementations, directly addressing the reflection breakage.
Out of Scope Changes check	✅ Passed	All changes are directly related to reverting the deferred composite optimization. Modifications to shader files, C++ implementation, and header definitions all undo the compute-shader refactoring from `#2150`.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Review rate limit: 7/8 reviews remaining, refill in 7 minutes and 30 seconds.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-04-29T10:14:02Z

Actionable Suggestions

Subsurface Scattering (Alan Tse): Needs version bump to 3-0-2

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@package/Shaders/Tests/TestGBuffer.hlsl`:
- Around line 37-59: The tests currently only assert encoded range and decoded
unit length for the sample normals (testNormals) but miss verifying direction
fidelity; update the loop that uses GBuffer::EncodeNormal and
GBuffer::DecodeNormal to also compute the dot product between original and
decoded (e.g., dot(original, decoded)) and assert it exceeds a relaxed threshold
(choose ~0.90–0.97 depending on half precision tolerance) to catch
mirrored/reflected results; add this cosine/dot assertion alongside the existing
length check for variables original and decoded.

In `@src/Deferred.cpp`:
- Around line 356-373: The composite shader reads DepthTexture unguarded in
DeferredCompositeCS but srvs[16] leaves slot t4 null on builds without
dynamicCubemaps, causing bad depth reads; fix by binding the depth SRV into
srvs[4] unconditionally (use Util::GetCurrentSceneDepthSRV(true) for t4) instead
of only when dynamicCubemaps.loaded || REL::Module::IsVR(), and ensure the srvs
array still gracefully uses nullptr for optional textures (reflectance,
envTexture, skylighting, etc.) so resource lifetimes and DX11 binding remain
safe when features are absent.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 5b1b00ef-be67-4da7-9759-36c316185e2a

📥 Commits

Reviewing files that changed from the base of the PR and between 651426e and dc41444.

📒 Files selected for processing (10)

package/Shaders/Common/GBuffer.hlsli
package/Shaders/DeferredCompositeCS.hlsl
package/Shaders/DeferredCompositeVS.hlsl
package/Shaders/Tests/TestGBuffer.hlsl
src/Deferred.cpp
src/Deferred.h
src/Features/ScreenSpaceGI.cpp
src/Features/SubsurfaceScattering.cpp
src/Features/VRStereoOptimizations.h
src/Utils/D3DStateBackup.h

💤 Files with no reviewable changes (2)

package/Shaders/DeferredCompositeVS.hlsl
src/Utils/D3DStateBackup.h

coderabbitai · 2026-04-29T10:20:08Z

+	// Test behavioral properties of octahedral encoding (not exact numerical accuracy)
+	// Half precision + quantization means we check: valid output, normalized, reasonable direction
+	half3 testNormals[4] = {
+		normalize(half3(1.0h, 1.0h, 1.0h)),
+		normalize(half3(-1.0h, 1.0h, 1.0h)),
+		normalize(half3(1.0h, -1.0h, 1.0h)),
+		normalize(half3(1.0h, 1.0h, -1.0h))
 	};

 	for (int i = 0; i < 4; i++) {
-		float3 original = testNormals[i];
-		float2 encoded = GBuffer::EncodeNormal(original);
-		float3 decoded = GBuffer::DecodeNormal(encoded);
+		half3 original = testNormals[i];
+		half2 encoded = GBuffer::EncodeNormal(original);
+		half3 decoded = GBuffer::DecodeNormal(encoded);
+
+		// Check behavioral properties (relaxed for half precision quantization):
+		// 1. Encoded values are in valid range [0, 1]
+		ASSERT(IsTrue, encoded.x >= 0.0h && encoded.x <= 1.0h);
+		ASSERT(IsTrue, encoded.y >= 0.0h && encoded.y <= 1.0h);

-		float length = sqrt(decoded.x * decoded.x + decoded.y * decoded.y + decoded.z * decoded.z);
-		ASSERT(IsTrue, abs(length - 1.0) < 0.05);
+		// 2. Decoded normal is normalized (unit length)
+		half length = sqrt(decoded.x * decoded.x + decoded.y * decoded.y + decoded.z * decoded.z);
+		ASSERT(IsTrue, abs(length - 1.0h) < 0.02h);  // Relaxed tolerance for half precision
 	}


⚠️ Potential issue | 🟡 Minor

Keep a direction check for the diagonal roundtrip.

This now only proves “valid range + unit length.” A broken octahedral fold/sign path can still return a normalized but mirrored vector, so these angled cases would pass while reflections drift. Add a cosine/dot threshold against original.

🧪 Suggested assertion

for (int i = 0; i < 4; i++) { half3 original = testNormals[i]; half2 encoded = GBuffer::EncodeNormal(original); half3 decoded = GBuffer::DecodeNormal(encoded); // Check behavioral properties (relaxed for half precision quantization): // 1. Encoded values are in valid range [0, 1] ASSERT(IsTrue, encoded.x >= 0.0h && encoded.x <= 1.0h); ASSERT(IsTrue, encoded.y >= 0.0h && encoded.y <= 1.0h); + + // 1b. Decoded direction should still match the source normal closely. + ASSERT(IsTrue, dot(decoded, original) > 0.98h); // 2. Decoded normal is normalized (unit length) half length = sqrt(decoded.x * decoded.x + decoded.y * decoded.y + decoded.z * decoded.z); ASSERT(IsTrue, abs(length - 1.0h) < 0.02h); // Relaxed tolerance for half precision }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

// Test behavioral properties of octahedral encoding (not exact numerical accuracy)

// Half precision + quantization means we check: valid output, normalized, reasonable direction

half3 testNormals[4] = {

normalize(half3(1.0h, 1.0h, 1.0h)),

normalize(half3(-1.0h, 1.0h, 1.0h)),

normalize(half3(1.0h, -1.0h, 1.0h)),

normalize(half3(1.0h, 1.0h, -1.0h))

};

for (int i = 0; i < 4; i++) {

float3 original = testNormals[i];

float2 encoded = GBuffer::EncodeNormal(original);

float3 decoded = GBuffer::DecodeNormal(encoded);

half3 original = testNormals[i];

half2 encoded = GBuffer::EncodeNormal(original);

half3 decoded = GBuffer::DecodeNormal(encoded);

// Check behavioral properties (relaxed for half precision quantization):

// 1. Encoded values are in valid range [0, 1]

ASSERT(IsTrue, encoded.x >= 0.0h && encoded.x <= 1.0h);

ASSERT(IsTrue, encoded.y >= 0.0h && encoded.y <= 1.0h);

float length = sqrt(decoded.x * decoded.x + decoded.y * decoded.y + decoded.z * decoded.z);

ASSERT(IsTrue, abs(length - 1.0) < 0.05);

// 2. Decoded normal is normalized (unit length)

half length = sqrt(decoded.x * decoded.x + decoded.y * decoded.y + decoded.z * decoded.z);

ASSERT(IsTrue, abs(length - 1.0h) < 0.02h); // Relaxed tolerance for half precision

}

for (int i = 0; i < 4; i++) {

half3 original = testNormals[i];

half2 encoded = GBuffer::EncodeNormal(original);

half3 decoded = GBuffer::DecodeNormal(encoded);

// Check behavioral properties (relaxed for half precision quantization):

// 1. Encoded values are in valid range [0, 1]

ASSERT(IsTrue, encoded.x >= 0.0h && encoded.x <= 1.0h);

ASSERT(IsTrue, encoded.y >= 0.0h && encoded.y <= 1.0h);

// 1b. Decoded direction should still match the source normal closely.

ASSERT(IsTrue, dot(decoded, original) > 0.98h);

// 2. Decoded normal is normalized (unit length)

half length = sqrt(decoded.x * decoded.x + decoded.y * decoded.y + decoded.z * decoded.z);

ASSERT(IsTrue, abs(length - 1.0h) < 0.02h); // Relaxed tolerance for half precision

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@package/Shaders/Tests/TestGBuffer.hlsl` around lines 37 - 59, The tests currently only assert encoded range and decoded unit length for the sample normals (testNormals) but miss verifying direction fidelity; update the loop that uses GBuffer::EncodeNormal and GBuffer::DecodeNormal to also compute the dot product between original and decoded (e.g., dot(original, decoded)) and assert it exceeds a relaxed threshold (choose ~0.90–0.97 depending on half precision tolerance) to catch mirrored/reflected results; add this cosine/dot assertion alongside the existing length check for variables original and decoded.

github-actions · 2026-04-29T10:50:50Z

✅ A pre-release build is available for this PR:
Download

alandtse · 2026-04-30T02:41:52Z

the feature audit is spurious. Need to teach it about reverts.

…ommunity-shaders#2232)

revert: "perf: optimise DeferredComposite (community-shaders#2150)"

dc41444

This reverts commit 7f64e55.

coderabbitai Bot reviewed Apr 29, 2026

View reviewed changes

SkrubbySkrubInAShrub approved these changes Apr 29, 2026

View reviewed changes

alandtse merged commit 1e3b6fa into community-shaders:dev Apr 30, 2026
14 of 15 checks passed

alandtse mentioned this pull request Apr 30, 2026

fix: fix normals precision #2233

Closed

coderabbitai Bot mentioned this pull request Apr 30, 2026

fix: typed uav loads and add logging #2235

Merged

IgorAlanAlbuquerque pushed a commit to IgorAlanAlbuquerque/skyrim-community-shaders that referenced this pull request May 29, 2026

revert: "perf: optimise DeferredComposite (community-shaders#2150)" (c…

96efd3a

…ommunity-shaders#2232)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

revert: "perf: optimise DeferredComposite (#2150)"#2232

revert: "perf: optimise DeferredComposite (#2150)"#2232
alandtse merged 1 commit into
community-shaders:devfrom
alandtse:fix/revert-2150-fresh

alandtse commented Apr 29, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Apr 29, 2026 •

edited

Loading

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Apr 29, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Apr 29, 2026

Uh oh!

Uh oh!

github-actions Bot commented Apr 29, 2026

Uh oh!

alandtse commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alandtse commented Apr 29, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Apr 29, 2026

Actionable Suggestions

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented Apr 29, 2026

Uh oh!

alandtse commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alandtse commented Apr 29, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Apr 29, 2026 •

edited

Loading