Fix IR layout of 3-element vectors in cbuffers for -fvk-use-dx-layout by jhelferty-nv · Pull Request #7282 · shader-slang/slang

jhelferty-nv · 2025-05-29T23:00:21Z

Adhere better to the packing rules used by DXC with -fvk-use-dx-layout option.

The packing rules for D3D cbuffers are slightly different from how std140 and std430 handle packing, in particular around how they handle the alignment of vec3 types in structs. As an example, "struct { float a; float3 b; };" will get packed into 16 bytes in HLSL, while the std140 and std430 packing rules would align the float3 variable to an offset of 16. The front-end/AST already handled this correctly, but the IR layout code did not.

Also adds a documentation page for the GLSL target, which has some shortcomings relative to the SPIR-V target.

Fixes #6921.

Fixes shader-slang#6921 D3D cbuffers have slightly different packing rules that allow packing vectors into a 16-byte slot at element alignments, except when a field would cross a 16-byte boundary. In that case, we need to realign the field to the next 16-byte boundary. In particular, this impacts vec3s, which are not a power of two in size and thus require slightly different alignment logic, compared to std430 and std140. (Example: a float and float3 should fit together in that order in a single slot.) Also adds a test case.

This update introduces functions to determine if a struct or constant buffer requires scalar layout based on offset alignment rules. The GLSL source emitter now checks for scalar layout requirements when emitting parameter groups, ensuring proper alignment for various field types. Additionally, a new test case has been added to validate the changes in layout handling for constant buffers.

This update modifies the emitStructDeclarationsBlock function to include a new parameter, forceScalarOffsets, allowing for more precise control over struct field layout. The changes ensure that scalar offsets can be enforced when necessary, improving the handling of struct field attributes across various emitters (C-like, GLSL, HLSL, and WGSL). Additionally, adjustments were made to related function signatures to accommodate this new parameter, enhancing consistency and flexibility in struct emission.

jhelferty-nv · 2025-05-29T23:18:18Z

I think the SPIRV target is probably doing what it's supposed to at this point.

The GLSL target is the part that I'm still working my way through. In particular, would be good to get feedback on how best to handle what checkConstantBufferRequiresScalarLayout() is supposed to (but failing) to do. I don't really like this approach of iterating through the structs and re-checking the rules against std140 or std430. I think it might be better if I could apply a flag to the struct somehow in slang-ir-layout.cpp? It would be nice to get some guidance on whether that's the way to go, as well as how it might be accomplished.

Other issue is whether to just go ahead and promote everything that leaves a gap (vs std140/std430 rules) to either use layout(scalar) or the GL_ARB_enhanced_layouts extension (vs only doing it for stuff that needs scalar due to vec3 alignment). Forcing them to use those extensions would preserve the layout vs HLSL, but it might break things for any users relying on a 'broken' mapping of offsets from HLSL to GLSL. (I don't think I can justify adding padding variables in the IR for those without those features atm.)

kaizhangNV · 2025-05-30T03:51:38Z

source/slang/slang-ir-layout.cpp

            (int)(element.size * count),
            (int)(element.size * countForAlignment));
    }
+    virtual void adjustAlignmentForStructOffset(IRSizeAndAlignment& element, IRIntegerValue offset)


is offset the offset of the element in the struct?
The logic is not very easy to understand, can you make more comment to explain?

From my understand, what you are trying to do is that if the element is not cross the 16 byte boundary, do nothing.
And if it's cross the boundary, aligned with 16. So it means that there will be padding after the last element? If this is the case, then the offset for this element is changed, right?

I feel like this logic should be in the existing method adjustOffsetForNextAggregateMember.

The problem with putting it in adjustOffsetForNextAggregateMember is that we need to know the size of the next member in order to know whether or not to adjust the offset, and we won't know that until we iterate to the next field.

For the case of struct { float a; float3 b; }, at the point where we call adjustOffsetForNextAggregateMember we only know the offset and alignment of a, so we don't know the size of b yet and thus don't know whether or not the offset should be adjusted. For example, if b is a float4, it needs to be adjusted, but if it's a float3 it doesn't.

Hm, maybe you're suggesting I move adjustOffsetForNextAggregateMember up to where I've put adjustAlignmentForStructOffset and have it take the current element size as an additional argument. Let me give that a try.

Split test case in two, so we can check that the struct only uses layout(scalar) when necessary. Strips out additional alignment tests that aren't pertinent to 3-element vector offset calculation.

Consolidate constant buffer logic with the existing logic for adjusting offset for aggregate members.

When calculating offsets in the IR, take into account packOffset statements from HLSL. Necessary for SPIRV target to generate correct offset for any undecorated fields following the one with packOffset.

Revert changes to force GLSL to use scalar output. (Was incomplete)

Tests were originally named in a way that reflects the GLSL target. Changing names to reflect what the specific cases are instead of how a particular target interprets them.

Add a test for the case where a packoffset decoration leaves a gap. When calculating offsets in the IR, we need to take this into account or else we can end up assigning multiple items to the same offset.

This reverts commit 8077a94.

This change would need a matching one in the AST/front-end in order to work properly.

source/slang/slang-ir-layout.cpp

kaizhangNV · 2025-06-06T16:36:21Z

You need to take care of the Falcor test failure, but address the comment first, I think they might be related.

jhelferty-nv · 2025-06-09T22:30:30Z

I think the falcor failure might be spurious? Updating the branch to trigger another validation run.

kaizhangNV

LGTM.

source/slang/slang-ir-layout.h

jhelferty-nv · 2025-06-10T12:55:40Z

Just to add some commentary breadcrumbs, the patch fixes the SPIR-V target, but the GLSL target will still be broken for this case. GLSL uses a base alignment of 4N for a 3 component vector and the final value becomes padding, so it's not possible to directly translate this packing from D3D to GLSL. Fixing GLSL target for the general case would likely involve using a 4-element vector and then some reinterpret casts to extract the elements.

Vulkan has laxer rules on offsets and padding when the scalarBlockLayout feature is enabled, so it doesn't have these problems.

Regarding the packoffset interaction, which I tried to fix at one point, I ended up backing out my changes from the final patch. Fixing it would require changes to the frontend/AST processing to match, and it's not really the target of this change. I'll open a separate bug.

csyonghe · 2025-06-10T16:53:18Z

tests/expected-failure.txt

 tests/bugs/byte-address-buffer-interlocked-add-f32.slang (vk)
 tests/ir/loop-unroll-0.slang.1 (vk)
 tests/hlsl-intrinsic/texture/float-atomics.slang (vk)
+tests/hlsl/cbuffer-float3-offsets-aligned.slang.2 (vk)


Why do we add a test that is expected to fail? Any plan to fix this?

These succeed on Vulkan using the SPIRV target, but do not succeed on Vulkan using the GLSL target. Kai told me to add the tests here to disable error reporting for Vulkan with GLSL target. It sounds like that's not correct; will fix.

csyonghe · 2025-06-10T16:55:06Z

tests/hlsl/cbuffer-float3-offsets-aligned.slang

@@ -0,0 +1,115 @@
+//TEST:SIMPLE(filecheck=SPIRV): -target spirv -profile cs_6_2 -entry computeMain -line-directive-mode none -fvk-use-dx-layout
+//TEST(compute):COMPARE_COMPUTE_EX(filecheck-buffer=BUFFER):-slang -compute -dx12 -use-dxil -profile cs_6_2 -Xslang... -Xdxc -fvk-use-dx-layout -Xdxc -enable-16bit-types -X. -output-using-type
+//TEST(compute):COMPARE_COMPUTE_EX(filecheck-buffer=BUFFER):-slang -compute -vk -profile cs_6_2 -Xslang... -fvk-use-dx-layout -X. -output-using-type


If this test is failing under -emit-spirv-via-glsl path, add a -emit-spirv-directly option here to prevent the glsl test failure.

Thanks, will do.

…shader-slang#7282) * Better handling for 16-byte boundary of d3d cbuffer Fixes shader-slang#6921 D3D cbuffers have slightly different packing rules that allow packing vectors into a 16-byte slot at element alignments, except when a field would cross a 16-byte boundary. In that case, we need to realign the field to the next 16-byte boundary. In particular, this impacts vec3s, which are not a power of two in size and thus require slightly different alignment logic, compared to std430 and std140. (Example: a float and float3 should fit together in that order in a single slot.) Adds test cases. Adds documentation page for GLSL target

Results of these tests had been marked ignored, because they failed on VK with the GLSL backend. This change removes them from the expected-failure.txt file and adds the correct command line option to avoid using the GLSL target. Addresses concern raised on #7282

jhelferty-nv added 3 commits May 27, 2025 11:20

jhelferty-nv self-assigned this May 29, 2025

kaizhangNV reviewed May 30, 2025

View reviewed changes

jhelferty-nv added 15 commits May 30, 2025 19:27

Clean up test cases for float3

e4e1a0e

Split test case in two, so we can check that the struct only uses layout(scalar) when necessary. Strips out additional alignment tests that aren't pertinent to 3-element vector offset calculation.

Merge branch 'shader-slang:master' into feature/cbuffer-wrap-fix

31cf97c

Merge branch 'shader-slang:master' into feature/cbuffer-wrap-fix

82e7ea3

Revise adjustOffset logic

72c531b

Consolidate constant buffer logic with the existing logic for adjusting offset for aggregate members.

Take into account pack offset declarations

17cc9f4

When calculating offsets in the IR, take into account packOffset statements from HLSL. Necessary for SPIRV target to generate correct offset for any undecorated fields following the one with packOffset.

Revert changes for GLSL target support

fb397e4

Revert changes to force GLSL to use scalar output. (Was incomplete)

Add GLSL target documentation

c62aedf

Rename tests

435cef3

Tests were originally named in a way that reflects the GLSL target. Changing names to reflect what the specific cases are instead of how a particular target interprets them.

Adds compute tests to test cases, minor cleanups

77ff4a6

Update packoffset test

8077a94

Add a test for the case where a packoffset decoration leaves a gap. When calculating offsets in the IR, we need to take this into account or else we can end up assigning multiple items to the same offset.

Remove packoffset from tests, add reflection

aea2435

Revert "Update packoffset test"

7ea6e3f

This reverts commit 8077a94.

Revert packoffset use in IR layout calculation

caeb217

This change would need a matching one in the AST/front-end in order to work properly.

Update documentation

00701cb

Merge branch 'shader-slang:master' into feature/cbuffer-wrap-fix

039b932

jhelferty-nv added the pr: non-breaking PRs without breaking changes label Jun 5, 2025

jhelferty-nv added 3 commits June 5, 2025 18:13

Fix formatting

57e8a2c

docs: Update toc

f757149

Mark new tests as expected failures for VK via GLSL

42e4171

jhelferty-nv changed the title ~~WIP: Improve handling of cbuffer offset packing rules for -fvk-use-dx-layout~~ Fix IR layout of 3-element vectors in cbuffers for -fvk-use-dx-layout Jun 6, 2025

jhelferty-nv marked this pull request as ready for review June 6, 2025 04:41

jhelferty-nv requested a review from a team as a code owner June 6, 2025 04:41

Merge branch 'master' into feature/cbuffer-wrap-fix

c6fb662

kaizhangNV reviewed Jun 6, 2025

View reviewed changes

source/slang/slang-ir-layout.cpp Show resolved Hide resolved

Merge branch 'master' into feature/cbuffer-wrap-fix

ca0ee09

kaizhangNV previously approved these changes Jun 10, 2025

View reviewed changes

source/slang/slang-ir-layout.h Outdated Show resolved Hide resolved

Remove assert

edc77ff

jhelferty-nv dismissed kaizhangNV’s stale review via edc77ff June 10, 2025 12:31

jhelferty-nv enabled auto-merge (squash) June 10, 2025 13:01

Merge branch 'master' into feature/cbuffer-wrap-fix

42e2671

kaizhangNV approved these changes Jun 10, 2025

View reviewed changes

jhelferty-nv merged commit e372020 into shader-slang:master Jun 10, 2025
17 checks passed

csyonghe reviewed Jun 10, 2025

View reviewed changes

jhelferty-nv mentioned this pull request Jun 10, 2025

Enable some float3 cbuffer tests #7391

Merged

Conversation

jhelferty-nv commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhelferty-nv commented May 29, 2025

Uh oh!

kaizhangNV May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kaizhangNV May 30, 2025

Choose a reason for hiding this comment

Uh oh!

jhelferty-nv May 31, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kaizhangNV commented Jun 6, 2025

Uh oh!

jhelferty-nv commented Jun 9, 2025

Uh oh!

kaizhangNV left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jhelferty-nv commented Jun 10, 2025

Uh oh!

Uh oh!

csyonghe Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

jhelferty-nv Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

csyonghe Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

jhelferty-nv Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jhelferty-nv commented May 29, 2025 •

edited

Loading

kaizhangNV May 30, 2025 •

edited

Loading