Add checks against expanded pair bias by GMNGeoffrey · Pull Request #228 · aqlaboratory/openfold-3

GMNGeoffrey · 2026-05-21T00:06:57Z

Summary
The Triton kernel assumes that pair bias has size 1 in the N_seq dimension and does an implicit broadcast (`pair_bias.stride(1)` is not even passed to the kernel), but nothing actually checks this. That is why chunking is turned off when batch size > 1 and optimized kernels are in use: the chunker expands the implicit broadcast dimensions. This PR adds asserts for the assumption and documents the issue.
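As a rough illustration of the kind of check described above, here is a minimal sketch that works on plain shape tuples rather than tensors. The helper name `check_pair_bias_shape` and the assumed layouts (`q` as `(batch, n_seq, heads, n_res, c_hidden)`, pair bias as `(batch, 1, heads, n_res, n_res)`) are hypothetical and not taken from the openfold3 code; only the "size 1 in the second dim" condition comes from this PR.

```python
def check_pair_bias_shape(q_shape, pair_bias_shape):
    """Reject a pair bias that has been expanded along dim 1 (N_seq).

    Hypothetical helper operating on shape tuples. Assumed layouts:
      q:         (batch, n_seq, heads, n_res, c_hidden)
      pair_bias: (batch, 1,     heads, n_res, n_res)
    """
    assert len(pair_bias_shape) == 5, (
        f"expected a 5-D pair bias, got {pair_bias_shape}"
    )
    # The kernel broadcasts over dim 1 implicitly and never reads stride(1),
    # so an expanded bias must be rejected rather than silently misread.
    assert pair_bias_shape[1] == 1, (
        f"pair bias must have size 1 in dim 1, got {pair_bias_shape}"
    )
    assert pair_bias_shape[0] == q_shape[0], "batch size mismatch"
    assert pair_bias_shape[2] == q_shape[2], "head count mismatch"


q_shape = (2, 64, 4, 64, 32)

# Broadcastable bias: accepted.
check_pair_bias_shape(q_shape, (2, 1, 4, 64, 64))

# Bias expanded along N_seq: rejected.
try:
    check_pair_bias_shape(q_shape, (2, 64, 4, 64, 64))
    expanded_rejected = False
except AssertionError:
    expanded_rejected = True
```

Failing loudly here seems preferable to letting the kernel read an expanded bias with the wrong stride assumptions.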

Changes

Assert on expected bias shapes in Triton triangle attention.
Document why chunking gets turned off in `prediction_heads.py` when using optimized kernels.
Update the Triton kernel tests in `test_kernels.py` to avoid combining batch_size > 1 with chunking (previously this was only avoided for DeepSpeed). cuEquivariance has a workaround, so its tests still use the larger batch.
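To make the interaction between chunking and the broadcast dimension concrete, the sketch below models a chunker purely at the level of shapes. It is an assumption about how such a chunker might behave (inputs whose leading batch dims are all 1 are left alone, everything else is expanded to the common batch shape before slicing), not a copy of the openfold3 implementation; `chunker_batch_shapes` and the tensor layouts are hypothetical.

```python
def chunker_batch_shapes(shapes, no_batch_dims):
    """Return the shapes a broadcasting chunker would hand to the kernel.

    Hypothetical model: `shapes` maps input names to shape tuples, and the
    first `no_batch_dims` dims of each input are treated as batch dims.
    """
    # Common batch shape: elementwise max over all inputs.
    common = tuple(
        max(shape[i] for shape in shapes.values())
        for i in range(no_batch_dims)
    )
    out = {}
    for name, shape in shapes.items():
        batch, rest = shape[:no_batch_dims], shape[no_batch_dims:]
        if all(dim == 1 for dim in batch):
            out[name] = shape          # fully broadcastable: left untouched
        else:
            out[name] = common + rest  # remaining size-1 batch dims expanded
    return out


def example_shapes(batch_size):
    # Assumed layouts, with (batch, n_seq) as the two batch dims.
    return {
        "q": (batch_size, 64, 4, 64, 32),
        "pair_bias": (batch_size, 1, 4, 64, 64),
    }


single = chunker_batch_shapes(example_shapes(1), no_batch_dims=2)
batched = chunker_batch_shapes(example_shapes(2), no_batch_dims=2)

print(single["pair_bias"])   # dim 1 stays 1
print(batched["pair_bias"])  # dim 1 is expanded to n_seq
```

Under these assumptions the pair bias keeps size 1 in dim 1 when batch size is 1, but is expanded to the full N_seq when batch size is 2, which is exactly the case the new asserts reject.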

Related Issues

Related to Restoring chunking for batch_size > 1 with optimized kernels #206

Testing

Updated the tests to not use batch size > 1 (existing tests would have thrown)
I don't think a test checking for the assert is very useful

Other Notes
This, along with #226, contains the parts of #207 that I think we still want even if the approach in #213 is preferred overall. If we merge this, then I'll close #207 and iterate on #213 instead.

Assert on bias shapes in Triton triangle attention, notably that the second dim of pair bias has size 1. This was implicitly assumed, with `pair_bias.stride(1)` not even passed into the kernel. Document that this is why chunking gets turned off in `prediction_heads.py` when using optimized kernels: the current chunking expands the broadcast dimension. Update tests for all optimized kernels in `test_kernels.py` to avoid batch_size > 1 + chunking (currently only set for deepspeed).

GMNGeoffrey · 2026-05-21T00:07:24Z

@christinaflo PTAL

There's a workaround in place for the cueq kernel.

christinaflo

LGTM!

christinaflo added the safe-to-test label (internal-only label indicating PRs that are ready for automated CI testing) May 21, 2026

christinaflo reviewed May 21, 2026


Comment thread openfold3/tests/test_kernels.py Outdated

Restore batch_size > 1 for cueq kernel tests

d6a93c8

There's a workaround in place for the cueq kernel.

GMNGeoffrey requested a review from christinaflo May 21, 2026 04:33

christinaflo approved these changes May 28, 2026


christinaflo merged commit 4e1add7 into aqlaboratory:main May 28, 2026
2 checks passed

This was referenced May 28, 2026

Apply power-of-two chunking consistently #207

Closed

[BUG] AssertionError: sys.getrefcount(si_chunk) == 2 during predict with low_mem preset #239

Closed

christinaflo merged 2 commits into aqlaboratory:main from GMNGeoffrey:pair-bias-broadcast-asserts
