Add Tier 2 operator documentation guides #6

Merged: sunway513 merged 1 commit into main from docs/tier2-operator-guides on Feb 8, 2026

@sunway513
Owner

Summary

  • Add Causal Conv1D guide — prefill/decode, fused QKV split, speculative decoding, Gluon variants
  • Add Gated Delta Net guide — fused recurrent, chunk-based, sigmoid gating, GVA support
  • Add Grouped GEMM guide — GMM (Triton) and DeepGEMM (CK-tile) for MoE and variable-length workloads
  • Add Weight Shuffle & Preshuffle guide — layout transforms for CK/ASM/Triton GEMM backends
  • Add BERT Padding & Variable-Length guide — pad/unpad utilities for variable-length attention
  • Update README with new operator rows and Infrastructure & Tuning section

Note

This PR includes some README changes that overlap with PR #5 (Tier 1 infrastructure guides). Merge PR #5 first, then rebase this branch to resolve any conflicts.

Test plan

  • Verify all markdown links resolve correctly
  • Verify API examples match current source code
  • Verify source file and test file paths are accurate
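
The link check in the test plan could be scripted. The following is a minimal sketch, not the repository's actual tooling: the function name `find_broken_links` is hypothetical, it only handles inline `[text](target)` links, and it treats a relative link as valid if the target file exists on disk (anchors within a file are not validated).

```python
import re
from pathlib import Path

# Matches inline markdown links such as [text](target).
# Reference-style links and bare URLs are not covered by this sketch.
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")


def find_broken_links(markdown_text, base_dir):
    """Return relative link targets in markdown_text that do not exist under base_dir."""
    broken = []
    for target in LINK_RE.findall(markdown_text):
        # Skip external URLs and in-page anchors; only local paths are checked.
        if target.startswith(("http://", "https://", "mailto:", "#")):
            continue
        # Drop any "#fragment" so only the file path is tested.
        path_part = target.split("#", 1)[0]
        if not (Path(base_dir) / path_part).exists():
            broken.append(target)
    return broken
```

A reviewer might call this once per guide, passing the file's text and its parent directory, and treat any non-empty result as a failed check.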

🤖 Generated with Claude Code

Add five new documentation guides for specialized operators:
- Causal Conv1D — prefill/decode, fused QKV split, speculative decoding
- Gated Delta Net — fused recurrent, chunk-based, sigmoid gating variants
- Grouped GEMM — GMM (Triton) and DeepGEMM (CK) for MoE workloads
- Weight Shuffle & Preshuffle — layout transforms for CK/ASM/Triton GEMM
- BERT Padding — pad/unpad utilities for variable-length attention

Update README with new operator rows (Causal Conv1D, GDN, Grouped GEMM,
Sampling, Top-K, Communication) and Infrastructure & Tuning section.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

@sunway513 force-pushed the docs/tier2-operator-guides branch from f141c31 to 038df2f on February 8, 2026 at 01:41
@sunway513 merged commit 4c321ad into main on Feb 8, 2026
11 checks passed