Skip to content

[AMD] Add Claude skills for AMD CI workflows#8

Closed
michaelzhang-ai wants to merge 924 commits intomainfrom
amd/add-claude-skills
Closed

[AMD] Add Claude skills for AMD CI workflows#8
michaelzhang-ai wants to merge 924 commits intomainfrom
amd/add-claude-skills

Conversation

@michaelzhang-ai
Copy link
Owner

Summary

Add three Claude Code skills that encode AMD-specific development workflows for SGLang, complementing the existing upstream skills (add-jit-kernel, add-sgl-kernel, sglang-bisect-ci-regression, write-sglang-test).

New Skills

Skill Lines What it does
enable-amd-nightly-model 193 End-to-end workflow: HuggingFace architecture research → AMD backend selection (aiter/triton/NSA) → accuracy test files for MI30x + MI35x → CI workflow YAML updates (2 files × 3 edit locations) → documentation → local validation
fix-amd-ci-regression 157 Debug AMD nightly CI failures caused by upstream changes. Covers the common pattern where a PR updates triton_backend.py but misses aiter_backend.py, plus classification of error types and fix patterns
write-amd-nightly-test 161 Guide for writing AMD nightly accuracy/performance tests with correct CI registration (register_amd_ci), suite naming, MI30x/MI35x platform variants, and GSM8K benchmark patterns

Motivation

These workflows have been used repeatedly for AMD model enablement (MiniMax-M2.5, GLM-5, Kimi-K2.5, DeepSeek-V3.2, Qwen-3.5, etc.) and bug fixes (PR sgl-project#19113, PR sgl-project#19736). Formalizing them as skills allows AI agents to execute them consistently without needing to rediscover the patterns each time.

Design

  • Follows the exact same SKILL.md format as existing upstream skills
  • All files under 200 lines (well within the 500-line guideline)
  • References real test files and workflow YAML patterns from the codebase
  • Includes decision trees (backend selection), file templates, and checklists

Test plan

  • Verify SKILL.md files render correctly on GitHub
  • Verify skill descriptions are specific enough for agent discovery
  • Validate referenced file paths exist in the codebase

SoluMilken and others added 30 commits March 1, 2026 14:09
…gl-project#18442)

Co-authored-by: Zeyu Wang <zeyu.wang@yahooinc.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>
Co-authored-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>
Co-authored-by: Your Name <you@example.com>
Signed-off-by: Shangming Cai <csmthu@gmail.com>
Co-authored-by: yingluosanqian <yingluosanqian@gmail.com>
Co-authored-by: daiweitao <dwti614707404@163.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
…e_batch (sgl-project#19568)

Co-authored-by: vincent <vincent@vincentdeMacBook-Pro.local>
Co-authored-by: hnyls2002 <lsyincs@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Co-authored-by: Bingxu Chen <Bingxu.Chen@amd.com>
Add two Claude Code skills under .claude/skills/amd/ that encode
AMD-specific development workflows, complementing the existing
upstream skills:

- enable-amd-nightly-model: End-to-end workflow for enabling a new
  model in AMD nightly CI (architecture research, backend selection
  with auto-detection logic, test files for MI30x/MI35x, CI YAML
  updates, validation)
- write-amd-nightly-test: Guide for writing AMD nightly accuracy
  and performance tests covering all test patterns (standalone GSM8K,
  shared evaluator, LMEvalMixin, VLM MMMU, NightlyBenchmarkRunner),
  CI registration, runner labels, and platform variants

All backend choices, runner labels, suite names, and test patterns
verified against the current codebase.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.