Skip to content

Conversation

@motinwing
Copy link
Collaborator

@motinwing motinwing commented Dec 2, 2025

PR Category

[ Operator ]

Type of Change

[ New Feature ]

Description

Added three Triton operators required for DeepSeekV3.2 inference (Deepseek Sparse Attention, DSA), including bin_topk, indexer, and sparse_mla. The operators have been validated.

Issue

This PR addresses the lack of core operators required for DeepSeek-V3.2 in the operator library, thereby providing the necessary low-level support for inference with this model.

Progress

  • [√] Change is properly reviewed (1 reviewer required, 2 recommended).
  • [√] Change is responded to an issue.
  • [√] Change is fully covered by a UT.

Performance

Using the TileLang operator implementations as the benchmark, tests show that these Triton operators achieve over 80% of the benchmark's performance.

@CLAassistant
Copy link

CLAassistant commented Dec 2, 2025

CLA assistant check
All committers have signed the CLA.

@motinwing motinwing closed this by deleting the head repository Dec 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants