Skip to content

Add --miles-nsa-topk-backend#1058

Open
zianglih wants to merge 1 commit into
radixark:mainfrom
zianglih:nsa-topk
Open

Add --miles-nsa-topk-backend#1058
zianglih wants to merge 1 commit into
radixark:mainfrom
zianglih:nsa-topk

Conversation

@zianglih
Copy link
Copy Markdown
Contributor

@zianglih zianglih commented May 1, 2026

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a configurable backend for the Top-k operation in the Miles NSA indexer, allowing users to select between torch and flashinfer. The implementation includes a new command-line argument, propagation of the backend setting through the GLM5 model architecture, and the integration of FlashInfer's top-k logic within the indexer's forward pass. I have no feedback to provide.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant