Skip to content

Fix DeepSeek V3.2 indexer and weight loading#866

Merged
awni merged 1 commit intoml-explore:mainfrom
kernelpool:fix-deepseek_v32
Feb 10, 2026
Merged

Fix DeepSeek V3.2 indexer and weight loading#866
awni merged 1 commit intoml-explore:mainfrom
kernelpool:fix-deepseek_v32

Conversation

@kernelpool
Copy link
Copy Markdown
Contributor

@kernelpool kernelpool commented Feb 10, 2026

This fixes two issues:

  • A bug causing a NameError: name 'keys' is not defined when the indexer is triggered (sequences exceeding index_topk tokens).
  • A bug where the kv_b_proj absorption in sanitize used the wrong key prefix (note that deepseek_v3.py handles this correctly).

Copy link
Copy Markdown
Member

@awni awni left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for fixing that!!

@awni awni merged commit 8f1c56e into ml-explore:main Feb 10, 2026
2 checks passed
0xDaizz added a commit to 0xDaizz/mlx-lm-server that referenced this pull request Feb 11, 2026
- 645a326 Bump version for next release (ml-explore#865)
- 84ae19e Pythonic tool calling for LFM2 models (ml-explore#864)
- 8f1c56e Fix DeepSeek V3.2 indexer and weight loading (ml-explore#866)
- aca4c14 Make validation set optional in training process (ml-explore#857)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants