@bigximik
Contributor

✨ Description

Fix Qwen converter to correctly load QKV biases

🔍 Type of change

Select all that apply:

  • 🐛 Bug fix (non-breaking change that addresses a specific issue)
  • 🚀 New feature (non-breaking change that adds functionality)
  • ⚠️ Breaking change (a change that could affect existing functionality)
  • 📈 Performance improvement/optimization (improves speed, memory usage, or efficiency)
  • 🛠️ Code refactor (non-functional changes that improve code readability, structure, etc.)
  • 📦 Dependency bump (updates dependencies, including Dockerfile or package changes)
  • 📝 Documentation change (updates documentation, including new content or typo fixes)
  • 🔧 Infrastructure/Build change (affects build process, CI/CD, or dependencies)

@tscholak
Collaborator

Why are we only discovering this now?

@bigximik
Contributor Author

The tests for the Qwen model are marked as broken. I will enable and fix them before concluding this PR.

@bigximik bigximik marked this pull request as draft November 28, 2025 15:35
@bigximik bigximik marked this pull request as ready for review December 1, 2025 08:26
@bigximik bigximik marked this pull request as draft December 1, 2025 09:30
3 participants