It should be "vb.contains_tensor("lm_head.weight")"? #2443
Ah good point, do you want to make a PR with the change and test that it works well on your use case? If not I can take a stab at it.
ilookee pushed a commit to ilookee/candle that referenced this issue on Aug 23, 2024.
LaurentMazare pushed a commit that referenced this issue on Aug 23, 2024 (Co-authored-by: Yi Xu <[email protected]>).
Closing this now as it should be fixed by the associated PR.
EricLBuehler added a commit to EricLBuehler/candle that referenced this issue on Sep 11, 2024. The squashed commit message lists the changes pulled in:
- Bump the version to 0.6.1. (huggingface#2438)
- onnx: workaround pow with negative base (huggingface#2439). Rather than fully defining pow in the cpu backend (as in huggingface#2318), this implements a much smaller change which is sufficient to evaluate silero-vad onnx models: if pow is run with a 2.0 exponent, evaluate it simply as `x*x` instead of the cpu backend's `e^(2.0 * ln(x))`. Updated in the PR to use `Tensor::powf` instead, which correctly handles a negative base.
- onnx: support negative index in Gather (huggingface#2440). index_select does not support negative indexing, but this change adds just enough workarounds in onnx to allow evaluating silero-vad models (which make use of negative indices).
- silero-vad v5 example (huggingface#2321), renamed from 'vad' to 'silero-vad'; README.md updated. (Co-authored-by: Laurent Mazare <[email protected]>)
- Fix for parler-tts: do not add the last slice of padding tokens (huggingface#2442). Support for the mini model.
- Add FastViT model. (huggingface#2444)
- fix: qwen2 lm_head loading huggingface#2443 (huggingface#2445). (Co-authored-by: Yi Xu <[email protected]>)
- Update cudarc to 0.12 (huggingface#2451), with some cudnn tweaks.
- FastViT fixes (huggingface#2452): correct optional SE layer dimensions; head_dim instead of num_heads is 32; update test example output.
- MobileCLIP models S1 and S2 (huggingface#2454): allow loading images with given std and mean; OpenCLIP text encoder component; two MobileCLIP models; Clippy fixes. (Co-authored-by: Laurent <[email protected]>)
- Fix FLUX.1 weights (huggingface#2457): added flux1-dev.safetensors.
- Clippy fixes for 1.81.0 (huggingface#2461), plus another fix.
- Make `Error::msg` more in line with `anyhow::Error::msg`; add a `Context` trait; even more flexible; format.

Co-authored-by: Laurent Mazare <[email protected]>, shua <[email protected]>, Jani Monoses <[email protected]>, ilookee <[email protected]>, Yi Xu <[email protected]>, Eugene Hauptmann <[email protected]>
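The onnx pow workaround above rests on a numeric fact worth spelling out: evaluating `pow(x, p)` as `e^(p * ln(x))` produces NaN for negative bases because `ln` is undefined there, while `x * x` is well defined for any sign. A minimal, self-contained sketch in plain Rust (no candle dependency; the function names here are illustrative, not candle's API):

```rust
// Evaluating pow via e^(p * ln(x)): ln of a negative number is NaN,
// so the whole expression collapses to NaN for negative bases.
fn pow_via_exp_ln(x: f64, p: f64) -> f64 {
    (p * x.ln()).exp()
}

// For an exponent of exactly 2.0, plain multiplication is safe
// regardless of the sign of x -- this is the workaround's shortcut.
fn pow_squared(x: f64) -> f64 {
    x * x
}

fn main() {
    let x = -3.0;
    println!("e^(2 ln x) = {}", pow_via_exp_ln(x, 2.0)); // NaN
    println!("x * x      = {}", pow_squared(x)); // 9
}
```

This is also why the PR ultimately switched to `Tensor::powf`, which handles negative bases correctly rather than special-casing the exponent.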
candle/candle-transformers/src/models/qwen2.rs (line 364 in 2ec8729)

When I load Qwen2-7B, `vb.contains_tensor("lm_head")` returns false, and `vb.contains_tensor("lm_head.weight")` returns true.
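The behavior the reporter describes follows from how safetensors checkpoints name tensors: keys are full dotted paths such as `lm_head.weight`, not bare module prefixes. A minimal sketch in plain Rust, using a `HashMap` as a hypothetical stand-in for candle's VarBuilder (the struct and its `contains_tensor` method here mimic the call in the issue but are not candle's actual implementation):

```rust
use std::collections::HashMap;

// Hypothetical stand-in for candle's VarBuilder. Checkpoint keys are
// full dotted tensor paths ("lm_head.weight"), so a lookup on the bare
// module prefix ("lm_head") finds nothing.
struct MockVarBuilder {
    tensors: HashMap<String, Vec<f32>>,
}

impl MockVarBuilder {
    fn contains_tensor(&self, name: &str) -> bool {
        self.tensors.contains_key(name)
    }
}

fn main() {
    let mut tensors = HashMap::new();
    tensors.insert("lm_head.weight".to_string(), vec![0.0f32; 4]);
    let vb = MockVarBuilder { tensors };

    // The module prefix misses the tensor...
    assert!(!vb.contains_tensor("lm_head"));
    // ...while the full tensor path finds it, so the check deciding
    // whether to fall back to tied embeddings should use the full path.
    assert!(vb.contains_tensor("lm_head.weight"));
}
```

This is the one-line change the associated PR (huggingface#2445) made: keying the lm_head presence check off `"lm_head.weight"` instead of `"lm_head"`.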