MobileCLIP models S1 and S2 #2454

janimo · 2024-08-29T11:58:38Z

Generic OpenCLIP text encoder and a subset of the Apple MobileCLIP models.

LaurentMazare · 2024-08-29T13:39:03Z

Thanks!

* Bump the version to 0.6.1. (huggingface#2438) * onnx: workaround pow with negative base (huggingface#2439) * onnx: workaround pow with negative base rather than fully defining pow in the cpu backend (as in huggingface#2318), this implements a much smaller change which is sufficient to evaluate silero-vad onnx models. Specifically, checking if pow is run with 2.0 exponent, and if so evaluate as simply `x*x` instead of the cpu backend of `e^(2.0 * ln(x))`. * PR: use Tensor::powf insead powf correctly handles a negative base. * onnx: support negative index in Gather (huggingface#2440) index_select does not support negative indexing, but this change adds just enough workarounds in onnx to allow evaluating silero-vad models (which make use of negative indices). * silero-vad v5 example (huggingface#2321) * silero-vad v5 example This change adds an example of how to run silero-vad v5 * PR: rename 'vad' to 'silero-vad' * Update README.md --------- Co-authored-by: Laurent Mazare <[email protected]> * Fix for parler-tts, do not add the last slice of padding tokens. (huggingface#2442) * Fix for parler-tts, do not add the last slice of padding tokens. * Support for the mini model. * Add FastViT model. (huggingface#2444) * fix: qwen2 lm_head loading huggingface#2443 (huggingface#2445) Co-authored-by: Yi Xu <[email protected]> * Update cudarc to 0.12. (huggingface#2451) * Update cudarc to 0.12. * Some cudnn tweaks. * FastViT fixes. (huggingface#2452) * correct optional SE layer dimensions. * head_dim instead of num_heads is 32. * update test example output. * MobileCLIP models S1 and S2 (huggingface#2454) * Allow loading images with given std and mean * OpenCLIP text encoder component * Two MobileCLIP models * Clippy fixes. --------- Co-authored-by: Laurent <[email protected]> * Fix FLUX.1 weights (huggingface#2457) * fix FLUX.1 weights * added flux1-dev.safetensors * Clippy fixes for 1.81.0. (huggingface#2461) * Clippy fixes for 1.81.0. * Another fix. * Make Error::msg more in line with anyhow::Error::msg * Add context trait * Even more flexible * Format --------- Co-authored-by: Laurent Mazare <[email protected]> Co-authored-by: shua <[email protected]> Co-authored-by: Jani Monoses <[email protected]> Co-authored-by: ilookee <[email protected]> Co-authored-by: Yi Xu <[email protected]> Co-authored-by: Eugene Hauptmann <[email protected]>

janimo added 3 commits August 29, 2024 14:46

Allow loading images with given std and mean

430a88b

OpenCLIP text encoder component

156e28d

Two MobileCLIP models

cfa83ec

LaurentMazare approved these changes Aug 29, 2024

View reviewed changes

Clippy fixes.

a0aa849

LaurentMazare merged commit 86613c0 into huggingface:main Aug 29, 2024
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MobileCLIP models S1 and S2 #2454

MobileCLIP models S1 and S2 #2454

janimo commented Aug 29, 2024

LaurentMazare commented Aug 29, 2024

MobileCLIP models S1 and S2 #2454

MobileCLIP models S1 and S2 #2454

Conversation

janimo commented Aug 29, 2024

LaurentMazare commented Aug 29, 2024