System Info
HF Inference Endpoint with the custom container ghcr.io/huggingface/text-embeddings-inference:1.7.2, running Qwen/Qwen3-Embedding-4B with the task "sentence-embeddings" on a single AWS L4 instance.
Information
Tasks
Reproduction
Deploy an HF Inference Endpoint with Qwen/Qwen3-Embedding-4B using the ghcr.io/huggingface/text-embeddings-inference:1.7.2 image and make a test embedding request.
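For reference, a minimal sketch of the test request I mean, using TEI's `/embed` route. The endpoint URL and token below are placeholders for your own deployment:

```python
import json
import urllib.request

ENDPOINT_URL = "https://your-endpoint.endpoints.huggingface.cloud"  # placeholder


def build_embed_request(text: str, url: str, token: str) -> urllib.request.Request:
    """Build a POST request for TEI's /embed route."""
    return urllib.request.Request(
        url.rstrip("/") + "/embed",
        data=json.dumps({"inputs": text}).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",  # placeholder HF token
        },
    )


def embed(text: str, url: str = ENDPOINT_URL, token: str = "hf_xxx") -> list:
    """Send the request and return the first embedding vector."""
    with urllib.request.urlopen(build_embed_request(text, url, token)) as resp:
        return json.loads(resp.read())[0]


if __name__ == "__main__":
    vec = embed("hello world")
    print(len(vec))  # prints the embedding dimension returned by the endpoint
```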
Expected behavior
The embeddings I get back have dimension 1024, but this model should support up to 2560 dimensions, so I expected 2560-dimensional vectors by default.