I benchmarked computing sentence embeddings with the e5.py script.
Small model (intfloat/e5-small-v2), result:

Large model (BAAI/bge-large-zh-v1.5), result:

The pure Rust BERT example is still slower than the Hugging Face Transformers implementation when running the BAAI/bge-large-zh-v1.5 model.
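For context, E5- and BGE-style sentence embeddings are typically produced by pooling the encoder's token embeddings (mean pooling or the [CLS] token) and then L2-normalizing, so that a dot product between two embeddings gives cosine similarity. This is a minimal sketch of masked mean pooling plus normalization, using synthetic NumPy arrays in place of real model outputs (the actual pooling in e5.py may differ; this is an assumption about the usual recipe, not that script's code):

```python
import numpy as np

def mean_pool(hidden_states: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Average token embeddings over the sequence, ignoring padded positions."""
    mask = attention_mask[:, :, None].astype(hidden_states.dtype)  # (batch, seq, 1)
    summed = (hidden_states * mask).sum(axis=1)                    # (batch, dim)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)                 # (batch, 1)
    return summed / counts

def l2_normalize(embeddings: np.ndarray) -> np.ndarray:
    """Scale each embedding to unit length so dot product == cosine similarity."""
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    return embeddings / np.clip(norms, 1e-9, None)

# Synthetic stand-in for encoder output: batch of 2, seq len 4, hidden dim 3.
hidden = np.array([
    [[1.0, 0.0, 0.0], [3.0, 0.0, 0.0], [9.0, 9.0, 9.0], [9.0, 9.0, 9.0]],
    [[0.0, 2.0, 0.0], [0.0, 2.0, 0.0], [0.0, 2.0, 0.0], [0.0, 2.0, 0.0]],
])
# Last two tokens of the first sequence are padding and must not affect the mean.
mask = np.array([[1, 1, 0, 0], [1, 1, 1, 1]])

emb = l2_normalize(mean_pool(hidden, mask))
print(emb)  # → [[1. 0. 0.] [0. 1. 0.]]
```

When benchmarking the two implementations, this pooling step is a negligible fraction of the runtime; the gap between the Rust example and Transformers will almost entirely come from the encoder forward pass itself.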