is deepspeed inference now supporting llama2 replace_with_kernel_inject=True? #4290
Unanswered · liveforfun asked this question in Q&A
I tried to run inference on llama2-70b with the replace_with_kernel_inject=True option, but I got some errors. As far as I know, replace_with_kernel_inject=True injects DeepSpeed's high-performance kernels, but it seems this is not supported yet, right?

Replies: 1 comment

- Hi @liveforfun I am adding the support for this here: #4313
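For context, a minimal sketch of how kernel injection is typically requested through `deepspeed.init_inference` (this is not code from the thread; the model name and `tp_size` are placeholder assumptions, and actually running it requires DeepSpeed, transformers, and multiple CUDA GPUs):

```python
# Hypothetical usage sketch, assuming DeepSpeed and CUDA GPUs are available.
import torch
import deepspeed
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-70b-hf"  # placeholder checkpoint path
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)

# replace_with_kernel_inject=True asks DeepSpeed to swap supported transformer
# layers for its fused inference kernels. If the architecture is not yet
# covered by kernel injection, this call is where errors typically surface.
engine = deepspeed.init_inference(
    model,
    tensor_parallel={"tp_size": 8},  # assumption: shard the 70B model over 8 GPUs
    dtype=torch.float16,
    replace_with_kernel_inject=True,
)

inputs = tokenizer("Hello, world", return_tensors="pt").to("cuda")
outputs = engine.module.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

If injection fails for a given architecture, a common fallback is `replace_with_kernel_inject=False`, which keeps the original module implementation while still applying tensor parallelism.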