Recent changes to scheduler leading to error pre-allocated tensor in a buffer that cannot run the operation #10611

Answered by slaren
marty1885 asked this question in Q&A
I am not aware of any recent changes to this behavior, other than the occasional minor bug fix. It would be possible to copy the tensor to the CPU and copy it back after the operation, but that is not implemented. Until you have implemented the CPY operation, you can use -nkvo (--no-kv-offload) to avoid offloading the KV cache.
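To make the two options above concrete, here is a toy model of the decision, not ggml code. Every name in it (`Backend`, `Tensor`, `plan_op`, the `ACCEL` backend) is hypothetical and invented for illustration. It assumes a backend that supports MUL_MAT but not CPY, and shows (a) the copy-to-CPU-and-back fallback that the scheduler does not currently implement, and (b) why keeping the KV cache in a CPU buffer sidesteps the problem.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Backend:
    """Hypothetical stand-in for a compute backend and the ops it can run."""
    name: str
    supported_ops: frozenset

    def supports(self, op: str) -> bool:
        return op in self.supported_ops


@dataclass(frozen=True)
class Tensor:
    """Hypothetical tensor whose buffer was pre-allocated on `backend`."""
    name: str
    backend: Backend


def plan_op(op: str, tensor: Tensor, cpu: Backend) -> list:
    """Return the steps needed to run `op` on a pre-allocated tensor."""
    if tensor.backend.supports(op):
        # The buffer's own backend can run the op: nothing special to do.
        return [f"run {op} on {tensor.backend.name}"]
    # Fallback described in the answer (an assumption here, since it is
    # not implemented): round-trip the tensor through the CPU backend.
    return [
        f"copy {tensor.name} {tensor.backend.name} -> {cpu.name}",
        f"run {op} on {cpu.name}",
        f"copy {tensor.name} {cpu.name} -> {tensor.backend.name}",
    ]


if __name__ == "__main__":
    cpu = Backend("CPU", frozenset({"CPY", "MUL_MAT"}))
    accel = Backend("ACCEL", frozenset({"MUL_MAT"}))  # no CPY yet

    # KV cache offloaded: CPY would need the round trip.
    print(plan_op("CPY", Tensor("cache_k", accel), cpu))
    # KV cache kept on the host (the effect of -nkvo): CPY runs on the CPU.
    print(plan_op("CPY", Tensor("cache_k", cpu), cpu))
```

In this model the second case never hits the unsupported path, which is the point of the -nkvo workaround: the tensor is never pre-allocated in a buffer whose backend cannot run the operation.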

slaren Dec 1, 2024
Collaborator

Answer selected by marty1885
Category
Q&A
2 participants