🚀 The feature, motivation and pitch
Currently, setting enable_block_reuse to true in kv_cache_config will trigger an IMA when flashinfer is used as the attention backend in AutoDeploy
Alternatives
No response
Additional context
No response
Before submitting a new issue...