Anything you want to discuss about vllm.
Updated flashinfer to v0.0.9 in the following test scripts:
- Async Engine, Inputs, Utils, Worker Test
- Tensorizer, Metrics, Tracing Test
- Basic Correctness Test
- Core Test
- Distributed Tests (2 GPUs)
- Distributed Tests (4 GPUs)
- Kernels Test
- Models Test
- Vision Language Models Test
This update ensures compatibility with the latest flashinfer version.