Minimal example of deploying LLM with flashinfer APIs (e.g. through gpt-fast). #1811

Open

Labels

priority: should have (P1)

opened

on Sep 30, 2025

@yzh119 proposed this in #67. I'm pulling it into its own issue to track separately, so we can close that old large tracking issue.

Metadata

Assignees

No one assigned

Labels

priority: should have (P1)

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests