Similar PRs:
- https://github.com/sgl-project/sglang/pull/4693
- https://github.com/huggingface/transformers/pull/36878

We are most interested in the regular (non-MoE) Qwen model. Happy to provide additional details.

Edit: Now that the model is released, some additional details:
- It would be great to have the qwen3 model supported in the `quantize_and_export` workflow with fp8 precision.
- Support for qwen3 and qwen3_moe in the engine's modeling code.