[Bug] ROCm Qwen3 TVM FFI encountered a Segfault #3283

@Erxl

🐛 Bug

To Reproduce

Steps to reproduce the behavior:

  1. mlc_llm gen_config /mnt/mytmp/Qwen3-32B-abliterated/config.json --quantization q4f16_0 --model-type qwen3 --conv-template deepseek_r1_qwen --context-window-size 16384 -o qwen3-32b-abliterated-q4f16_0
  2. mlc_llm convert_weight /mnt/mytmp/Qwen3-32B-abliterated/config.json --quantization q4f16_0 --model-type qwen3 -o qwen3-32b-abliterated-q4f16_0
  3. mlc_llm serve qwen3-32b-abliterated-q4f16_0 --host 192.168.1.100 --port 8080 --mode server --device rocm --prefix-cache-mode radix

Switching the quantization to q4f16_0, q4f16_1, or q4f16_2 produces the same error in every case.
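To re-run the three steps for each quantization mode, the commands can be generated by a small script. This is a minimal sketch, not part of MLC-LLM: `repro_commands` is a hypothetical helper, the flags and paths are copied from the steps above, and the per-quantization output directory name is an assumption that follows the naming used in step 1. It only prints the commands and does not execute them.

```python
import shlex

# Values copied from the reproduction steps above.
MODEL_CONFIG = "/mnt/mytmp/Qwen3-32B-abliterated/config.json"
QUANTIZATIONS = ["q4f16_0", "q4f16_1", "q4f16_2"]


def repro_commands(quantization):
    """Return the three reproduction commands (as argument lists) for one quantization mode."""
    # Assumption: one output directory per quantization, named like the one in step 1.
    out_dir = f"qwen3-32b-abliterated-{quantization}"
    gen_config = [
        "mlc_llm", "gen_config", MODEL_CONFIG,
        "--quantization", quantization,
        "--model-type", "qwen3",
        "--conv-template", "deepseek_r1_qwen",
        "--context-window-size", "16384",
        "-o", out_dir,
    ]
    convert_weight = [
        "mlc_llm", "convert_weight", MODEL_CONFIG,
        "--quantization", quantization,
        "--model-type", "qwen3",
        "-o", out_dir,
    ]
    serve = [
        "mlc_llm", "serve", out_dir,
        "--host", "192.168.1.100",
        "--port", "8080",
        "--mode", "server",
        "--device", "rocm",
        "--prefix-cache-mode", "radix",
    ]
    return [gen_config, convert_weight, serve]


if __name__ == "__main__":
    # Print the commands so they can be reviewed or pasted into a shell.
    for quant in QUANTIZATIONS:
        for cmd in repro_commands(quant):
            print(shlex.join(cmd))
```

Each printed line can be pasted into a shell. The crash in this report happens during the JIT compile triggered by the `serve` step.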

Expected behavior

Environment

  • Platform (e.g. WebGPU/Vulkan/IOS/Android/CUDA): ROCm 6.2.0

  • Operating system (e.g. Ubuntu/Windows/MacOS/...): Ubuntu 22

  • Device (e.g. iPhone 12 Pro, PC+RTX 3090, ...): 7900xtx

  • How you installed MLC-LLM (conda, source): python -m pip install --pre -U -f https://mlc.ai/wheels mlc-llm-nightly-rocm62 mlc-ai-nightly-rocm62

  • How you installed TVM-Unity (pip, source):

  • Python version (e.g. 3.10): 3.11

  • GPU driver version (if applicable):

  • CUDA/cuDNN version (if applicable):

  • TVM Unity Hash Tag (python -c "import tvm; print('\n'.join(f'{k}: {v}' for k, v in tvm.support.libinfo().items()))", applicable if you compile models):
    BUILD_STATIC_RUNTIME: OFF
    BUILD_DUMMY_LIBTVM: OFF
    COMPILER_RT_PATH: 3rdparty/compiler-rt
    CUDA_VERSION: NOT-FOUND
    DLPACK_PATH: 3rdparty/dlpack/include
    DMLC_PATH: 3rdparty/dmlc-core/include
    GIT_COMMIT_HASH: 2d2d2ea7763b3cf5ed42cda79315103cc82d2309
    GIT_COMMIT_TIME: 2025-07-09 10:06:40 -0400
    HIDE_PRIVATE_SYMBOLS: ON
    INDEX_DEFAULT_I64: ON
    INSTALL_DEV: OFF
    LLVM_VERSION: 18.0.0git
    MLIR_VERSION: NOT-FOUND
    PICOJSON_PATH: 3rdparty/picojson
    RANG_PATH: 3rdparty/rang/include
    ROCM_PATH: /opt/rocm
    SUMMARIZE: OFF
    TVM_CXX_COMPILER_PATH: /opt/rh/gcc-toolset-11/root/usr/bin/c++
    USE_ALTERNATIVE_LINKER: AUTO
    USE_ARM_COMPUTE_LIB_GRAPH_EXECUTOR: OFF
    USE_ARM_COMPUTE_LIB: OFF
    USE_BLAS: none
    USE_BNNS: OFF
    USE_BYODT_POSIT: OFF
    USE_COREML: OFF
    USE_CPP_RPC: OFF
    USE_CPP_RTVM:
    USE_CUBLAS: OFF
    USE_CUDA: OFF
    USE_NVTX: OFF
    USE_NCCL: OFF
    USE_MSCCL: OFF
    USE_CUDNN: OFF
    USE_CUSTOM_LOGGING: OFF
    USE_CUTLASS: OFF
    USE_AMX: OFF
    USE_DNNL: OFF
    USE_FALLBACK_STL_MAP: OFF
    USE_GTEST: AUTO
    USE_HEXAGON: OFF
    USE_HEXAGON_RPC: OFF
    USE_HEXAGON_SDK: /path/to/sdk
    USE_HEXAGON_GTEST: /path/to/hexagon/gtest
    USE_HEXAGON_EXTERNAL_LIBS: OFF
    USE_IOS_RPC: OFF
    USE_KHRONOS_SPIRV: OFF
    USE_LIBBACKTRACE: AUTO
    USE_LIBTORCH: OFF
    USE_LLVM: /opt/rocm/llvm/bin/llvm-config --ignore-libllvm --link-static
    USE_MLIR: OFF
    USE_METAL: OFF
    USE_MIOPEN: OFF
    USE_MKL: OFF
    USE_MRVL: OFF
    USE_MSVC_MT: OFF
    USE_NNPACK: OFF
    USE_OPENCL: OFF
    USE_OPENCL_ENABLE_HOST_PTR: OFF
    USE_OPENCL_EXTN_QCOM: NOT-FOUND
    USE_OPENCL_GTEST: /path/to/opencl/gtest
    USE_OPENMP: OFF
    USE_PAPI: OFF
    USE_RANDOM: ON
    TVM_DEBUG_WITH_ABI_CHANGE: OFF
    TVM_LOG_BEFORE_THROW: OFF
    USE_ROCBLAS: OFF
    USE_HIPBLAS: ON
    USE_ROCM: ON
    USE_RCCL: /opt/rocm/
    USE_RPC: ON
    USE_RTTI: ON
    USE_RUST_EXT: OFF
    USE_SORT: ON
    USE_SPIRV_KHR_INTEGER_DOT_PRODUCT: OFF
    USE_TENSORFLOW_PATH: none
    USE_TENSORRT_CODEGEN: OFF
    USE_TENSORRT_RUNTIME: OFF
    USE_TFLITE: OFF
    USE_THREADS: ON
    USE_THRUST: OFF
    USE_CURAND: OFF
    USE_VULKAN: ON
    USE_CLML: OFF
    TVM_CLML_VERSION:
    USE_CLML_GRAPH_EXECUTOR: OFF
    USE_UMA: OFF
    USE_MSC: OFF
    USE_CCACHE: AUTO
    USE_NVSHMEM: OFF
    USE_NNAPI_CODEGEN: OFF
    USE_NNAPI_RUNTIME: OFF
    BACKTRACE_ON_SEGFAULT: OFF

  • Any other relevant information:

Additional context

[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:0
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:1
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:2
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:3
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:4
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:5
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:6
[2025-07-18 12:06:11] INFO auto_device.py:79: Found device: rocm:7
[2025-07-18 12:06:11] INFO jit.py:43: MLC_JIT_POLICY = ON. Can be one of: ON, OFF, REDO, READONLY
[2025-07-18 12:06:11] INFO jit.py:118: Compiling using commands below:
[2025-07-18 12:06:11] INFO jit.py:119: /home/a/miniconda3/envs/mlcllm/bin/python -m mlc_llm compile qwen3-32b-abliterated-q4f16_0 --opt 'flashinfer=1;cublas_gemm=1;faster_transformer=0;cudagraph=1;cutlass=1;ipc_allreduce_strategy=NONE' --overrides '' --device rocm:0 --output /tmp/tmpby858nrj/lib.so
[2025-07-18 12:06:13] INFO auto_config.py:70: Found model configuration: qwen3-32b-abliterated-q4f16_0/mlc-chat-config.json
[2025-07-18 12:06:13] INFO auto_target.py:91: Detecting target device: rocm:0
[2025-07-18 12:06:14] INFO auto_target.py:93: Found target: {'kind': 'rocm', 'tag': '', 'keys': ['rocm', 'gpu'], 'mcpu': 'gfx1100', 'thread_warp_size': 32, 'max_threads_per_block': 1024, 'max_shared_memory_per_block': 65536, 'max_num_threads': 256, 'mtriple': 'amdgcn-amd-amdhsa-hcc'}
[2025-07-18 12:06:14] INFO auto_target.py:110: Found host LLVM triple: x86_64-unknown-linux-gnu
[2025-07-18 12:06:14] INFO auto_target.py:111: Found host LLVM CPU: znver3
[2025-07-18 12:06:14] INFO auto_config.py:154: Found model type: qwen3. Use --model-type to override.
Compiling with arguments:
--config Qwen3Config(hidden_act='silu', hidden_size=5120, intermediate_size=25600, attention_bias=False, num_attention_heads=64, num_hidden_layers=64, num_key_value_heads=8, rms_norm_eps=1e-06, rope_theta=1000000, vocab_size=151936, tie_word_embeddings=False, context_window_size=16384, prefill_chunk_size=2048, tensor_parallel_shards=1, head_dim=128, dtype='float32', max_batch_size=128, weight_block_size=None, kwargs={})
--quantization GroupQuantize(name='q4f16_0', kind='group-quant', group_size=32, quantize_dtype='int4', storage_dtype='uint32', model_dtype='float16', linear_weight_layout='KN', quantize_embedding=True, quantize_final_fc=True, num_elem_per_storage=8, num_storage_per_group=4, max_int_value=7, tensor_parallel_shards=0)
--model-type qwen3
--target {'kind': 'rocm', 'tag': '', 'keys': ['rocm', 'gpu'], 'host': {'kind': 'llvm', 'tag': '', 'keys': ['cpu'], 'mtriple': 'x86_64-unknown-linux-gnu', 'mcpu': 'znver3'}, 'libs': ['thrust', 'rocblas', 'miopen', 'hipblas'], 'mtriple': 'amdgcn-amd-amdhsa-hcc', 'max_num_threads': 256, 'max_shared_memory_per_block': 65536, 'max_threads_per_block': 1024, 'thread_warp_size': 32, 'mcpu': 'gfx1100'}
--opt flashinfer=0;cublas_gemm=0;faster_transformer=0;cudagraph=0;cutlass=0;ipc_allreduce_strategy=NONE
--system-lib-prefix ""
--output /tmp/tmpby858nrj/lib.so
--overrides context_window_size=None;sliding_window_size=None;prefill_chunk_size=None;attention_sink_size=None;max_batch_size=None;tensor_parallel_shards=None;pipeline_parallel_stages=None;disaggregation=None
[2025-07-18 12:06:14] INFO compile.py:140: Creating model from: Qwen3Config(hidden_act='silu', hidden_size=5120, intermediate_size=25600, attention_bias=False, num_attention_heads=64, num_hidden_layers=64, num_key_value_heads=8, rms_norm_eps=1e-06, rope_theta=1000000, vocab_size=151936, tie_word_embeddings=False, context_window_size=16384, prefill_chunk_size=2048, tensor_parallel_shards=1, head_dim=128, dtype='float32', max_batch_size=128, weight_block_size=None, kwargs={})
[2025-07-18 12:06:14] INFO compile.py:158: Exporting the model to TVM Unity compiler
[2025-07-18 12:06:17] INFO compile.py:164: Running optimizations using TVM Unity
[2025-07-18 12:06:17] INFO compile.py:186: Registering metadata: {'model_type': 'qwen3', 'quantization': 'q4f16_0', 'context_window_size': 16384, 'sliding_window_size': -1, 'attention_sink_size': -1, 'prefill_chunk_size': 2048, 'tensor_parallel_shards': 1, 'pipeline_parallel_stages': 1, 'disaggregation': False, 'kv_state_kind': 'kv_cache', 'max_batch_size': 128}
[2025-07-18 12:06:19] INFO pipeline.py:57: Running TVM Relax graph-level optimizations
[2025-07-18 12:06:23] INFO pipeline.py:57: Lowering to TVM TIR kernels
[12:06:25] /workspace/tvm/include/tvm/topi/transform.h:1219: Warning: Fast mode segfaults when there are out-of-bounds indices. Make sure input indices are in bound
[12:06:27] /workspace/tvm/include/tvm/topi/transform.h:1219: Warning: Fast mode segfaults when there are out-of-bounds indices. Make sure input indices are in bound
[2025-07-18 12:06:33] INFO pipeline.py:57: Running TVM TIR-level optimizations
[2025-07-18 12:07:14] INFO pipeline.py:57: Running TVM Dlight low-level optimizations
[2025-07-18 12:07:15] INFO pipeline.py:57: Lowering to VM bytecode
!!!!!!! TVM FFI encountered a Segfault !!!!!!!
File "", in pyx_pw_3tvm_3ffi_4core_8Function_1__call(object*, object* const*, long, object*)
File "", in pyx_f_3tvm_3ffi_4core_FuncCall(void*, object*, TVMFFIAny*, int*) [clone .constprop.0]
File "", in tvm::ffi::FunctionObj::SafeCall(void*, TVMFFIAny const*, int, TVMFFIAny*)
File "", in tvm::ffi::Function::FromTyped<tvm::transform::TVMFFIFuncReg8::{lambda(tvm::transform::Pass, tvm::ffi::RValueRef<tvm::IRModule, void>)#1}>(tvm::transform::TVMFFIFuncReg8::{lambda(tvm::transform::Pass, tvm::ffi::RValueRef<tvm::IRModule, void>)#1}, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)::{lambda(tvm::ffi::AnyView const*, int, tvm::ffi::Any*)#1}::operator()(tvm::ffi::AnyView const*, int, tvm::ffi::Any*)
File "", in tvm::transform::Pass::operator()(tvm::IRModule) const
File "", in tvm::transform::Pass::operator()(tvm::IRModule, tvm::transform::PassContext const&) const
File "", in tvm::transform::SequentialNode::operator()(tvm::IRModule, tvm::transform::PassContext const&) const
File "", in tvm::transform::Pass::operator()(tvm::IRModule, tvm::transform::PassContext const&) const
File "", in tvm::tir::transform::PrimFuncPassNode::operator()(tvm::IRModule, tvm::transform::PassContext const&) const
File "", in std::_Function_handler<tvm::tir::PrimFunc (tvm::tir::PrimFunc, tvm::IRModule, tvm::transform::PassContext), tvm::tir::transform::ForceNarrowIndexToInt32()::{lambda(tvm::tir::PrimFunc, tvm::IRModule, tvm::transform::PassContext)#1}>::_M_invoke(std::_Any_data const&, tvm::tir::PrimFunc&&, tvm::IRModule&&, tvm::transform::PassContext&&)
File "", in tvm::tir::ForceNarrowIndexToInt32(tvm::tir::PrimFunc)
File "", in tvm::tir::Int32DTypeNarrower::RewriteDataType(tvm::tir::PrimFunc)
File "", in tvm::tir::IndexDataTypeNormalizer::Rewrite(tvm::tir::PrimFunc)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#15}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::BlockRealizeNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#14}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::Int32DTypeNarrower::VisitStmt_(tvm::tir::BlockNode const*)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::BlockNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#3}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::IfThenElseNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#3}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::IfThenElseNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#15}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::BlockRealizeNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#14}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::Int32DTypeNarrower::VisitStmt_(tvm::tir::BlockNode const*)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::BlockNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#10}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::StmtMutator::VisitStmt_(tvm::tir::SeqStmtNode const*)
File "", in tvm::ffi::Array<tvm::tir::Stmt, std::enable_if<storage_enabled_v<tvm::tir::Stmt>, void>::type> tvm::tir::StmtMutator::Internal::MutateArray<tvm::tir::Stmt, tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1}>(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, std::enable_if<storage_enabled_v<tvm::tir::Stmt>, void>::type> const&, tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1})
File "", in tvm::ffi::ObjectPtr<tvm::ffi::Object> tvm::ffi::Array<tvm::tir::Stmt, void>::MapHelper<tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1}, tvm::tir::Stmt>(tvm::ffi::ObjectPtr<tvm::ffi::Object>, tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1})
File "", in tvm::tir::StmtMutator::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#10}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::StmtMutator::VisitStmt_(tvm::tir::SeqStmtNode const*)
File "", in tvm::ffi::Array<tvm::tir::Stmt, std::enable_if<storage_enabled_v<tvm::tir::Stmt>, void>::type> tvm::tir::StmtMutator::Internal::MutateArray<tvm::tir::Stmt, tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1}>(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, std::enable_if<storage_enabled_v<tvm::tir::Stmt>, void>::type> const&, tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1})
File "", in tvm::ffi::ObjectPtr<tvm::ffi::Object> tvm::ffi::Array<tvm::tir::Stmt, void>::MapHelper<tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1}, tvm::tir::Stmt>(tvm::ffi::ObjectPtr<tvm::ffi::Object>, tvm::tir::StmtMutator::Internal::Mutate(tvm::tir::StmtMutator*, tvm::ffi::Array<tvm::tir::Stmt, void> const&)::{lambda(tvm::tir::Stmt const&)#1})
File "", in tvm::tir::StmtMutator::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#4}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::ForNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#15}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::BlockRealizeNode const*)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::VisitStmt(tvm::tir::Stmt const&)
File "", in tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>::InitVTable()::{lambda(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)#14}::_FUN(tvm::ffi::ObjectRef const&, tvm::tir::StmtFunctor<tvm::tir::Stmt (tvm::tir::Stmt const&)>)
File "", in tvm::tir::Int32DTypeNarrower::VisitStmt_(tvm::tir::BlockNode const*)
File "", in tvm::tir::IndexDataTypeRewriter::VisitStmt_(tvm::tir::BlockNode const*)
File "", in tvm::tir::IndexDataTypeRewriter::VisitBlockAnnotations(tvm::ffi::Map<tvm::ffi::String, tvm::ffi::Any, void> const&)
File "", in std::_Function_handler<tvm::ffi::ObjectRef (tvm::ffi::ObjectRef const&), tvm::tir::IndexDataTypeRewriter::VisitBlockAnnotations(tvm::ffi::Map<tvm::ffi::String, tvm::ffi::Any, void> const&)::{lambda(tvm::ffi::ObjectRef const&)#1}>::_M_invoke(std::_Any_data const&, tvm::ffi::ObjectRef const&)
File "", in tvm::ffi::ObjectPtr<tvm::ffi::Object> tvm::ffi::Array<tvm::ffi::ObjectRef, void>::MapHelper<std::function<tvm::ffi::ObjectRef (tvm::ffi::ObjectRef const&)>, tvm::ffi::ObjectRef>(tvm::ffi::ObjectPtr<tvm::ffi::Object>, std::function<tvm::ffi::ObjectRef (tvm::ffi::ObjectRef const&)>)
File "", in std::_Function_handler<tvm::ffi::ObjectRef (tvm::ffi::ObjectRef const&), tvm::tir::IndexDataTypeRewriter::VisitBlockAnnotations(tvm::ffi::Map<tvm::ffi::String, tvm::ffi::Any, void> const&)::{lambda(tvm::ffi::ObjectRef const&)#1}>::_M_invoke(std::_Any_data const&, tvm::ffi::ObjectRef const&)
File "", in tvm::ffi::ObjectPtr<tvm::ffi::Object> tvm::ffi::Array<tvm::ffi::ObjectRef, void>::MapHelper<std::function<tvm::ffi::ObjectRef (tvm::ffi::ObjectRef const&)>, tvm::ffi::ObjectRef>(tvm::ffi::ObjectPtr<tvm::ffi::Object>, std::function<tvm::ffi::ObjectRef (tvm::ffi::ObjectRef const&)>)
File "./signal/../sysdeps/unix/sysv/linux/x86_64/libc_sigaction.c", in 0x00007fc547d3851f
File "", in tvm::ffi::(anonymous namespace)::backtrace_handler(int)
File "", in tvm::ffi::(anonymous namespace)::Traceback()

Traceback (most recent call last):
  File "/home/a/miniconda3/envs/mlcllm/bin/mlc_llm", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/__main__.py", line 50, in main
    cli.main(sys.argv[2:])
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/cli/serve.py", line 208, in main
    serve(
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/interface/serve.py", line 57, in serve
    async_engine = engine.AsyncMLCEngine(
                   ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/serve/engine.py", line 895, in __init__
    super().__init__(
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/serve/engine_base.py", line 590, in __init__
    ) = _process_model_args(models, device, engine_config)
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/serve/engine_base.py", line 171, in _process_model_args
    model_args: List[Tuple[str, str]] = [_convert_model_info(model) for model in models]
                                        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/serve/engine_base.py", line 171, in <listcomp>
    model_args: List[Tuple[str, str]] = [_convert_model_info(model) for model in models]
                                         ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/serve/engine_base.py", line 164, in _convert_model_info
    model_lib = jit.jit(
                ^^^^^^^^
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/interface/jit.py", line 164, in jit
    _run_jit(
  File "/home/a/miniconda3/envs/mlcllm/lib/python3.11/site-packages/mlc_llm/interface/jit.py", line 124, in _run_jit
    raise RuntimeError("Cannot find compilation output, compilation failed")
RuntimeError: Cannot find compilation output, compilation failed
