Skip to content

Issues: mlc-ai/mlc-llm

Project Tracking
#647 opened Aug 2, 2023 by tqchen
Open
Model Request Tracking
#1042 opened Oct 9, 2023 by CharlieFRuan
Open 4
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug] bug Confirmed bugs
#3143 opened Feb 20, 2025 by leezear2022
[Question] mlc-llm server cannot return correct logprobs question Question about the usage
#3142 opened Feb 19, 2025 by kunxiongzhu
[Question] how to use function call question Question about the usage
#3141 opened Feb 19, 2025 by tebie6
[Bug] Gemma 2 models fail due to errors in tokenizer bug Confirmed bugs
#3138 opened Feb 17, 2025 by julioasotodv
[Bug] Softmax op is very slow bug Confirmed bugs
#3132 opened Feb 13, 2025 by gesanqiu
[Bug] Is it compiling? CUDA 12.8 bug Confirmed bugs
#3129 opened Feb 12, 2025 by johnnynunez
Very slow time to first token on ROCM question Question about the usage
#3119 opened Feb 5, 2025 by Jyers
How to stop a stream? question Question about the usage
#3113 opened Jan 30, 2025 by hpssjellis
[Question] Android App Crash question Question about the usage
#3091 opened Jan 16, 2025 by mhollis1980
[Question] semantic description of different quantization methods question Question about the usage
#3088 opened Jan 9, 2025 by phgcha
[Bug] Broken for Intel Macs since v0.15 (or earlier) bug Confirmed bugs
#3078 opened Dec 31, 2024 by zxcat
[Feature Request] Provide a C++ API feature request New feature or request
#3066 opened Dec 16, 2024 by tranlm
ProTip! Follow long discussions with comments:>50.