We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
你好,非常感谢作者的工作和无私奉献 通过对比我发现以下两个问题: 1.chatglm-6b的chatglm.cpp首个token的推理速度比Python版本慢了好几倍,特别是输入长度大于100。 2. 输入长度超过1000字符,chatglm.cpp的结果更差,输出的长度比Python版本短了50%以上。 机器型号:CPU型号:Intel(R) Xeon(R) Platinum 8475B,CPU核数:16,内存:60Gi
The text was updated successfully, but these errors were encountered:
可以提供测试数据,和脚本?
Sorry, something went wrong.
No branches or pull requests
你好,非常感谢作者的工作和无私奉献
通过对比我发现以下两个问题:
1.chatglm-6b的chatglm.cpp首个token的推理速度比Python版本慢了好几倍,特别是输入长度大于100。
2. 输入长度超过1000字符,chatglm.cpp的结果更差,输出的长度比Python版本短了50%以上。
机器型号:CPU型号:Intel(R) Xeon(R) Platinum 8475B,CPU核数:16,内存:60Gi
The text was updated successfully, but these errors were encountered: