mnn-llm


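Example Projects

cli: build from the command line; for Android builds, see android_build.sh
web: build from the command line; the path to the web resources must be specified at run time
android: open and build with Android Studio;
ios: open and build with Xcode; 🚀🚀🚀 this example code was 100% generated by ChatGPT 🚀🚀🚀
python: mnnllm, a Python wrapper for mnn-llm;
other: newly added text embedding, vector query, text parsing, memory bank, and knowledge base capabilities 🔥;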

Model Export and Download

To export LLM models to ONNX and MNN formats, use llm-export.

Model download

Build

CI build status:

Local build

# clone
git clone --recurse-submodules https://github.com/wangzhaode/mnn-llm.git
cd mnn-llm

# linux
./script/build.sh

# macos
./script/build.sh

# windows msvc
./script/build.ps1

# python wheel
./script/py_build.sh

# android
./script/android_build.sh

# android apk
./script/android_app_build.sh

# ios
./script/ios_build.sh
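
The clone command above uses --recurse-submodules so that the MNN sources are fetched together with the repository. If the repository was cloned without that flag, the build scripts will likely fail. The sketch below is one way to check for this before building; the helper name submodule_ready and the assumption that the submodule lives in a directory named MNN are illustrative, not part of the project's scripts.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the given directory exists and is non-empty.
submodule_ready() {
    [ -d "$1" ] && [ -n "$(ls -A "$1" 2>/dev/null)" ]
}

# Assumption: the MNN submodule is checked out at ./MNN.
if ! submodule_ready MNN; then
    echo "MNN submodule looks empty; try: git submodule update --init --recursive"
fi
```

git submodule update --init --recursive is the standard git command for populating submodules after a plain clone.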

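Some compile macros:

BUILD_FOR_ANDROID: build for Android devices;
LLM_SUPPORT_VISION: whether to enable vision processing support;
DUMP_PROFILE_INFO: dump performance data to the command line after each conversation;

The CPU backend is used by default. To use another backend or capability, add the corresponding MNN compile macro when building MNN: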

cuda: -DMNN_CUDA=ON
opencl: -DMNN_OPENCL=ON
metal: -DMNN_METAL=ON
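
As a rough illustration of how the backend macros above might be selected in a build script, the following sketch maps a backend name to its flag. The function name mnn_backend_flag is hypothetical, and how the flag is finally passed to the MNN build (for example, appended to a cmake invocation inside the build script) is an assumption that should be checked against script/build.sh.

```shell
#!/bin/sh
# Hypothetical helper: print the MNN CMake flag for a backend name.
# The flags themselves are the ones listed above; cpu needs no extra flag.
mnn_backend_flag() {
    case "$1" in
        cpu)    echo "" ;;
        cuda)   echo "-DMNN_CUDA=ON" ;;
        opencl) echo "-DMNN_OPENCL=ON" ;;
        metal)  echo "-DMNN_METAL=ON" ;;
        *)      echo "unknown backend: $1" >&2; return 1 ;;
    esac
}

# Show (without running) what an MNN configure step with OpenCL could look like.
echo "cmake .. $(mnn_backend_flag opencl)"
```

Keeping the mapping in one place means an unsupported backend name fails early with a clear message instead of silently producing a CPU-only build.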

Run

# linux/macos
./cli_demo ./Qwen2-1.5B-Instruct-MNN/config.json # cli demo
./web_demo ./Qwen2-1.5B-Instruct-MNN/config.json ../web # web ui demo

# windows
.\Debug\cli_demo.exe ./Qwen2-1.5B-Instruct-MNN/config.json
.\Debug\web_demo.exe ./Qwen2-1.5B-Instruct-MNN/config.json ../web

# android
adb push android_build/MNN/OFF/arm64-v8a/libMNN.so /data/local/tmp
adb push android_build/MNN/express/OFF/arm64-v8a/libMNN_Express.so /data/local/tmp
adb push android_build/libllm.so android_build/cli_demo /data/local/tmp
adb push Qwen2-1.5B-Instruct-MNN /data/local/tmp
adb shell "cd /data/local/tmp && export LD_LIBRARY_PATH=. && ./cli_demo ./Qwen2-1.5B-Instruct-MNN/config.json"
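
Every run command above takes the path to a model's config.json. A small pre-flight check, sketched below, can give a clearer error when the model directory has not been downloaded yet. The helper name check_model_dir and the MODEL_DIR variable are illustrative only; the demo binaries may well report this error on their own.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if <dir>/config.json exists.
check_model_dir() {
    if [ ! -f "$1/config.json" ]; then
        echo "missing $1/config.json; download or export the model first" >&2
        return 1
    fi
}

# Model directory taken from the commands above; adjust to your download location.
MODEL_DIR=./Qwen2-1.5B-Instruct-MNN
if check_model_dir "$MODEL_DIR"; then
    ./cli_demo "$MODEL_DIR/config.json"
fi
```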

