Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

what different with mlc-ai/web-llm #21

Closed
novohool opened this issue Apr 24, 2023 · 5 comments
Closed

what different with mlc-ai/web-llm #21

novohool opened this issue Apr 24, 2023 · 5 comments

Comments

@novohool
Copy link

https://github.com/mlc-ai/web-llm

@cnrpman
Copy link

cnrpman commented Apr 24, 2023

Same question.
Also WebGPT is usually referring this paper in the field of LLM: https://arxiv.org/abs/2112.09332v3
Hope clarifying the project name decision

@0hq
Copy link
Owner

0hq commented Apr 24, 2023

Sure.

WebLLM is using TVM and compiling from C++ to WebAssembly, which is more performant but vastly more complicated. This is built to be both a solid educational reference as well as a place to start more complex projects from, being only written in a small amount of Javascript.

Considering using Javascript here has minor performance impacts vs. the simplicity it provides (mostly all computation is done on WebGPU, meaning Javascript isn't involved), this is preferable in various circumstances.

@novohool
Copy link
Author

Sure.

WebLLM is using TVM and compiling from C++ to WebAssembly, which is more performant but vastly more complicated. This is built to be both a solid educational reference as well as a place to start more complex projects from, being only written in a small amount of Javascript.

Considering using Javascript here has minor performance impacts vs. the simplicity it provides (mostly all computation is done on WebGPU, meaning Javascript isn't involved), this is preferable in various circumstances.

the same as this proj? https://xenova.github.io/transformers.js

@cnrpman
Copy link

cnrpman commented Apr 25, 2023

@jingslunt It's using onnxruntimeweb, mature but no webGPU backend for now.

@novohool
Copy link
Author

@jingslunt It's using onnxruntimeweb, mature but no webGPU backend for now.
webgpu merged
microsoft/onnxruntime#14579

@0hq 0hq closed this as completed May 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants