package exo as installable #470

Merged: 25 commits merged into exo-explore:main on Nov 19, 2024

Conversation

@ghost commented Nov 18, 2024

@AlexCheema (Contributor)

LGTM

@AlexCheema merged commit 0501efa into exo-explore:main on Nov 19, 2024

@OKHand-Zy commented Nov 20, 2024

I've pulled the latest main branch (1fa42f3) and the code runs, but I get a "not defined" error when exiting exo. exo still works, but it prints this message:

❯ exo  --inference-engine mlx --run-model llama-3.2-3b
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
Selected inference engine: mlx

  _____  _____  
 / _ \ \/ / _ \ 
|  __/>  < (_) |
 \___/_/\_\___/ 
    
Detected system: Apple Silicon Mac
Inference engine name after selection: mlx
Using inference engine: MLXDynamicShardInferenceEngine with shard downloader: HFShardDownloader
[59392, 62317, 62890, 61755, 51505, 54822, 59529, 58544, 58825, 58707, 54319, 59382, 57740, 55399, 62061, 56510, 61677, 54465, 58521]
Chat interface started:
 - http://172.20.10.8:52415
 - http://127.0.0.1:52415
ChatGPT API endpoint served at:
 - http://172.20.10.8:52415/v1/chat/completions
 - http://127.0.0.1:52415/v1/chat/completions
has_read=True, has_write=True
Processing prompt: <|begin_of_text|><|start_header_id|>system<|end_header_id|>

Cutting Knowledge Date: December 2023
Today Date: 26 Jul 2024

<|eot_id|><|start_header_id|>user<|end_header_id|>

Who are you?<|eot_id|><|start_header_id|>assistant<|end_header_id|>


Removing download task for Shard(model_id='llama-3.2-3b', start_layer=0, end_layer=27, n_layers=28): True

Generated response:
I'm an artificial intelligence model known as Llama. Llama stands for "Large Language Model Meta AI."<|eot_id|>
Received exit signal SIGTERM...
Thank you for using exo.

  _____  _____  
 / _ \ \/ / _ \ 
|  __/>  < (_) |
 \___/_/\_\___/ 
    
Cancelling 4 outstanding tasks
Traceback (most recent call last):
  File "/Users/ziyu/miniconda3/envs/exo/bin/exo", line 33, in <module>
    sys.exit(load_entry_point('exo', 'console_scripts', 'exo')())
  File "/Users/ziyu/RemoteFolder/ziyu-pr/exo/exo/main.py", line 247, in run
    loop.run_until_complete(shutdown(signal.SIGTERM, loop))
  File "/Users/ziyu/miniconda3/envs/exo/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete
    return future.result()
  File "/Users/ziyu/RemoteFolder/ziyu-pr/exo/exo/helpers.py", line 249, in shutdown
    await server.stop()
NameError: name 'server' is not defined
╭────────────────────────────────────────────────────────────────────── Exo Cluster (1 node) ──────────────────────────────────────────────────────────────────────╮
╭──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ 💬️ Who are you?                                                                                                                                                  │
│                                                                                                                                                                  │
│ 🤖 I'm an artificial intelligence model known as Llama. Llama stands for "Large Language Model Meta AI."<|eot_id|>                                               │
.....
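
For what it's worth, the failing call is await server.stop() inside shutdown() in exo/helpers.py, and it seems the module-level server name is never bound when exo is started through the installed console script. A rough sketch of the kind of guard that would avoid the crash (illustrative only, not exo's actual code and not the fix in #473):

```python
import asyncio

# Illustrative sketch only, not exo's helpers.py: pass the server into the
# shutdown coroutine (or tolerate None) instead of relying on a module-level
# `server` name that may never have been bound in this code path.
async def shutdown(sig, loop, server=None):
    print(f"Received exit signal {sig.name}...")
    if server is not None:
        await server.stop()
    tasks = [t for t in asyncio.all_tasks(loop) if t is not asyncio.current_task()]
    print(f"Cancelling {len(tasks)} outstanding tasks")
    for task in tasks:
        task.cancel()
    await asyncio.gather(*tasks, return_exceptions=True)
    loop.stop()
```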

@ghost (Author) commented Nov 21, 2024

This is addressed in PR #473.

@OKHand-Zy

> This is addressed in PR #473.

Hi, I'd like to know if the change made on line 186 of exo/inference/mlx/sharded_utils.py is crucial.

Before: tokenizer = load_tokenizer(model_path, tokenizer_config)
After:  tokenizer = await resolve_tokenizer(model_path)

When I try to support local models, I encounter an issue, but it's resolved after reverting the change. So I'm wondering if this change is really necessary. If it's not that important, I'd prefer to use the old method.

@ghost (Author) commented Nov 21, 2024

The change was made by @dtnewman; for now I will revert it back to the old method.

@ghost mentioned this pull request on Nov 21, 2024
@AlexCheema (Contributor)

This change is correct. We should keep using resolve_tokenizer as it's async. We should not have sync blocking I/O code.

@OKHand-Zy, you will need to fix your code to work with resolve_tokenizer.
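
For context, the node's networking, API server, and inference coordination all run on one asyncio event loop (the loop you can see in the traceback above), so a synchronous tokenizer load freezes everything else while the files are read. A toy sketch of the difference, with made-up names rather than exo's actual code:

```python
import asyncio
import time

def load_tokenizer_blocking(model_path):
    # Stand-in for a synchronous tokenizer load (file reads, JSON parsing, ...).
    # While this runs on the event loop's thread, no other coroutine can progress.
    time.sleep(2)
    return f"tokenizer for {model_path}"

async def heartbeat():
    # Stands in for the other work a node does concurrently (discovery, HTTP, ...).
    for _ in range(4):
        print("still responsive")
        await asyncio.sleep(0.5)

async def main():
    # Calling load_tokenizer_blocking(...) directly here would freeze heartbeat()
    # for the full two seconds. Running it in a worker thread and awaiting the
    # result keeps the loop responsive, which is the behaviour an async
    # resolve_tokenizer-style helper gives the caller.
    tokenizer, _ = await asyncio.gather(
        asyncio.to_thread(load_tokenizer_blocking, "llama-3.2-3b"),
        heartbeat(),
    )
    print(tokenizer)

asyncio.run(main())
```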

@OKHand-Zy commented Nov 21, 2024

> This change is correct. We should keep using resolve_tokenizer as it's async. We should not have sync blocking I/O code.
>
> @OKHand-Zy, you will need to fix your code to work with resolve_tokenizer.

Okay, I'll try to modify my code to make resolve_tokenizer work properly.
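
Probably something along these lines: keep the async resolver for hub models and only fall back to the old synchronous loader for a local directory, wrapped in a worker thread so it never blocks the loop. The import paths and the local-directory check are my assumptions, not tested code:

```python
import asyncio
from pathlib import Path

# Assumed import locations; adjust to wherever these actually live.
from exo.inference.tokenizers import resolve_tokenizer
from mlx_lm.tokenizer_utils import load_tokenizer

async def get_tokenizer(model_path, tokenizer_config=None):
    if not Path(model_path).is_dir():
        # Hub model id: use the async resolver from the diff above.
        return await resolve_tokenizer(model_path)
    # Local model directory: fall back to the old synchronous loader, but keep
    # the blocking call off the event loop by running it in a worker thread.
    return await asyncio.to_thread(load_tokenizer, Path(model_path), tokenizer_config or {})
```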
