Fix ChatGPT API endpoint #6

AlexCheema · 2024-07-16T05:40:14Z

Currently doesn't work - which means the only way to access inference is via peer handles in Python, which isn't very user-friendly for applications to add a new library.

AlexCheema · 2024-07-16T05:44:18Z

One of the issues here is that since we don't treat any node as a "master" or "worker" (all nodes are equal), the "head" and "tail" nodes are dynamic. However, the prompt needs to be sent to the "head" and the tokens are received by the "tail".

We could hack this by breaking the p2p equality assumption, but that would need to be fixed down the line.

The better thing is to:

forward the prompt until you hit the "head" (this should be fairly simple).
forward API requests to the "tail" that has all the generated tokens (this sounds hard and hacky). The other thing we can do is only serve the API from the "tail"

AlexCheema · 2024-07-16T06:03:29Z

Fixed the former: 1d5c28a
The latter is harder and for now, we'll only support API requests to the "tail". You can always force a node to be the "tail" by specifying its node-id to be sorted last e.g. with python3 main.py --node-id "xxx-node"

AlexCheema · 2024-07-16T07:22:13Z

Fixed in f2895cb

merge fork to pr branch

AlexCheema closed this as completed Jul 16, 2024

HysenX-LI mentioned this issue Aug 27, 2024

Segmentation fault(Core dumped) in tinygrad #180

Open

lipere123 referenced this issue in lipere123/exo Oct 11, 2024

Merge pull request #6 from exo-explore/main

fb7c73f

merge fork to pr branch

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix ChatGPT API endpoint #6

Fix ChatGPT API endpoint #6

AlexCheema commented Jul 16, 2024

AlexCheema commented Jul 16, 2024

AlexCheema commented Jul 16, 2024 •

edited

Loading

AlexCheema commented Jul 16, 2024

Fix ChatGPT API endpoint #6

Fix ChatGPT API endpoint #6

Comments

AlexCheema commented Jul 16, 2024

AlexCheema commented Jul 16, 2024

AlexCheema commented Jul 16, 2024 • edited Loading

AlexCheema commented Jul 16, 2024

AlexCheema commented Jul 16, 2024 •

edited

Loading