Conversation
|
I think this is awesome, I don't have the time to contribute right now, but this is great! One additional idea I had was to also allow virtual model aliases to be configured via a UI for the API. It could allow mapping of things like "gpt-3.5-turbo" to a full model config (generation params, model file, loading params, etc.) - and allow switching models on the fly easily, like if an API call came in with "code-davinci-002" you could switch to a code completion model. |
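As a sketch of what such an alias layer might look like (the schema, field names, and model files below are invented for illustration, not an actual config format in the project):

```python
# Hypothetical alias table: OpenAI model names mapped to full local configs.
# All fields and file names are made up for illustration.
MODEL_ALIASES = {
    "gpt-3.5-turbo": {
        "model_file": "mistral-7b-v0.1.Q6_K.gguf",
        "loader": "llama.cpp",
        "generation": {"temperature": 0.7, "max_tokens": 512},
    },
    "code-davinci-002": {
        "model_file": "codellama-7b-instruct.Q5_K_M.gguf",
        "loader": "llama.cpp",
        "generation": {"temperature": 0.2, "max_tokens": 1024},
    },
}

def resolve_model(requested: str) -> dict:
    """Look up the local config for an incoming API model name."""
    if requested not in MODEL_ALIASES:
        raise KeyError(f"no alias configured for {requested!r}")
    return MODEL_ALIASES[requested]
```

An incoming API call for "code-davinci-002" would then trigger loading the mapped model (if downloaded) before generation.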
|
That's an interesting idea @matatonic, I'll see if I can come up with something in this direction -- the only caveat is that the user will have to have downloaded an appropriate model. This PR will probably take forever to complete, but I think it's very important. Eventually I'll get there. |
This reverts commit 5d0a6f4.
|
So this is going to sound bananas as someone who almost exclusively uses the OpenAI API with textgen, but I think your core approach of having a core “textgen” API with addons or extensions for other APIs feels like the right one. On the other hand, yes, I do think it would be great for the openai API extension to be treated as a “first class citizen” of sorts for now and get a fresh coat of paint.

The reason I would not change everything over to JUST be the OpenAI API is that we are still in 2023; ChatGPT just launched and made waves months ago, so of course, as the current market leader, all current “cool” stuff is being made to utilize it. But who is to say that will still be the case in 2024? Meta is in the process of launching Llama on so many services for free and already has the server infrastructure to transmit text to servers at high speed all around the world. What if 2024 sees Meta launch free API access to Llama2 or Llama2.5 or whatever, and they have their own API format? Suddenly we may see services created mostly for that platform, as people will not have to pay to use it. Same for Microsoft, Google, Apple.

And it’s not like the OpenAI API is the most user friendly, with all those dictionaries within dictionaries within arrays. As they continue to push toward multi-modal, they could even change their own API again the way they did from completions to chat completions. So I just worry about you basing the core of your API software on OpenAI and then it getting deprecated or outdated.

Maybe I am misunderstanding the intent here, though: maybe by “one API” you just mean people wouldn’t have to turn different endpoints on and off; they would just be available, you could add on to them, and all would have access to the core program features and changes made to the GUI, which would be great. Anyway, I’m just one voice and don’t know very much, so take this with a grain of salt.
I do think you’ve accomplished something amazing here, which is democratizing AI and making it as easy to use as possible. |
|
@teddybear082 even if another API becomes more popular, using FastAPI, type hints, and SSE instead of websockets, and having API documentation, will still be a benefit. It will be a matter of changing the endpoints and parameters while retaining the overall structure. It will also be possible for extensions to add endpoints very easily by simply importing |
|
I think that the goal of this PR has been achieved -- moving the features of the old API to the OpenAI API. It is passing all my tests. The docs need some more polishing, but that can be done later. I'll merge the PR so that more people can test the updated API in the dev branch. |
|
Just for completeness here: I did pull down the dev branch today, and it also cleared my openai-api tests. |
|
How do I abort the completion? Closing the connection doesn't make the completion stop, and |
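For context, server-side generation typically has to cooperate with cancellation: the generation loop checks a stop flag that the server sets when it detects a client disconnect (with FastAPI/Starlette, for example, via `request.is_disconnected()`). A minimal stdlib sketch of that pattern, with invented token names:

```python
import threading

def generate_tokens(stop_event: threading.Event, n: int = 100):
    """Yield fake tokens until finished or until stop_event is set."""
    for i in range(n):
        if stop_event.is_set():
            break  # abort mid-generation
        yield f"token-{i}"

stop = threading.Event()
received = []
for token in generate_tokens(stop):
    received.append(token)
    if len(received) == 3:
        stop.set()  # simulate the client disconnecting

print(received)  # ['token-0', 'token-1', 'token-2']
```

Without such a check, the model keeps generating after the connection closes, which matches the behavior described above.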
|
@oobabooga Yes, it's working as expected. Thank you! However, I found another potential problem with this API: the first generated token never comes with a leading space, and an empty token appears before it.

{
"id": "conv-1699323154472440320",
"object": "text_completion.chunk",
"created": 1699323154,
"model": "mistral-7b-v0.1.Q6_K.gguf",
"choices": [
{
"index": 0,
"finish_reason": null,
"text": "",
"logprobs": {
"top_logprobs": [
{}
]
}
}
]
}
{
"id": "conv-1699323154472440320",
"object": "text_completion.chunk",
"created": 1699323154,
"model": "mistral-7b-v0.1.Q6_K.gguf",
"choices": [
{
"index": 0,
"finish_reason": null,
"text": "upon",
"logprobs": {
"top_logprobs": [
{}
]
}
}
]
}

This is llama.cpp's OpenAI proxy for reference:

{
"id": "cmpl",
"object": "text_completion.chunk",
"created": 1699322948,
"model": "LLaMA_CPP",
"choices": [
{
"finish_reason": null,
"index": 0,
"text": " Upon"
}
]
} |
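To make the reported difference concrete, here is a small sketch that joins the `text` fields of streamed chunks the way a client would; with the empty first chunk and the missing leading space, the client reconstructs "upon" where llama.cpp's proxy would yield " Upon":

```python
import json

# Abbreviated versions of the two chunks from the report above.
chunks = [
    '{"choices": [{"index": 0, "text": ""}]}',
    '{"choices": [{"index": 0, "text": "upon"}]}',
]

# Clients reconstruct the completion by concatenating chunk texts.
text = "".join(json.loads(c)["choices"][0]["text"] for c in chunks)
print(repr(text))  # 'upon'  (the leading space is lost)
```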
|
That is indeed a bug. It should be fixed after 97c21e5. If you notice anything else weird, please let me know! |
|
I don't know if this has any impact on anything, and you may have seen it, but on the OpenAI playground, the scripts now recommend a slightly different format for calling the OpenAI API in Python code. Perhaps this doesn't impact the server side of trying to emulate the OpenAI server at all, only the requestor, but I figured I would pass it along just in case. There's also a new "response_format" field that can be passed; maybe something like jsonformer can be used, or grammar implemented, when someone passes that field, not sure: https://platform.openai.com/playground?mode=chat&model=gpt-4-1106-preview https://platform.openai.com/docs/quickstart?context=python https://platform.openai.com/docs/api-reference/chat/streaming |
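As a sketch of that idea (the function name and return values are hypothetical, not anything the project implements):

```python
def pick_output_constraint(body: dict):
    """Hypothetical dispatch: decide whether to constrain generation
    based on an OpenAI-style response_format field in the request body."""
    fmt = body.get("response_format") or {}
    if fmt.get("type") == "json_object":
        # A server could plug in jsonformer or a GBNF grammar here
        # to force syntactically valid JSON output.
        return "json-grammar"
    return None  # unconstrained generation

# A request asking for JSON mode:
print(pick_output_constraint({"response_format": {"type": "json_object"}}))  # json-grammar
```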
The most popular LLM API in the world is the OpenAI API, so I think that it makes sense to emulate it in this project when the --api flag is provided. This is already the case for vLLM and FastChat.

The goal is to start from the current openai extension and make the following changes before merging: replace the http.server implementation with FastAPI, such that the API docs can be accessed at 127.0.0.1:5001/docs.

Status

I have converted everything to FastAPI, and the completion endpoints seemingly work (both with and without streaming).
OpenAI API reference
https://platform.openai.com/docs/api-reference
cc @matatonic