Make all outputs in "streaming" format #806
Conversation
config: serverConfigRes.aiconfig,
});
}
const enableStreaming = isStreamingSupported(
Can we change `isStreamingSupported` to something like `getModelSettingsStream` or similar, and have it return either a boolean or undefined? The thinking here is:
- By default, models that support streaming but don't have a `stream` value set in their settings should still stream in the editor, for the best UX out of the box.
- If the user explicitly sets the stream value to 'no' in settings, then we shouldn't stream.
- Our server implementation can technically support a 'stream'-style response for all models, so we can handle an undefined `enableStreaming` param in the Editor.tsx `runPrompt` function by defaulting to true. Other external implementations might not, so they can handle the undefined case however they want (see the sketch after this list).
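For concreteness, a minimal sketch of that suggestion. `getModelSettingsStream` is the name proposed above; the `PromptLike` shape and the `runPrompt` body are placeholder assumptions, not the actual editor client code:

```typescript
// Sketch only: PromptLike and the runPrompt body are assumptions for illustration.
interface PromptLike {
  metadata?: { model?: { settings?: { stream?: boolean } } };
}

// Return the explicit `stream` setting if the user set one; otherwise undefined.
function getModelSettingsStream(prompt: PromptLike): boolean | undefined {
  return prompt.metadata?.model?.settings?.stream;
}

// In Editor.tsx, runPrompt can treat undefined as "stream by default", since our
// server supports streaming-style responses for all models. External
// implementations are free to handle the undefined case differently.
async function runPrompt(promptName: string, enableStreaming?: boolean): Promise<void> {
  const shouldStream = enableStreaming ?? true;
  console.log(`running ${promptName} with stream=${shouldStream}`);
}
```

With that contract, a prompt whose settings omit `stream` still streams in the editor, while an explicit `stream: false` turns it off.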
c08f51b to bf9b17e
LGTM, just two minor things
Doing this to make it easier for Ryan to parse with a single unified output format, regardless of whether the model parser actually supports streaming or not.

- Moved `runPromptStream` -> `runPrompt`, overriding the old definition of `runPrompt` by now passing in an `enableStreaming` flag.
- Still relying on the `isStreamingSupported()` function to set the `enableStreaming` param to true or false.
- Default the `stream` param to `True` on the backend server (this has no effect for non-streaming models like Dall-E).

A rough sketch of the resulting call path is included after this description.

## Test Plan

Test that both streaming and non-streaming settings work as expected, and that a model which does not support streaming (e.g. Dall-E) still works. When the user hasn't explicitly set the "stream" setting, it now defaults to streaming; if they explicitly toggle it off, streaming is disabled. As a follow-up, we should have the "stream" button auto-enabled to reflect this (it doesn't have to actually be in the config, the UI should just show it as on by default to match user expectations).

Update: the default value for `stream` is now `true` inside `OpenAIChatModelParserPromptSchema`, `HuggingFaceTextGenerationParserPromptSchema` and `AnyscaleEndpointPromptSchema`. Couldn't see it defined in `PaLMTextParserPromptSchema`.

https://github.com/lastmile-ai/aiconfig/assets/151060367/34214a66-0cea-4774-a917-9476359f172c
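A minimal sketch of the unified call path described above. Only `isStreamingSupported`, `runPrompt`, and the `enableStreaming` flag come from this PR; the `PromptLike` shape and the function bodies below are placeholder assumptions, not the real EditorContainer.tsx code:

```typescript
// Sketch only: shapes and bodies here are simplified placeholders.
interface PromptLike {
  name: string;
  metadata?: { model?: { settings?: { stream?: boolean } } };
}

// Placeholder for the existing helper that decides whether a prompt should stream.
function isStreamingSupported(prompt: PromptLike): boolean {
  return prompt.metadata?.model?.settings?.stream !== false;
}

// runPrompt replaces the old runPromptStream and always takes the flag; the server
// responds in streaming format either way, so there is one output path to parse.
async function runPrompt(promptName: string, enableStreaming: boolean): Promise<void> {
  console.log(`running ${promptName} with stream=${enableStreaming}`);
}

async function onRunPrompt(prompt: PromptLike): Promise<void> {
  const enableStreaming = isStreamingSupported(prompt);
  await runPrompt(prompt.name, enableStreaming);
}
```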
Delete `aiconfig_complete` stream response, replace with `aiconfig` (#911)

Previously we did not support streaming, so when we returned `aiconfig` it came from a blocking operation. That meant we had to set the `isRunning` prompt state to true while waiting, but we no longer need to do that now that all run events return in streaming response format, even for non-streaming models: #806

We are also no longer using the `streamApi` helper, since we now use `streamingApiChain`, which was added in #789.

Finally, if you want more resources on how streaming is connected, check out #910, a teaching guide I built explaining how the code is connected.

## Test Plan

Both streaming and non-streaming models work as before.

https://github.com/lastmile-ai/aiconfig/assets/151060367/b62e7887-20af-4c0c-ab85-eeaacaab64e0

---

Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/lastmile-ai/aiconfig/pull/911).
* #912
* __->__ #911
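To illustrate why the blocking path (and the `aiconfig_complete` event) is no longer needed, here is a minimal sketch of a consumer of the unified streaming format. The event shape, type names other than `aiconfig`, and the `onChunk` callback are assumptions, not the actual `streamingApiChain` API:

```typescript
// Illustrative only: event/type names other than `aiconfig` are assumptions.
type AIConfigJSON = Record<string, unknown>; // placeholder for the real AIConfig type

type RunStreamEvent =
  | { type: "output_chunk"; data: string }    // incremental output for streaming models
  | { type: "aiconfig"; data: AIConfigJSON }; // final event; replaces `aiconfig_complete`

async function consumeRunStream(
  events: AsyncIterable<RunStreamEvent>,
  onChunk: (chunk: string) => void
): Promise<AIConfigJSON> {
  let finalConfig: AIConfigJSON | undefined;
  for await (const event of events) {
    if (event.type === "output_chunk") {
      onChunk(event.data); // non-streaming models may emit few or no chunks
    } else {
      finalConfig = event.data; // every run ends with the full updated aiconfig
    }
  }
  if (finalConfig === undefined) {
    throw new Error("stream ended without an aiconfig event");
  }
  // No blocking wait on a hanging request, so there is no need to hold an
  // isRunning flag across the whole run just to know when the config arrives.
  return finalConfig;
}
```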