
Handle streamed function calls #1118

Merged
merged 11 commits into from
Jan 8, 2024
Conversation

@bitnom (Contributor) commented Jan 2, 2024

Why are these changes needed?

Currently, the user's setting of stream: True is disregarded (silently set to False) whenever function calling is used. We should honor the user's choice and pave the way for incremental response processing and chunked callback functionality.
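To make the goal concrete: honoring stream: True with function calling means accumulating the function_call deltas from each streamed chunk into one complete name/arguments pair. A minimal sketch of that accumulation, using hand-built dicts shaped like OpenAI streaming deltas (illustration only, not autogen's actual code):

```python
import json

def accumulate_function_call(chunks):
    """Merge streamed function_call deltas into one complete call.

    Each chunk is a dict shaped like an OpenAI streaming delta:
    {"function_call": {"name": ..., "arguments": ...}}, with either
    field possibly missing. The function name typically arrives once;
    the JSON arguments arrive as string fragments to concatenate.
    """
    name_parts, arg_parts = [], []
    for chunk in chunks:
        delta = chunk.get("function_call") or {}
        if delta.get("name"):
            name_parts.append(delta["name"])
        if delta.get("arguments"):
            arg_parts.append(delta["arguments"])
    return {"name": "".join(name_parts), "arguments": "".join(arg_parts)}

# Simulated stream: the name arrives first, then the arguments in pieces.
stream = [
    {"function_call": {"name": "get_weather"}},
    {"function_call": {"arguments": '{"city": '}},
    {"function_call": {"arguments": '"Paris"}'}},
    {},  # final chunk carries no delta
]
call = accumulate_function_call(stream)
print(call["name"])                   # get_weather
print(json.loads(call["arguments"]))  # {'city': 'Paris'}
```

Forcing stream=False instead, as the current code does, discards the chance to surface these fragments to a callback as they arrive.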

Related issue number

Resolves the review feedback on #786, making the work done in #597 more complete.

Closes #785.

Let's also ping #831 since it was linked to #786 for some reason.

Checks

@bitnom (Contributor, Author) commented Jan 2, 2024

@microsoft-github-policy-service agree

@davorrunje (Collaborator) commented:

Hi! Could you please include a test for it?

@bitnom (Contributor, Author) commented Jan 2, 2024

> Hi! Could you please include a test for it?

Thanks. I'm pretty sure this PR doesn't warrant a new test: no input or return schemas have been modified. It just supplies what's already expected via the existing pydantic model, which was already being used for the return data and should already have coverage.

I could be mistaken, but if so, I'm not yet seeing the test case.

@codecov-commenter commented Jan 2, 2024

Codecov Report

Attention: 2 lines in your changes are missing coverage. Please review.

Comparison is base (3680197) 31.92% compared to head (379f1c2) 51.19%.
Report is 2 commits behind head on main.

Files Patch % Lines
autogen/oai/client.py 87.50% 1 Missing and 1 partial ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##             main    #1118       +/-   ##
===========================================
+ Coverage   31.92%   51.19%   +19.26%     
===========================================
  Files          29       29               
  Lines        4097     4112       +15     
  Branches      955     1012       +57     
===========================================
+ Hits         1308     2105      +797     
+ Misses       2695     1806      -889     
- Partials       94      201      +107     
Flag Coverage Δ
unittests 51.09% <87.50%> (+19.21%) ⬆️


@davorrunje (Collaborator) commented:
Two things:

  1. Please install the pre-commit hook and run it on all files:

     pre-commit install
     pre-commit run --all-files

     That should reformat the source using black. Otherwise, the code formatting check will fail (https://github.com/microsoft/autogen/actions/runs/7384108079/job/20093405650?pr=1118).

  2. Please run the existing tests, but remove .cache first:

     rm -rf .cache
     coverage run -a -m pytest test/oai/test_client_stream.py

You should get the following error:

====================================================== test session starts ======================================================
platform linux -- Python 3.10.12, pytest-7.4.3, pluggy-1.3.0
rootdir: /workspaces/autogen
configfile: pyproject.toml
plugins: asyncio-0.23.2, anyio-4.1.0
asyncio: mode=strict
collected 4 items                                                                                                               

test/oai/test_client_stream.py ...F                                                                                       [100%]

=========================================================== FAILURES ============================================================
____________________________________________________ test_completion_stream _____________________________________________________

    @pytest.mark.skipif(skip, reason="openai>=1 not installed")
    def test_completion_stream():
        config_list = config_list_openai_aoai(KEY_LOC)
        client = OpenAIWrapper(config_list=config_list)
>       response = client.create(prompt="1+1=", model="gpt-3.5-turbo-instruct", stream=True)

test/oai/test_client_stream.py:79: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
autogen/oai/client.py:272: in create
    response.cost = self.cost(response)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

self = <autogen.oai.client.OpenAIWrapper object at 0x7f8274375120>, response = <openai.Stream object at 0x7f8275026d70>

    def cost(self, response: Union[ChatCompletion, Completion]) -> float:
        """Calculate the cost of the response."""
>       model = response.model
E       AttributeError: 'Stream' object has no attribute 'model'

autogen/oai/client.py:468: AttributeError
==================================================== short test summary info ====================================================
FAILED test/oai/test_client_stream.py::test_completion_stream - AttributeError: 'Stream' object has no attribute 'model'
================================================== 1 failed, 3 passed in 2.93s ==================================================
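The traceback above shows the root cause: cost() assumes a completed (Chat)Completion, but with stream=True the wrapper receives a generator-like openai.Stream that has no model attribute. A minimal sketch of the kind of guard that avoids the crash; the classes and price table here are toy stand-ins for illustration, not autogen's actual implementation:

```python
class Stream:
    """Stand-in for openai.Stream: an unconsumed chunk iterator with no .model."""

class Completion:
    """Stand-in for a completed response object."""
    model = "gpt-3.5-turbo-instruct"

def price_for(model):
    # Hypothetical per-response price table, illustration only.
    return {"gpt-3.5-turbo-instruct": 0.0015}.get(model, 0.0)

def cost(response):
    """Calculate the cost of the response, tolerating unconsumed streams."""
    if not hasattr(response, "model"):
        # A stream must be consumed into a full response before it can be
        # priced; until then, report zero rather than raising AttributeError.
        return 0.0
    return price_for(response.model)

print(cost(Stream()))      # 0.0
print(cost(Completion()))  # 0.0015
```

In practice the fix is to consume the stream into a reconstructed completion before cost() runs, which is what handling streamed responses in client.create amounts to.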

@davorrunje (Collaborator) left a review comment:

Please fix the black formatting and the failing test as described in the comments above.

@ekzhu (Collaborator) commented Jan 7, 2024

@sonichi run openai tests

@davorrunje (Collaborator) commented:
> @sonichi I made local changes to use tool_calls instead of function_call, but cannot test it properly before #974 is merged. I propose to wait for #974 and then test this properly before merging it.

That PR was merged, so I updated from main and passed the tests. Manually firing up streaming and non-streaming agents worked as well.

After this, I have 2 PRs incoming which will resolve some dependent issues (mostly feature requests), and I'll replace all token counters in the repo with tiktoken (or similar/faster based on tests).

@bitnom as @sonichi noticed, choice.delta.function_call is deprecated and we should use choice.delta.tool_calls instead (https://platform.openai.com/docs/api-reference/chat/streaming). If tests are passing, we are probably not triggering tool_calls but a deprecated function_call. This could be fixed in this PR or a new one. I can do it tomorrow in either case.
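Moving from choice.delta.function_call to choice.delta.tool_calls changes the accumulation: each streamed delta carries an index identifying which of possibly several tool calls it belongs to. A sketch of that merge, using dict-shaped stand-ins for the OpenAI streaming objects (illustration only, not the eventual autogen implementation):

```python
def accumulate_tool_calls(deltas):
    """Merge streamed tool_calls deltas into complete calls, keyed by index.

    The id and function name typically arrive once per call; the JSON
    arguments arrive as string fragments to be concatenated.
    """
    calls = {}
    for delta in deltas:
        for tc in delta.get("tool_calls") or []:
            slot = calls.setdefault(
                tc["index"],
                {"id": None, "type": "function",
                 "function": {"name": "", "arguments": ""}},
            )
            if tc.get("id"):
                slot["id"] = tc["id"]
            fn = tc.get("function") or {}
            if fn.get("name"):
                slot["function"]["name"] += fn["name"]
            if fn.get("arguments"):
                slot["function"]["arguments"] += fn["arguments"]
    return [calls[i] for i in sorted(calls)]

# Simulated stream for a single tool call split across three deltas.
stream = [
    {"tool_calls": [{"index": 0, "id": "call_1",
                     "function": {"name": "add"}}]},
    {"tool_calls": [{"index": 0, "function": {"arguments": '{"a": 1,'}}]},
    {"tool_calls": [{"index": 0, "function": {"arguments": ' "b": 2}'}}]},
]
calls = accumulate_tool_calls(stream)
print(calls[0]["function"]["name"])       # add
print(calls[0]["function"]["arguments"])  # {"a": 1, "b": 2}
```

The per-index bookkeeping is the main difference from the deprecated single function_call field, which never needed a key.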

@ekzhu (Collaborator) commented Jan 7, 2024

Thanks @davorrunje for pointing this out; I missed it. Let's do it in this PR as it is the most relevant.

@bitnom (Contributor, Author) commented Jan 7, 2024

> @sonichi I made local changes to use tool_calls instead of function_call, but cannot test it properly before #974 is merged. I propose to wait for #974 and then test this properly before merging it.
>
> That PR was merged, so I updated from main and passed the tests. Manually firing up streaming and non-streaming agents worked as well.
> After this, I have 2 PRs incoming which will resolve some dependent issues (mostly feature requests), and I'll replace all token counters in the repo with tiktoken (or similar/faster based on tests).
>
> @bitnom as @sonichi noticed, choice.delta.function_call is deprecated and we should use choice.delta.tool_calls instead (https://platform.openai.com/docs/api-reference/chat/streaming). If tests are passing, we are probably not triggering tool_calls but a deprecated function_call. This could be fixed in this PR or a new one. I can do it tomorrow in either case.

> Thanks @davorrunje for pointing this out; I missed it. Let's do it in this PR as it is the most relevant.

This is correct. My initial reaction was to want to do it separately, since autogen was built on the now-deprecated methods, though I know commits using the tools API have already been merged.

I have yet to come to terms with this deprecation myself; I'll read up on it. If someone can zip through it ahead of me, please feel free to go ahead with it. I have some tasks I must complete before I can get to it.

@sonichi (Contributor) commented Jan 8, 2024

Perhaps we can merge this PR first and add support for tool call in a different PR.

@sonichi sonichi added this pull request to the merge queue Jan 8, 2024
Merged via the queue into microsoft:main with commit 78a2d84 Jan 8, 2024
79 of 84 checks passed
@tyler-suard-parker (Contributor) commented:
@bitnom I need streaming with function calling too. Is there anything I can do to help?

@davorrunje (Collaborator) commented:
> @bitnom I need streaming with function calling too. Is there anything I can do to help?

This is merged, but it supports only the deprecated function calls. I am working on supporting the tool calls that replaced them; it should be finished this week.

@tyler-suard-parker (Contributor) commented:
@davorrunje Ok, thank you.

@tyler-suard-parker (Contributor) commented:
I tried downloading and using this updated repo, and it is still not streaming.

@davorrunje (Collaborator) commented:
@tyler-suard-parker I made #1184 which should fix it. You could try it out by installing pyautogen from the branch.

@tyler-suard-parker (Contributor) commented:
@davorrunje thank you, I really appreciate your help. I will try it now.

whiskyboy pushed a commit to whiskyboy/autogen that referenced this pull request Apr 17, 2024
* update colab link

* typo

* upload file instruction
whiskyboy pushed a commit to whiskyboy/autogen that referenced this pull request Apr 17, 2024
* Handle streamed function calls

* apply black formatting

* rm unnecessary stdout print

* bug fix

---------

Co-authored-by: Davor Runje <[email protected]>
Co-authored-by: Eric Zhu <[email protected]>
Labels
tool-usage suggestion and execution of function/tool call
Development

Successfully merging this pull request may close these issues.

Streaming is disabled for all conversation when agents are using functions
7 participants