[EAGLE-3698] - model upload handles multiple batch #227

phatvo9 · 2023-11-28T07:35:21Z

Why

For now model only predicts one by one input even sending a batch with size >1.

How

get_predictions() method in inference.py will take a list of inputs instead of single input.
Update examples with batch input.
Update doc

Other updates:

insert triton decorator function when initializing model repository, so user won't forget to do this.
enable infer param description from_kwargs.

Note:

Models generated by a lower version will not function on this version.

HarmitMinhas96

This PR is currently very large. Can it be broken down into more manageable chunks for review?
E.g.:

Add multiple batches handling in one PR (maybe two if it can be logicially split)
Add new model type example text-embedder
Add new model type example multimodal-embedder
Add vllm example

Or the model type and vllm examples can be added first if you prefer

phatvo9 · 2023-11-30T11:05:30Z

This PR is currently very large. Can it be broken down into more manageable chunks for review? E.g.:

Add multiple batches handling in one PR (maybe two if it can be logicially split)

Add new model type example text-embedder

Add new model type example multimodal-embedder

Add vllm example

Or the model type and vllm examples can be added first if you prefer

Broke down into #236 : update code for batching and update old examples and #237 added text-embedder and multimodal-embedder examples
Since vllm example is merged #217, so I added it together with old examples.

phatvo9 · 2023-12-04T04:12:07Z

Closed it since #236 and #237 merged

phatvo9 added 9 commits November 22, 2023 13:54

init

241b82e

init

53b362e

update

9de2162

Merge branch 'EAGLE-3652' into EAGLE-3698

24bf68f

update examples

4c88e5b

update doc

fd02274

update clarifai version

db8c11b

enable infer param description from_kwargs

d329119

update infer param doc

1612822

phatvo9 requested review from deigen, ackizilkale and HarmitMinhas96 November 28, 2023 07:35

HarmitMinhas96 reviewed Nov 29, 2023

View reviewed changes

This was referenced Nov 30, 2023

[EAGLE-3698]-split-handle batch #236

Merged

[EAGLE-3698]-split- add new model type example #237

Merged

Merge branch 'master' into EAGLE-3698

9a7f0e0

phatvo9 closed this Dec 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EAGLE-3698] - model upload handles multiple batch #227

[EAGLE-3698] - model upload handles multiple batch #227

phatvo9 commented Nov 28, 2023

HarmitMinhas96 left a comment •

edited

Loading

phatvo9 commented Nov 30, 2023

phatvo9 commented Dec 4, 2023

[EAGLE-3698] - model upload handles multiple batch #227

[EAGLE-3698] - model upload handles multiple batch #227

Conversation

phatvo9 commented Nov 28, 2023

Why

How

Other updates:

Note:

HarmitMinhas96 left a comment • edited Loading

Choose a reason for hiding this comment

phatvo9 commented Nov 30, 2023

phatvo9 commented Dec 4, 2023

HarmitMinhas96 left a comment •

edited

Loading