
[HETERO] Support splitting new graph pattern for pipeline parallel and correct the number of submodels #25224

Conversation

@WeldonWangwang WeldonWangwang commented Jun 26, 2024

Details:

  • Fix qwen1.5-14b-chat with HETERO pipeline parallelism.
    Add support for splitting the pattern (a construction sketch follows below):

    ```
    ReadValue->Gather->Concat
                    |------>ShapeOf (fused with a node of a different affinity) -> ....
    ```

  • Correct the value of HETERO_NUMBER_OF_SUBMODELS by subtracting the number of independent submodels, to reduce confusion (see the query sketch after the Tickets list).
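
For illustration, a minimal sketch of how such a subgraph could be built with the OpenVINO C++ API (not code from this PR; the shapes, the variable id "kv_cache", and the helper name make_pattern_model are placeholders):

```
// Illustrative only: builds the ReadValue -> Gather -> Concat pattern with an
// extra ShapeOf branch on the Gather output, similar to the KV-cache subgraphs
// in qwen1.5-14b-chat. All shapes and names are made up for this sketch.
#include <openvino/openvino.hpp>
#include <openvino/opsets/opset8.hpp>

std::shared_ptr<ov::Model> make_pattern_model() {
    using namespace ov;
    using namespace ov::opset8;

    auto new_tokens = std::make_shared<Parameter>(element::f32, PartialShape{-1, 4, 64});

    // Stateful cache read/write pair.
    auto variable = std::make_shared<op::util::Variable>(
        op::util::VariableInfo{PartialShape{-1, 4, 64}, element::f32, "kv_cache"});
    auto read_value = std::make_shared<ReadValue>(new_tokens, variable);

    // Gather from the cache (indices/axis values are placeholders).
    auto indices = Constant::create(element::i32, Shape{1}, {0});
    auto axis = Constant::create(element::i32, Shape{}, {0});
    auto gather = std::make_shared<Gather>(read_value, indices, axis);

    // Branch 1: concatenate the gathered cache with the new input.
    auto concat = std::make_shared<Concat>(OutputVector{gather, new_tokens}, 0);

    // Branch 2: ShapeOf on the same Gather output; under pipeline parallelism this
    // node can end up fused into a submodel with a different affinity.
    auto shape_of = std::make_shared<ShapeOf>(gather);

    auto assign = std::make_shared<Assign>(concat, variable);
    return std::make_shared<Model>(OutputVector{concat, shape_of},
                                   SinkVector{assign},
                                   ParameterVector{new_tokens});
}
```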

Tickets:

  • ticket-id
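
For illustration, a minimal sketch of how the corrected value could be read back after HETERO compilation (assuming the property is queryable on the compiled model under the name HETERO_NUMBER_OF_SUBMODELS; the model path and device list are placeholders):

```
#include <iostream>
#include <openvino/openvino.hpp>

int main() {
    ov::Core core;
    // Placeholder model path; any model compiled through the HETERO plugin would do.
    auto model = core.read_model("qwen1.5-14b-chat.xml");
    auto compiled = core.compile_model(model, "HETERO:GPU,CPU");

    // With this change, the reported count no longer includes the independent submodels.
    auto n_submodels = compiled.get_property("HETERO_NUMBER_OF_SUBMODELS").as<std::string>();
    std::cout << "HETERO_NUMBER_OF_SUBMODELS: " << n_submodels << std::endl;
    return 0;
}
```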

@github-actions github-actions bot added category: inference OpenVINO Runtime library - Inference category: HETERO OpenVINO HETERO plugin labels Jun 26, 2024
@WeldonWangwang WeldonWangwang force-pushed the wangwang/Fix_qwen1.5-14b-chat branch from aa1e1c0 to 09c2109 on June 28, 2024 02:44
@peterchen-intel peterchen-intel added this to the 2024.3 milestone Jul 1, 2024
@WeldonWangwang WeldonWangwang force-pushed the wangwang/Fix_qwen1.5-14b-chat branch from 67c96a4 to 7d1b5bc on July 1, 2024 03:25
@WeldonWangwang WeldonWangwang marked this pull request as ready for review July 1, 2024 03:26
@WeldonWangwang WeldonWangwang requested review from a team as code owners July 1, 2024 03:26
@WeldonWangwang WeldonWangwang changed the title Fix qwen1.5-14b-chat [HETERO] Fix qwen1.5-14b-chat with pipeline parallel and the number of submodels Jul 1, 2024
@peterchen-intel peterchen-intel changed the title [HETERO] Fix qwen1.5-14b-chat with pipeline parallel and the number of submodels [HETERO] Support splitting new graph pattern for pipeline parallel and correct the number of submodels Jul 3, 2024
```
bool is_shapeof = ov::is_type<op::util::ShapeOfBase>(op);
if (((fused_model_op_map.find(name) != fused_model_op_map.end()) || is_shapeof) &&
    supported.count(name)) {
    if ((!supported.count(fused_model_op_map[name]) || is_shapeof) &&
```
Contributor

It seems this is a special case (ShapeOf) of the code at lines 401 to 408?
Maybe we can consider optimizing it in the future?

Contributor Author

It is a problem that occurs when recursively looping through the entire graph; moving the ShapeOf case to L401-L408 cannot fix this issue. We will try a simpler way to optimize this API.

Contributor

Please create a ticket to follow up on whether there is a way to make this solution more general.

@peterchen-intel peterchen-intel enabled auto-merge July 5, 2024 02:19
@peterchen-intel peterchen-intel added this pull request to the merge queue Jul 5, 2024
Merged via the queue into openvinotoolkit:master with commit a5c0d67 Jul 5, 2024
122 checks passed
ieliz pushed a commit to ieliz/openvino that referenced this pull request Jul 5, 2024
…d correct the number of submodels (openvinotoolkit#25224)
