static support mp_layers #33700

wangxicoding · 2021-06-21T08:37:37Z

PR types

New features

PR changes

APIs

Describe

c_softmax_with_cross_entropy API support static graph.
mp_layers接口支持静态图，mp_layers实现动静统一。

import paddle
import paddle.distributed.fleet as fleet

paddle.enable_static()

class ColumnLinearNet(paddle.nn.Layer):
    def __init__(self, input_size, output_size):
        super(ColumnLinearNet, self).__init__()
        self.parallel_linear = fleet.meta_parallel.ColumnParallelLinear(
            in_features=input_size,
            out_features=output_size,
            weight_attr=None,
            has_bias=True,
            gather_output=True,
            name="test_column_linear")

    def forward(self, x):
        output = self.parallel_linear(x)
        return output

strategy = fleet.DistributedStrategy()
strategy.sharding = True
strategy.sharding_configs = {
    "mp_degree": 2,
    "sharding_degree": 2,
}
fleet.init(is_collective=True, strategy=strategy)

input_size, output_size = 28, 64
model_a = ColumnLinearNet(input_size, output_size)

x = paddle.static.data(name='x', shape=[None, input_size])
y = model_a(x)

python -m paddle.distributed.launch --gpus 0,1,2,3 test.py

paddle-bot-old · 2021-06-21T08:37:41Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

kuizhiqing

lgtm for new group part

JZ-LIANG

LGTM

JZ-LIANG · 2021-06-25T06:23:41Z

python/paddle/distributed/fleet/base/fleet_base.py

+            # global group
+            global_rank = self.worker_index()
+            global_world_size = self.worker_num()
+            # NOTE(wangxi): see sharding_optimizer


or we could explain the parallel2id mapping here again?

pre-assign ring ids mp: 0 sharding: 1 pure-dp: 2 global: 3 pp: >= 20 if one parallelism is not enable: -1 and only support parallelism hierarchy: mp --> sharding --> pp --> dp

done

ForFishes

LGTM

wangxicoding changed the title ~~static support c_softmax_with_cross_entropy~~ static support mp_layers Jun 22, 2021

wangxicoding added 5 commits June 22, 2021 07:37

static support c_softmax_with_cross_entropy

0881739

unified static graph tensor parallel layer

4fce47f

refine test

af76ce2

fix ci

3eb3014

fix group get_group_rank

018ab9e

wangxicoding force-pushed the static_c_cross_entorpy branch from 9bbc2bb to 018ab9e Compare June 22, 2021 07:37

wangxicoding requested review from ForFishes, kuizhiqing, sandyhouse, JZ-LIANG and gongweibao June 22, 2021 11:35

kuizhiqing approved these changes Jun 24, 2021

View reviewed changes

JZ-LIANG approved these changes Jun 25, 2021

View reviewed changes

ForFishes approved these changes Jun 25, 2021

View reviewed changes

wangxicoding merged commit 91a0acd into PaddlePaddle:develop Jun 25, 2021

wangxicoding deleted the static_c_cross_entorpy branch June 25, 2021 08:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

static support mp_layers #33700

static support mp_layers #33700

wangxicoding commented Jun 21, 2021 •

edited

Loading

paddle-bot-old bot commented Jun 21, 2021

kuizhiqing left a comment

JZ-LIANG left a comment

JZ-LIANG Jun 25, 2021

JZ-LIANG Jun 25, 2021

ForFishes left a comment

static support mp_layers #33700

static support mp_layers #33700

Conversation

wangxicoding commented Jun 21, 2021 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Jun 21, 2021

kuizhiqing left a comment

Choose a reason for hiding this comment

JZ-LIANG left a comment

Choose a reason for hiding this comment

JZ-LIANG Jun 25, 2021

Choose a reason for hiding this comment

JZ-LIANG Jun 25, 2021

Choose a reason for hiding this comment

ForFishes left a comment

Choose a reason for hiding this comment

wangxicoding commented Jun 21, 2021 •

edited

Loading