[WIP] Add paddle support #206

HydrogenSulfate · 2024-11-26T04:49:45Z

Add support for the Paddle framework to array-api-compat. This is still a work in progress.

TODO List:

Related issue: PaddlePaddle/Paddle#68618

rgommers · 2024-11-26T10:11:40Z

Cool, thanks for working on this @HydrogenSulfate!

I am curious to learn a bit more about Paddle. In particular conceptually what is supported - https://www.paddlepaddle.org.cn/documentation/docs/en/guides/jit/index_en.html and a few other guides tell me a bit, but not quite what I was most interested in. A few questions if you don't mind:

Is the default execution model eager or lazy/graph-based?
It looks like there is a JIT compiler, what's the syntax and does it work similar to, for example, jax.jit or torch.compile?
Is item and slice assignment supported via __setitem__? And indexing with boolean mask?
Is mixed integer and floating-point type promotion supported?
I see it has CPU and NVIDIA GPU support, plus some other vendors of accelerators that I don't immediately recognize. Are those all GPUs as well? And ROCm and Intel XPUs are not supported (now or in the near future)?

HydrogenSulfate · 2024-11-26T14:00:52Z

Thanks very much for the reply and for your attention to this PR.

Paddle uses eager execution by default (eager Tensors running on a dynamic graph). It can be switched manually to static-graph mode (lazy Tensors running in a static computational Program) with model = paddle.jit.to_static(model).
The usage of paddle.jit.to_static is very similar to that of torch.compile and jax.jit. When designing these interfaces, we referred to influential tools such as PyTorch and JAX. The typical paddle.jit.* workflow is roughly as follows:
1. First, users write and train their models with dynamic graphs.
2. Then, if needed, they convert the model to a static-graph model with one line of code, model = paddle.jit.to_static(model), without any other modifications, and start training. Thanks to the advantages of static graphs, this usually yields a small performance improvement. The conversion has been extensively tested on our existing models, with a success rate close to 100%.
3. If higher performance is required, users can add the option to enable the CINN compiler in jit.to_static (modulus-sym code, for example), which can capture the entire computation graph, including the forward pass, the backward pass, and even double-backward (or higher-order) passes, and further accelerate the program. We have tested it on 40+ models in the NVIDIA/modulus-sym suite and achieved IPS performance that exceeds PyTorch by about 70% with the CINN compiler enabled (of course, this is partly because PyTorch does not seem to support capturing and compiling higher-order backward passes).
4. After training, the model's computational program can be saved with paddle.jit.save(model, output_path) to get a deployable model (like TensorFlow's .pb).
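As a rough illustration of steps 1 and 2 above, the sketch below runs a toy layer eagerly and then again after conversion with paddle.jit.to_static. The model, shapes, and the function name eager_vs_static are made up for this example, and the import guard is only there so the snippet can be loaded on a machine without Paddle installed. It does not enable CINN or call paddle.jit.save.

```python
import importlib.util


def eager_vs_static():
    """Run a small layer eagerly, then again after paddle.jit.to_static."""
    import paddle  # imported lazily so this file loads without Paddle

    paddle.seed(0)
    model = paddle.nn.Linear(4, 2)  # hypothetical toy model
    x = paddle.randn([8, 4])

    # Step 1: default eager (dynamic graph) execution.
    eager_out = model(x)

    # Step 2: one-line conversion to a static-graph model.
    static_model = paddle.jit.to_static(model)
    static_out = static_model(x)

    # Both execution modes should agree up to floating-point tolerance.
    same = bool(paddle.allclose(eager_out, static_out, atol=1e-5))
    return list(static_out.shape), same


# Only run the demo when Paddle is actually installed.
if importlib.util.find_spec("paddle") is not None:
    print(eager_vs_static())
```

The conversion wraps the same layer, so no change to the calling code is assumed beyond the single to_static line.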

Item, slice, and boolean-mask assignment are supported, with broadcasting, as shown below:

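import paddle

x = paddle.randn([4, 3, 2])
v = paddle.randn([3, 2])

# Item assignment: fill x[0, 1] (shape [2]) with a scalar.
x[0, 1] = 3.0
print(x)

# Slice assignment: v (shape [3, 2]) is broadcast across the first axis.
x[:] = v
print(x)

# Boolean-mask assignment: rows 0 and 2 along the first axis are set to zero.
mask = paddle.to_tensor([True, False, True, False])
x[mask] = paddle.zeros([3, 2])
print(x)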
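The snippet above covers assignment. For the other half of the question, reading values back with a boolean mask, a minimal sketch might look like the following. The function name mask_indexing_demo and the input values are invented for illustration, and the import guard only exists so the snippet loads without Paddle installed.

```python
import importlib.util


def mask_indexing_demo():
    """Select rows of a tensor with a 1-D boolean mask."""
    import paddle  # imported lazily so this file loads without Paddle

    x = paddle.arange(24, dtype="float32").reshape([4, 3, 2])
    mask = paddle.to_tensor([True, False, True, False])

    # Keeps the rows along the first axis where the mask is True (rows 0 and 2).
    selected = x[mask]

    # Return the result shape and the first element of the second kept row.
    return list(selected.shape), float(selected[1, 0, 0])


# Only run the demo when Paddle is actually installed.
if importlib.util.find_spec("paddle") is not None:
    print(mask_indexing_demo())
```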

Our implicit type promotion supports fp32/fp64 and c32/c64 promotion, but not mixed integer and bool types (the goal is to avoid covert conversions that users easily overlook and that can make a model give unexpected results). The detailed table can be checked at url:
We support XPU and ROCm; I will add these device types in subsequent commits.
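To make the promotion policy described above concrete, here is a small pure-Python sketch of a strict promotion rule: it promotes within the floating-point family and within the complex family, and raises for every other mixed pair. This is not Paddle's actual promotion table. The function name sketch_result_type, the dtype spellings (reading "c32/c64" as complex64/complex128), and the choice to reject all other mixed pairs are assumptions made for illustration.

```python
# Hypothetical, simplified promotion families; NOT Paddle's real table.
FLOAT_FAMILY = ["float32", "float64"]
COMPLEX_FAMILY = ["complex64", "complex128"]  # assumed reading of "c32/c64"


def sketch_result_type(a: str, b: str) -> str:
    """Return the promoted dtype name, or raise for unsupported mixes."""
    if a == b:
        return a  # identical dtypes never need promotion
    for family in (FLOAT_FAMILY, COMPLEX_FAMILY):
        if a in family and b in family:
            # Within a family, promote to the wider of the two dtypes.
            return family[max(family.index(a), family.index(b))]
    # Everything else (for example int32 with bool) is rejected explicitly
    # instead of being converted silently.
    raise TypeError(f"no implicit promotion between {a} and {b}")


print(sketch_result_type("float32", "float64"))
print(sketch_result_type("complex128", "complex64"))
try:
    sketch_result_type("int32", "bool")
except TypeError as exc:
    print("rejected:", exc)
```

Raising instead of converting mirrors the stated intent of avoiding conversions that users could overlook.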

rgommers · 2024-11-27T07:31:45Z

Thanks for the detailed explanation @HydrogenSulfate, much appreciated.

5. We support XPU and ROCm; I will add these device types in subsequent commits.

I'll note that I tried inferring supported devices from this Install page, where ROCm/XPU aren't yet present:

HydrogenSulfate · 2024-11-27T07:54:21Z

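Thanks for the detailed explanation @HydrogenSulfate, much appreciated.

We support XPU and ROCm; I will add these device types in subsequent commits.

I'll note that I tried inferring supported devices from this Install page, where ROCm/XPU aren't yet present: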

Embarrassingly, our English documents are somewhat outdated, so you can use the browser's translation feature to translate the Chinese documents into English.

ROCm is used in HYGON:

XPU is used in KUNLUNXIN:

rgommers · 2024-11-27T08:04:11Z

Embarrassingly, our English documents are somewhat outdated

Not embarrassing at all - we still haven't even deployed our Chinese translations on https://numpy.org/ (they're coming though!).

Thanks for the tips. Once this is ready, I'll try giving Paddle + SciPy a spin.

lucascolley · 2025-03-29T22:31:48Z

I converted this PR to draft since it looks like it is still a work in progress @HydrogenSulfate, but feel free to let us know whenever it is ready for another look!

HydrogenSulfate added 2 commits November 26, 2024 12:44

add paddle support in array-api-compat

8e5cc94

update README

7118894

HydrogenSulfate force-pushed the support_paddle branch from 14e0927 to 7118894 Compare November 26, 2024 05:06

HydrogenSulfate added 5 commits November 26, 2024 14:05

update promotion table and can_cast table

85dc3ba

update doc

c5b82db

restore code

7b99449

update docstring

bb40851

refine more code

a7163f9

add suffix for test_python_scalars and add paddle index-url in rqeuirements

ec46178

rgommers added the enhancement New feature or request label Nov 27, 2024

HydrogenSulfate mentioned this pull request Nov 29, 2024

[Ehance & Fix] Support any slice interval for indexing(__getitem__) in eager/static mode PaddlePaddle/Paddle#69827

Merged

HydrogenSulfate added 7 commits December 3, 2024 14:53

update paddle code

dfd4485

fix

5ae8ec8

update code

b10273b

fix moveaxis

8d2425e

fix default floating dtype of paddle.assaray

7b8555e

use default_dtype only when dtype is None

603c852

add floor and ceil with same return dtype

742792f

HydrogenSulfate force-pushed the support_paddle branch from 4527c3a to 742792f Compare January 9, 2025 08:10

update code

fd6eea0

lucascolley marked this pull request as draft March 29, 2025 22:31

cangtianhuang and others added 3 commits April 1, 2025 15:05

Add broadcast_tensors alias, modify result_type

37785d4

refine

0651731

Merge pull request #1 from cangtianhuang/support_paddle

2d4e571
