[Target] Replace utility functions with target.features #12455

Mousius · 2022-08-16T10:07:34Z

Following on from #12454 this patch removes the utility functions in favour of the centralised target.features property.

Following on from apache#12454 this patch removes the utility functions in favour of the centralised `target.features` property.

This removes many references to `is_aarch64` in favour of `is_asimd` for which it was often a proxy. Also relaxed a Compute Library test as the schedules are now different with proper arch detection

tvm-bot · 2022-10-20T16:33:12Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Built docs for commit 4561d40 can be found here.

_{Generated by tvm-bot}

lhutton1

Looks good, apart from one small thing!

python/tvm/relay/op/strategy/arm_cpu.py

tests/python/contrib/test_arm_compute_lib/test_network.py

Mousius · 2022-10-21T11:41:44Z

@lhutton1 fixed the typo, PTAL 😸

lhutton1

LGTM!

lhutton1 · 2022-10-24T11:56:18Z

Thanks @Mousius!

Following on from apache#12454 this patch removes the utility functions in favour of the centralised `target.features` property.

32-bit targets apache#12455 slightly altered the behaviour when selecting an int8 conv2d schedule. Previously conditions that decide which schedule to select used `is_aarch64` which checks for the existance of `aarch64` in the target triple. However, the conditions now use `has_asimd` which is true if `aarch64` exists in the target triple OR `+neon` is used in the mattr. Both `conv2d_NHWC_quantized_interleaved.arm_cpu` and `depthwise_conv2d_nhwc.arm_cpu` makes calls to LLVM intrinsics that require both `aarch64` and `+neon`. But in the case of the target `rasp4b`, the updated conditions result in compilation failure since the target has `+neon` but doesn't have `aarch64` in the target triple. The conditions have been updated to fix the compilation failure. Likewise, the previous behaviour of the condition for `conv2d_nhwc_spatial_pack.arm_cpu` has been restored ensure a program with a 32-bit target can still be compiled. Finally, we should only select the `depthwise_conv2d_nhwc_dsp.arm_cpu` schedule when a backend that understands `pragma_import_c` has been selected, i.e. "c". For a more detailed discussion of the issue please see: https://discuss.tvm.apache.org/t/tflite-llvm-llvm-error-when-compiling-tflite-model/15411 Change-Id: Idcf541ecdb7fee7d392bfbe5bd1f7cb478408938

…-bit targets (#15468) [Relay][Strategy] Fix `arm_cpu` int8 conv2d schedule selection for 32-bit targets #12455 slightly altered the behaviour when selecting an int8 conv2d schedule. Previously conditions that decide which schedule to select used `is_aarch64` which checks for the existance of `aarch64` in the target triple. However, the conditions now use `has_asimd` which is true if `aarch64` exists in the target triple OR `+neon` is used in the mattr. Both `conv2d_NHWC_quantized_interleaved.arm_cpu` and `depthwise_conv2d_nhwc.arm_cpu` makes calls to LLVM intrinsics that require both `aarch64` and `+neon`. But in the case of the target `rasp4b`, the updated conditions result in compilation failure since the target has `+neon` but doesn't have `aarch64` in the target triple. The conditions have been updated to fix the compilation failure. Likewise, the previous behaviour of the condition for `conv2d_nhwc_spatial_pack.arm_cpu` has been restored ensure a program with a 32-bit target can still be compiled. Finally, we should only select the `depthwise_conv2d_nhwc_dsp.arm_cpu` schedule when a backend that understands `pragma_import_c` has been selected, i.e. "c". For a more detailed discussion of the issue please see: https://discuss.tvm.apache.org/t/tflite-llvm-llvm-error-when-compiling-tflite-model/15411

Mousius force-pushed the target-parser-aprofile-rollout branch 3 times, most recently from 341389d to 523f186 Compare August 16, 2022 16:19

Mousius force-pushed the target-parser-aprofile-rollout branch from 523f186 to dcc1ddd Compare August 25, 2022 09:46

Mousius force-pushed the target-parser-aprofile-rollout branch from dcc1ddd to df0ad05 Compare October 13, 2022 16:58

Mousius mentioned this pull request Oct 13, 2022

[TOPI] Re-organise Arm(R) Cortex(R) CPU schedules to reflect CPU and Features #13064

Closed

areusch added needs-triage PRs or issues that need to be investigated by maintainers to find the right assignees to address it and removed needs-triage PRs or issues that need to be investigated by maintainers to find the right assignees to address it labels Oct 19, 2022

[Target] Replace utility functions with target.features

b3dfa48

Following on from apache#12454 this patch removes the utility functions in favour of the centralised `target.features` property.

Mousius force-pushed the target-parser-aprofile-rollout branch from df0ad05 to b3dfa48 Compare October 20, 2022 09:30

Mousius added 2 commits October 20, 2022 11:11

Fix usages of aarch64 in targets and tests

0ffa19b

This removes many references to `is_aarch64` in favour of `is_asimd` for which it was often a proxy. Also relaxed a Compute Library test as the schedules are now different with proper arch detection

More comprehensive testing of is_int8_hw_support

960a519

Mousius force-pushed the target-parser-aprofile-rollout branch from 6b7629b to 960a519 Compare October 20, 2022 11:28

Mousius marked this pull request as ready for review October 20, 2022 13:50

Mousius force-pushed the target-parser-aprofile-rollout branch from 678f54c to 6df4400 Compare October 20, 2022 13:54

lhutton1 reviewed Oct 21, 2022

View reviewed changes

python/tvm/relay/op/strategy/arm_cpu.py Outdated Show resolved Hide resolved

lhutton1 reviewed Oct 21, 2022

View reviewed changes

tests/python/contrib/test_arm_compute_lib/test_network.py Outdated Show resolved Hide resolved

Fix testing targets

2870b0d

Mousius force-pushed the target-parser-aprofile-rollout branch from 6df4400 to 2870b0d Compare October 21, 2022 11:41

Less aggressive accuracy reduction for Compute Library test

4561d40

Mousius force-pushed the target-parser-aprofile-rollout branch from ce573d3 to 4561d40 Compare October 24, 2022 09:01

lhutton1 approved these changes Oct 24, 2022

View reviewed changes

lhutton1 merged commit 3131cdc into apache:main Oct 24, 2022

Mousius deleted the target-parser-aprofile-rollout branch October 24, 2022 12:32

leandron mentioned this pull request Feb 1, 2023

TVM v0.11.0 Release Candidate Notes #13899

Closed

lhutton1 mentioned this pull request Aug 3, 2023

[Relay][Strategy] Fix arm_cpu int8 conv2d schedule selection for 32-bit targets #15468

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Target] Replace utility functions with target.features #12455

[Target] Replace utility functions with target.features #12455

Uh oh!

Mousius commented Aug 16, 2022

Uh oh!

tvm-bot commented Oct 20, 2022 •

edited

Loading

Uh oh!

lhutton1 left a comment

Uh oh!

Uh oh!

Uh oh!

Mousius commented Oct 21, 2022

Uh oh!

lhutton1 left a comment

Uh oh!

lhutton1 commented Oct 24, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Target] Replace utility functions with target.features #12455

[Target] Replace utility functions with target.features #12455

Uh oh!

Conversation

Mousius commented Aug 16, 2022

Uh oh!

tvm-bot commented Oct 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lhutton1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Mousius commented Oct 21, 2022

Uh oh!

lhutton1 left a comment

Choose a reason for hiding this comment

Uh oh!

lhutton1 commented Oct 24, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tvm-bot commented Oct 20, 2022 •

edited

Loading