RISC V vector predication support intrinsics support #7119

zvookin · 2022-10-25T05:28:56Z

Have RISC V use LLVM vector predication intrinsics and add support for some architecture specific intrinsics.

useful for RISC V, but it may be a simpler, better optimized path, for Halide vector operations in general. Add support for a maximum vector size that might be larger than the native vector size. RISC V vector LMUL support is an example of an architecture supporting this.

promotion contexts.

vector predication intrinsics.

improving the calling convention and naming of the new routines to generate the intrinsics.

caveperson programmer habits die hard. Improve comments.

concatenation into one line.

Change TODO(zalman) to TODO(zvookin) uniformly. Few other cleanups.

strided load for dense case. Add some comments.

src/CodeGen_RISCV.cpp

steven-johnson

Looks good, mostly nits

src/CodeGen_RISCV.cpp

rootjalex · 2022-10-25T20:26:35Z

@steven-johnson Do the buildbots test RISC-V cross-compilation?

If so, this PR should probably make an addition to simd_op_check, to check for the added vector instructions in this PR.

steven-johnson · 2022-10-25T20:29:38Z

@steven-johnson Do the buildbots test RISC-V cross-compilation?

If so, this PR should probably make an addition to simd_op_check, to check for the added vector instructions in this PR.

Not currently, no. We should definitely add that.

zvookin · 2022-10-25T21:20:23Z

Added an issue for simd_op_check coverage. #7122

zvookin · 2022-10-25T21:33:58Z

Yeah, I'd rant that not having the rounding mode flags in the intrinsic at the start is badly broken, but it's sort of how things are with LLVM and RISC V. Really they ought to be in the opcode, but you know, bits are expensive. I put in references to a couple PRs for LLVM but it doesn't look like there is recent movement on this.

…

On Tue, Oct 25, 2022 at 2:09 PM Andrew Adams ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In src/CodeGen_RISCV.cpp <#7119 (comment)>: > + llvm::Function *wrapper = + llvm::Function::Create(wrapper_ty, llvm::GlobalValue::InternalLinkage, + wrapper_name, module.get()); + llvm::BasicBlock *block = + llvm::BasicBlock::Create(module->getContext(), "entry", wrapper); + llvm::IRBuilderBase::InsertPoint here = builder->saveIP(); + builder->SetInsertPoint(block); + + // Set vector fixed-point rounding flag if needed for intrinsic. + bool round_down = intrin.flags & RISCVIntrinsic::RoundDown; + bool round_up = intrin.flags & RISCVIntrinsic::RoundUp; + if (round_down || round_up) { + internal_assert(!(round_down && round_up)); + llvm::Value *rounding_mode = llvm::ConstantInt::get(xlen_type, round_down ? 2 : 0); + // TODO: When LLVM finally fixes the instructions to take rounding modes, + // this will have to change to passing the rounding mode to the intrinsic. Is there an existing LLVM issue for it? Having to use inline assembly to use any of the ops that rely on a rounding flag seems ... unfinished. — Reply to this email directly, view it on GitHub <#7119 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AALOUFEADYCC7HI3PIWVT6LWFBEARANCNFSM6AAAAAARNTLZIE> . You are receiving this because you authored the thread.Message ID: ***@***.***>

std::variant for mask was not handled correctly.

arg, or result type if no vector argument.

intrinsic. (Needed for loads.) Fix result type of vector predication reductions.

zvookin · 2022-10-26T06:21:48Z

Had to do a number of bug fixes on the vector predication work as I messed up on the test coverage previously. It works better with the code in this branch...

zvookin · 2022-10-26T06:35:44Z

I believe I have addressed the feedback raised. In a couple cases, by opening tracking issues, but mostly by incorporating the changes suggested or fixing the problems raised. If I've missed anything please let me know. I also did some better testing. Obviously that should get rolled into the real tests, which is effectively the simd_op_check issue. This should be ready to land.

steven-johnson

LGTM pending green

rootjalex

Also LGTM with a nit

widths in RISC V intrinsics.

zvookin · 2022-10-26T17:09:58Z

I improved naming and comments around the handling of all the integer type widths.

Vector efficiency improvement oer code review feedback.

Turn on vector predication support for RISC V. (First architecture to use this code. Bug fixes included here.) Add architecture specific vector intrinsics support as well. Should not affect anything outside of RISC V.

Z Stern added 30 commits September 16, 2022 19:27

Add vector predicated store support.

541833c

Merge branch 'main' into llvm_vector_predication_intrinsics

282b6fd

Change how void type is handled with call_intrin, other vector

cab2f01

promotion contexts.

Merge branch 'main' into llvm_vector_predication_intrinsics

5f2972a

Merge branch 'main' into llvm_vector_predication_intrinsics

4a689e0

Merge branch 'main' into llvm_vector_predication_intrinsics

72412ac

Fix a few issues with types, order of arguments and name mangling in

a6a0ba9

vector predication intrinsics.

Add support for using @llvm.vp.reduce.* intrinsics in vector reductions.

ab2a68f

Merge branch 'llvm_vector_predication_intrinsics' into riscv_update

0abb16d

Put RISC V intrinsics support back in.

3627ef4

Merge branch 'main' into llvm_vector_predication_intrinsics

8fd2aeb

Small refactor to clean up vector predication support. Mainly

af55a2e

improving the calling convention and naming of the new routines to generate the intrinsics.

Typo slipped in.

eaa4100

Merge branch 'main' into llvm_vector_predication_intrinsics

d9b52c7

Merge branch 'llvm_vector_predication_intrinsics' into riscv_update

f3f67cb

This time for sure.

7a8201c

Merge branch 'llvm_vector_predication_intrinsics' into riscv_update

fca802f

Formatting.

c0a9679

Merge branch 'llvm_vector_predication_intrinsics' into riscv_update

97b6d82

Formatting.

100a5c1

More formatting.

db0ea7c

Merge branch 'llvm_vector_predication_intrinsics' into riscv_update

bfc72ee

Use std::optional instead of -1 bottom value for mangle_index. Simple

96dcd93

caveperson programmer habits die hard. Improve comments.

Switch to using instead of typedef per review feedback.

51f3e35

Address review feedback re: default arguments, moving string

cb0cbbc

concatenation into one line.

Add GitHub issue for fmax/fmin strict_float TODO.

740f121

Change TODO(zalman) to TODO(zvookin) uniformly. Few other cleanups.

Rearrage the maze of twisty passages to not use vector predicated

2c0df5f

strided load for dense case. Add some comments.

Merge branch 'llvm_vector_predication_intrinsics' into riscv_update

1a60098

Merge branch 'main' into riscv_update

d2d848f

rootjalex reviewed Oct 25, 2022

View reviewed changes

src/CodeGen_RISCV.cpp Outdated Show resolved Hide resolved

steven-johnson reviewed Oct 25, 2022

View reviewed changes

rootjalex reviewed Oct 25, 2022

View reviewed changes

src/CodeGen_RISCV.cpp Show resolved Hide resolved

rootjalex reviewed Oct 25, 2022

View reviewed changes

src/CodeGen_RISCV.cpp Show resolved Hide resolved

Address some review feedback.

259c33b

Z Stern added 2 commits October 25, 2022 21:07

More review feedback.

99f7e00

Add comment pointing to architecture spec on fixed-point rounding mode.

b5a4050

Add issue reference for rounding mode tracking.

f8117ee

Z Stern added 6 commits October 26, 2022 04:04

Fix crash causing typo in vector predication PR.

98c8dd9

Fix another issue with vector predication PR where switching to a

f6e7088

std::variant for mask was not handled correctly.

Another typo in switching from args to vp_args.

43ac99a

When synthesizing an all ones mask, take mask type from first vector

f8f80b6

arg, or result type if no vector argument.

Add support for mangling in the result type of a vector predication

6404db0

intrinsic. (Needed for loads.) Fix result type of vector predication reductions.

Add support for signed by unsigned widening multiply (vwmulsu).

8691848

Formatting.

e35b82c

steven-johnson self-requested a review October 26, 2022 16:21

steven-johnson approved these changes Oct 26, 2022

View reviewed changes

rootjalex approved these changes Oct 26, 2022

View reviewed changes

Z Stern added 2 commits October 26, 2022 17:06

Improve naming and comments on patterns for handling all integer type

069000f

widths in RISC V intrinsics.

Formatting.

3f5f100

Comment fix.

1e8e2aa

Vector efficiency improvement oer code review feedback.

zvookin merged commit da87cb2 into main Oct 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RISC V vector predication support intrinsics support #7119

RISC V vector predication support intrinsics support #7119

zvookin commented Oct 25, 2022 •

edited

Loading

steven-johnson left a comment

rootjalex commented Oct 25, 2022

steven-johnson commented Oct 25, 2022

zvookin commented Oct 25, 2022

zvookin commented Oct 25, 2022 via email •

edited

Loading

zvookin commented Oct 26, 2022

zvookin commented Oct 26, 2022

steven-johnson left a comment

rootjalex left a comment

zvookin commented Oct 26, 2022

RISC V vector predication support intrinsics support #7119

RISC V vector predication support intrinsics support #7119

Conversation

zvookin commented Oct 25, 2022 • edited Loading

steven-johnson left a comment

Choose a reason for hiding this comment

rootjalex commented Oct 25, 2022

steven-johnson commented Oct 25, 2022

zvookin commented Oct 25, 2022

zvookin commented Oct 25, 2022 via email • edited Loading

zvookin commented Oct 26, 2022

zvookin commented Oct 26, 2022

steven-johnson left a comment

Choose a reason for hiding this comment

rootjalex left a comment

Choose a reason for hiding this comment

zvookin commented Oct 26, 2022

zvookin commented Oct 25, 2022 •

edited

Loading

zvookin commented Oct 25, 2022 via email •

edited

Loading