Implement AVX512_FP16 #1605

sayantn · 2024-07-02T11:21:52Z

This PR adds the AVX512_FP16 intrinsics in Rust. These intrinsics will be behind the feature gate #[feature(stdarch_x86_avx512_f16)] (rust-lang/rust#127213).

Progress:

This also adds some missing inlining in avx512ifma and updates the x86-intel.xml file to v3.6.9

The set1_pch intrinsics were not implemented due to a lack of complex number type.
cmpph and fpclassph intrinsics use inline asm because of no i1 support yet.

rustbot · 2024-07-02T11:21:56Z

r? @Amanieu

rustbot has assigned @Amanieu.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

tgross35 · 2024-07-04T20:05:51Z

Have you run into any weird behavior with these, or do things seem to be working smoothly? (ignoring the ABI issue for system function calls, that is)

sayantn · 2024-07-04T20:08:58Z

No problems yet, just that simd_fabs doesn't accept a f16 argument, so i will just use an and operation. I am actively avoiding doing f16 operations in rust, but that's not a blocker for sure.

bors · 2024-07-06T09:02:25Z

☔ The latest upstream changes (presumably 3dd9579) made this pull request unmergeable. Please resolve the merge conflicts.

sayantn · 2024-07-17T13:22:00Z

cc @tgross35 @beetrees

crates/core_arch/src/simd.rs

crates/core_arch/src/x86/test.rs

Add-Sub-Mul-Div, Load-Store-Move, `comi`, `set`

Reciprocal, RSqrt, Sqrt, Max, Min

`getexp`, `getmant`, `roundscale`, `scalef`, `reduce`

`cmpph`, `fpclass`, reduce, `blend`, `permutex`

Add `#[inline]` to avx512ifma intrinsics Fix the test equality. Remove the stability attributes in simd types and test functions

rustbot assigned Amanieu Jul 2, 2024

sayantn force-pushed the fp16 branch from 2f2dac7 to d5e5ea3 Compare July 3, 2024 18:46

sayantn force-pushed the fp16 branch 4 times, most recently from 91e0971 to 403897c Compare July 12, 2024 07:11

tgross35 mentioned this pull request Jul 12, 2024

Add f16 and f128 as simd types in LLVM rust-lang/rust#127487

Merged

sayantn force-pushed the fp16 branch 3 times, most recently from c9588c5 to e907eba Compare July 15, 2024 17:17

sayantn marked this pull request as ready for review July 17, 2024 13:18

sayantn mentioned this pull request Jul 17, 2024

Tracking Issue for AVX512_FP16 intrinsics rust-lang/rust#127213

Open

2 tasks

tgross35 mentioned this pull request Jul 18, 2024

Tracking Issue for f16 and f128 float types rust-lang/rust#116909

Open

84 tasks

Amanieu reviewed Jul 25, 2024

View reviewed changes

crates/core_arch/src/simd.rs Outdated Show resolved Hide resolved

crates/core_arch/src/x86/test.rs Outdated Show resolved Hide resolved

sayantn added 11 commits July 26, 2024 08:55

AVX512FP16 Part 0: Types

ac370a7

AVX512FP16 Part 1

1b093be

Add-Sub-Mul-Div, Load-Store-Move, `comi`, `set`

AVX512_FP16 Part 2: Complex Multiplication

bf92f83

AVX512FP16 Part 3: FMA

0bec23b

AVX512FP16 Part 4: Math functions

e6a5910

Reciprocal, RSqrt, Sqrt, Max, Min

AVX512FP16 Part 5: FP-Support

4872108

`getexp`, `getmant`, `roundscale`, `scalef`, `reduce`

AVX512FP16 Part 6: Remaining

d304918

`cmpph`, `fpclass`, reduce, `blend`, `permutex`

AVX512FP16 Part 7: Convert to f16

2ae57f0

AVX512FP16 Part 8: Convert from f16

57641cc

AVX512FP16 Part 9: Remaining avx512fp16 and avxneconvert

cf01aba

Update Intrinsics List to v3.6.9

8a5e971

Add `#[inline]` to avx512ifma intrinsics Fix the test equality. Remove the stability attributes in simd types and test functions

sayantn force-pushed the fp16 branch from 81512ca to 8a5e971 Compare July 26, 2024 03:26

Amanieu merged commit fb90dfa into rust-lang:master Jul 26, 2024
30 checks passed

sayantn deleted the fp16 branch July 27, 2024 11:06

tgross35 mentioned this pull request Aug 20, 2024

Add SIMD operations that use f16 and f128 rust-lang/rust#125440

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement AVX512_FP16 #1605

Implement AVX512_FP16 #1605

sayantn commented Jul 2, 2024 •

edited

Loading

rustbot commented Jul 2, 2024

tgross35 commented Jul 4, 2024

sayantn commented Jul 4, 2024

bors commented Jul 6, 2024

sayantn commented Jul 17, 2024

Implement AVX512_FP16 #1605

Implement AVX512_FP16 #1605

Conversation

sayantn commented Jul 2, 2024 • edited Loading

Progress:

rustbot commented Jul 2, 2024

tgross35 commented Jul 4, 2024

sayantn commented Jul 4, 2024

bors commented Jul 6, 2024

sayantn commented Jul 17, 2024

sayantn commented Jul 2, 2024 •

edited

Loading