-
Notifications
You must be signed in to change notification settings - Fork 273
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement AVX512_FP16 #1605
Implement AVX512_FP16 #1605
Conversation
Have you run into any weird behavior with these, or do things seem to be working smoothly? (ignoring the ABI issue for system function calls, that is) |
No problems yet, just that |
☔ The latest upstream changes (presumably 3dd9579) made this pull request unmergeable. Please resolve the merge conflicts. |
91e0971
to
403897c
Compare
c9588c5
to
e907eba
Compare
Add-Sub-Mul-Div, Load-Store-Move, `comi`, `set`
Reciprocal, RSqrt, Sqrt, Max, Min
`getexp`, `getmant`, `roundscale`, `scalef`, `reduce`
`cmpph`, `fpclass`, reduce, `blend`, `permutex`
Add `#[inline]` to avx512ifma intrinsics Fix the test equality. Remove the stability attributes in simd types and test functions
This PR adds the AVX512_FP16 intrinsics in Rust. These intrinsics will be behind the feature gate
#[feature(stdarch_x86_avx512_f16)]
(rust-lang/rust#127213).Progress:
getexp
,getmant
,scale
)This also adds some missing inlining in
avx512ifma
and updates thex86-intel.xml
file to v3.6.9The
set1_pch
intrinsics were not implemented due to a lack of complex number type.cmpph
andfpclassph
intrinsics use inline asm because of noi1
support yet.