Fix vectorized `min/max/minmax_element` for 64-bit types on x86 #2821

StephanTLavavej · 2022-06-24T05:37:45Z

This fixes a regression caused by #2447 that was specific to 64-bit types on x86, reported by internal VSO-1558536 "[RWC][prod/fe][Regression][x86] LLVM one test 'Profile/c-counter-overflows.c' failed".

Thanks to @AlexGuteniev for finding the root cause and providing the fix. The problem is that `_mm_extract_epi32` and `_mm_cvtsi128_si32` both return `int`. By directly casting to `_Unsigned_t`, i.e. `uint64_t` in `_Minmax_traits_8`, values with the high bit set were being sign-extended, but the desired behavior was zero-extension. Adding initial casts of `static_cast<uint32_t>` fixes this.
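To illustrate the sign-extension issue, here is a minimal, portable sketch. It does not call the real intrinsics: plain `int` parameters stand in for the values returned by `_mm_extract_epi32` and `_mm_cvtsi128_si32`, and the helper names (`widen_buggy`, `widen_fixed`, `combine_buggy`, `combine_fixed`) are hypothetical. Recombining the two 32-bit halves with a shift and an OR is an assumption made for illustration, not a quote of the STL source.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Buggy pattern: converting a negative int straight to uint64_t
// sign-extends, so the upper 32 bits all become 1.
inline std::uint64_t widen_buggy(int lane) {
    return static_cast<std::uint64_t>(lane);
}

// Fixed pattern: going through uint32_t first keeps only the low 32 bits,
// and the second conversion then zero-extends.
inline std::uint64_t widen_fixed(int lane) {
    return static_cast<std::uint64_t>(static_cast<std::uint32_t>(lane));
}

// Hypothetical recombination of a 64-bit element from two 32-bit lanes
// (assumed shift-and-OR; the lanes arrive as int, as the intrinsics return).
inline std::uint64_t combine_buggy(int high, int low) {
    return (widen_buggy(high) << 32) | widen_buggy(low);
}

inline std::uint64_t combine_fixed(int high, int low) {
    return (widen_fixed(high) << 32) | widen_fixed(low);
}
```

With `high == 1` and `low == std::numeric_limits<int>::min()` (bit pattern `0x80000000`), `combine_fixed` yields `0x0000000180000000`, while `combine_buggy` yields `0xFFFFFFFF80000000` because the sign-extended low half overwrites the high half. Inputs whose low lane is non-negative give the same result either way, which fits the observation that a narrow test range concealed the bug.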

Then, I'm enhancing the test coverage:

* Adding a few specific test cases that failed.
* Truly randomizing the test, so that every run provides unique coverage.
* Using wider distributions of values for the min/max testing, since the limited [1, 20] range concealed this bug. Now, integers use the full range, while floating-point values use a large range centered around 0. (Unfortunately, `uniform_real_distribution` requires `b - a <= max`, so we can't generate a full range without extra work. As floating-point is not yet vectorized, I felt that this was sufficient for now.)
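A rough sketch of what randomized, full-range testing of `max_element` could look like follows. This is not the code in `test.cpp`: the helper names (`make_full_range_input`, `reference_max_element`, `max_element_matches_reference`) are hypothetical, and comparing against a hand-written scalar loop is an assumed strategy chosen for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <iterator>
#include <limits>
#include <random>
#include <vector>

// Hypothetical helper: n values drawn uniformly from the full range of T.
// T must be an integer type accepted by uniform_int_distribution
// (e.g. std::int64_t or std::uint64_t, not a character type).
template <class T>
std::vector<T> make_full_range_input(std::mt19937_64& gen, std::size_t n) {
    std::uniform_int_distribution<T> dis(
        std::numeric_limits<T>::min(), std::numeric_limits<T>::max());
    std::vector<T> v(n);
    for (auto& x : v) {
        x = dis(gen);
    }
    return v;
}

// Plain scalar reference: iterator to the first largest element.
template <class It>
It reference_max_element(It first, It last) {
    It best = first;
    if (first != last) {
        for (It it = std::next(first); it != last; ++it) {
            if (*best < *it) {
                best = it;
            }
        }
    }
    return best;
}

// True if the library's max_element agrees with the scalar reference.
template <class T>
bool max_element_matches_reference(std::mt19937_64& gen, std::size_t n) {
    const std::vector<T> v = make_full_range_input<T>(gen, n);
    return std::max_element(v.begin(), v.end())
        == reference_max_element(v.begin(), v.end());
}
```

Seeding the generator from `std::random_device`, for example `std::mt19937_64 gen(std::random_device{}());`, gives different inputs on each run. Drawing from the full range means roughly half of all generated 64-bit values have bit 31 set in their low half, which is the kind of input a [1, 20] range can never produce.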

Fortunately, this regression has not yet shipped in a Preview.

Co-authored-by: Alex Guteniev <[email protected]>

tests/std/tests/VSO_0000000_vector_algorithms/test.cpp

miscco

I want to reaffirm my opinion that @AlexGuteniev is a wizard

StephanTLavavej · 2022-06-25T00:06:18Z

I'm mirroring this to the MSVC-internal repo - please notify me if any further changes are pushed.

StephanTLavavej and others added 3 commits June 23, 2022 17:50

Fix 64-bit casts on x86.

e2743e5

Co-authored-by: Alex Guteniev <[email protected]>

Add specific test cases.

80a4344

Use randomized, wide-range testing.

d0cfc91

StephanTLavavej added the bug and high priority labels Jun 24, 2022

StephanTLavavej requested a review from a team as a code owner June 24, 2022 05:37

CaseyCarter approved these changes Jun 24, 2022

AlexGuteniev approved these changes Jun 24, 2022

miscco approved these changes Jun 24, 2022

statementreply reviewed Jun 24, 2022

StephanTLavavej added 2 commits June 24, 2022 17:00

Add static_assert.

d824b0d

Floating-point? That doesn't look like anything to me.

351e37c

StephanTLavavej self-assigned this Jun 25, 2022

CaseyCarter approved these changes Jun 25, 2022

AraHaan approved these changes Jun 25, 2022

StephanTLavavej merged commit 62276be into microsoft:main Jun 25, 2022

StephanTLavavej deleted the fix-max-element branch June 25, 2022 23:00

fsb4000 pushed a commit to fsb4000/STL that referenced this pull request Aug 13, 2022

Fix vectorized min/max/minmax_element for 64-bit types on x86 (micr…

9d861c6

…osoft#2821) Co-authored-by: Alex Guteniev <[email protected]>

StephanTLavavej mentioned this pull request Apr 3, 2023

<algorithm>: Silent bad codegen for vectorized meow_element() above 4 GB #3617

Closed

AlexGuteniev mentioned this pull request May 7, 2024

vector_algorithms.cpp: minmax for 64-bit elements: replace ugly x86 workaround with a nice one #4661

Merged
