Specialize fill and fill_n for vector<bool> #879

miscco · 2020-06-05T10:09:22Z

This partially addresses #625 by specializing fill and fill_n

This is essentially a subpart of #750 with one of the algorithms where the implementation is both simple and a clear win.

Adresses microsoft#625

Addresses microsoft#625

stl/inc/vector

BillyONeal · 2020-07-21T21:19:27Z

I spent some time messing with this and ended up effectively identical to yours except the calculation of _LastSourceMask is guarded by _Last._Myoff != 0 which resolves the forbidden full shift issue.

template <class _FwdIt, class _Ty>
_CONSTEXPR20 void _Fill_vbool(_FwdIt _First, _FwdIt _Last, const _Ty& _Val) {
    // Set [_First, _Last) to _Val
    if (_First == _Last) {
        return;
    }

    const auto _FillVal         = static_cast<_Vbase>(_Val ? -1 : 0);
    _Vbase* _Vb_first            = const_cast<_Vbase*>(_First._Myptr);
    const _Vbase* const _Vb_last = const_cast<_Vbase*>(_Last._Myptr);

    _STL_INTERNAL_CHECK(_First._Myoff != _VBITS);
    const auto _FirstSourceMask = static_cast<_Vbase>(-1) << _First._Myoff;
    const auto _FirstDestMask   = static_cast<_Vbase>(~_FirstSourceMask);

    if (_Vb_first == _Vb_last) {
        // special case only one block
        _STL_INTERNAL_CHECK(_Last._Myoff != 0); // because we have at least one bit to set
        const auto _LastSourceMask  = static_cast<_Vbase>(-1) >> (_VBITS - _Last._Myoff);
        const auto _LastDestMask    = static_cast<_Vbase>(~_LastSourceMask);
        const auto _SourceMask = _FirstSourceMask & _LastSourceMask;
        const auto _DestMask   = _FirstDestMask | _LastDestMask;
        *_Vb_first               = (*_Vb_first & _DestMask) | (_FillVal & _SourceMask);
        return;
    }

    // fill in trailing bits of the first block and move _Vb_first up
    *_Vb_first               = (*_Vb_first & _FirstDestMask) | (_FillVal & _FirstSourceMask);
    ++_Vb_first;

#ifdef __cpp_lib_is_constant_evaluated
    if (_STD is_constant_evaluated()) {
        for (; _Vb_first != _Vb_last; ++_Vb_first) {
            *_Vb_first = _FillVal;
        }
    } else
#endif // __cpp_lib_is_constant_evaluated
    {
        const auto _Vb_first_ch = reinterpret_cast<const char*>(_Vb_first);
        const auto _Vb_last_ch  = reinterpret_cast<const char*>(_Vb_last);
        const auto _Count_ch  = static_cast<size_t>(_Vb_last_ch - _Vb_first_ch);
        const auto _ValChar   = static_cast<unsigned char>(_Val ? -1 : 0);
        _CSTD memset(_Vb_first, _ValChar, _Count_ch);
        _Vb_first += _Vb_last - _Vb_first;
    }

    if (_Last._Myoff != 0) {
        // fill in leading bits of the last block *_Vb_last
        const auto _LastSourceMask  = static_cast<_Vbase>(-1) >> (_VBITS - _Last._Myoff);
        const auto _LastDestMask    = static_cast<_Vbase>(~_LastSourceMask);
        *_Vb_last = (*_Vb_last & _LastDestMask) | (_FillVal & _LastSourceMask);
    }
}

does that seem OK?

BillyONeal

We need to not do the forbidden full shift; other than that my other comments were extreme nitpicks or wrong.

miscco · 2020-07-22T07:16:51Z

Thanks that looks really good. I believe the _STL_INTERNAL_CHECK(_Last._Myoff != 0); // because we have at least one bit to set in the special case is not needed as we have the _First == _Last early return at the beginning.

I agree that static casts are not safe per se. That said, integer promotions are notoriously tricky and who knows what happens if _Vbase changes some time in the future.

BillyONeal · 2020-07-22T16:21:25Z

Thanks that looks really good. I believe the _STL_INTERNAL_CHECK(_Last._Myoff != 0); // because we have at least one bit to set in the special case is not needed as we have the _First == _Last early return at the beginning.

The comment follows exactly because of that check at the beginning; asserts are free for product code :)

miscco · 2020-07-23T05:04:35Z

Thanks a lot @BillyONeal

StephanTLavavej

Thanks for optimizing vector<bool>! I found some issues, but the core approach appears to be solid.

stl/inc/vector

tests/std/tests/GH_000625_vector_bool_optimization/test.cpp

stl/inc/xutility

tests/std/tests/GH_000625_vector_bool_optimization/test.cpp

…tors

StephanTLavavej

Looks good, will validate and push minor changes for the last remaining issues. Thanks!

stl/inc/xutility

stl/inc/vector

StephanTLavavej · 2020-08-01T22:59:08Z

Thanks for this significant perf improvement - it fills me with joy! 😹

miscco requested a review from a team as a code owner June 5, 2020 10:09

miscco force-pushed the vector_bool_fill branch 2 times, most recently from 1461cae to dbc59be Compare June 5, 2020 12:35

StephanTLavavej added the performance Must go faster label Jun 5, 2020

miscco force-pushed the vector_bool_fill branch from dbc59be to 8bfcede Compare June 5, 2020 18:41

miscco added 2 commits June 5, 2020 21:35

[vector] Add specialization of fill

a714e76

Adresses microsoft#625

[vector] Add specialization of fill_n

f477816

Addresses microsoft#625

miscco force-pushed the vector_bool_fill branch from 8bfcede to f477816 Compare June 5, 2020 19:37

miscco commented Jun 8, 2020

View reviewed changes

stl/inc/vector Outdated Show resolved Hide resolved

Use static_casts instead of C-casts

1e014d8

barcharcraz reviewed Jun 18, 2020

View reviewed changes

stl/inc/vector Outdated Show resolved Hide resolved

Use const methods

f64509e

Implement mnatsuhara comment

d9be996

miscco force-pushed the vector_bool_fill branch from 5144f95 to d9be996 Compare June 26, 2020 18:30

BillyONeal reviewed Jul 21, 2020

View reviewed changes

BillyONeal suggested changes Jul 22, 2020

View reviewed changes

BillyONeal added 3 commits July 22, 2020 10:34

Merge remote-tracking branch 'origin/master' into vector_bool_fill

4d0b7ae

Rename _UFirst -> _VbFirst, _ULast -> _VbLast.

f694d12

Guard forbidden full shift.

52cbae1

BillyONeal approved these changes Jul 22, 2020

View reviewed changes

mnatsuhara assigned StephanTLavavej Jul 22, 2020

miscco mentioned this pull request Jul 23, 2020

Specialize some algorithms for vector<bool> #750

Closed

StephanTLavavej requested changes Jul 29, 2020

View reviewed changes

StephanTLavavej removed their assignment Jul 29, 2020

Address STL comments

6f2f109

mnatsuhara assigned StephanTLavavej Jul 29, 2020

miscco added 2 commits July 30, 2020 09:19

Expand the _Is_vb_iterator to differentiate const and non-const itera…

ce674c5

…tors

Go back to _FwdIt to stay consistent with incoming algorithms

8493d52

miscco force-pushed the vector_bool_fill branch from 40a3478 to 8493d52 Compare July 30, 2020 07:19

BillyONeal approved these changes Jul 31, 2020

View reviewed changes

BillyONeal requested a review from StephanTLavavej July 31, 2020 21:29

StephanTLavavej reviewed Aug 1, 2020

View reviewed changes

StephanTLavavej added 2 commits July 31, 2020 18:28

Code review feedback.

ff94d0b

Merge branch 'master' into vector_bool_fill

acaef25

StephanTLavavej approved these changes Aug 1, 2020

View reviewed changes

StephanTLavavej merged commit 1ff3d1e into microsoft:master Aug 1, 2020

miscco deleted the vector_bool_fill branch September 4, 2020 18:34

StephanTLavavej mentioned this pull request Nov 7, 2023

Various cleanups: Declare _Meow_vbool to avoid ADL #4151

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specialize fill and fill_n for vector<bool> #879

Specialize fill and fill_n for vector<bool> #879

miscco commented Jun 5, 2020 •

edited

Loading

BillyONeal commented Jul 21, 2020 •

edited

Loading

BillyONeal left a comment

miscco commented Jul 22, 2020

BillyONeal commented Jul 22, 2020

miscco commented Jul 23, 2020

StephanTLavavej left a comment

StephanTLavavej left a comment

StephanTLavavej commented Aug 1, 2020

Specialize fill and fill_n for vector<bool> #879

Specialize fill and fill_n for vector<bool> #879

Conversation

miscco commented Jun 5, 2020 • edited Loading

BillyONeal commented Jul 21, 2020 • edited Loading

BillyONeal left a comment

Choose a reason for hiding this comment

miscco commented Jul 22, 2020

BillyONeal commented Jul 22, 2020

miscco commented Jul 23, 2020

StephanTLavavej left a comment

Choose a reason for hiding this comment

StephanTLavavej left a comment

Choose a reason for hiding this comment

StephanTLavavej commented Aug 1, 2020

miscco commented Jun 5, 2020 •

edited

Loading

BillyONeal commented Jul 21, 2020 •

edited

Loading