This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
CUB 2.1.0 #645
alliepiper announced in Announcements
Breaking Changes

- Deprecate the `CUB_USE_COOPERATIVE_GROUPS` macro, as all supported CTK distributions provide CG. This macro will be removed in a future version of CUB.

New Features

- Add the `DeviceBatchMemcpy` algorithm.
- Add the `DeviceMergeSort::StableSortKeysCopy` API. Thanks to David Wendt (@davidwendt) for this contribution.
- `DeviceRadixSort`. Thanks to Andy Adinets (@canonizer) for this contribution.
- `CUB_DISABLE_CDP`.
- `DeviceReduce`.

Bug Fixes

- Fix a bug in `BlockShuffle` resulting from an invalid thread offset. Thanks to @sjfeng1999 for this contribution.
- Fix `BlockRadixRank` when used with blocks that are not a multiple of 32 threads.
- `SHFL_IDX_SYNC` in block radix rank (#579): Ensure that all threads in the logical warp participate in the index-shuffle for `BlockRadixRank`. Thanks to Andy Adinets (@canonizer) for this contribution.
- Fix `DeviceSegmentedSort` when used with `bool` keys.
- `DeviceReduce`.
- Fix `DeviceRunLengthEncode` when the first item is a `NaN`.
- Fix `WarpScanExclusive` for vector types.

Other Enhancements

- `BlockReduceRaking` docs for non-commutative operations. Thanks to Tobias Ribizel (@upsj) for this contribution.

This discussion was created from the release CUB 2.1.0.