Merged
20 commits
5f17c5a
Reorganize project folders (#6)
jayhawk-commits Apr 30, 2025
fa0b83e
Merge commit '97ddb51605d9f42a575d929add981c275f10c6d7' into develop
github-actions[bot] May 2, 2025
a9a63b2
Merge commit '1b526968ba327ecc8a74011ee07faae5b9f8dee7' into develop
github-actions[bot] May 6, 2025
a35bd55
Merge commit '7347b9df219fe0f7aa44ff321e86cda3d714fb3f' into develop
github-actions[bot] May 12, 2025
84286ab
Merge commit 'a1cf3bb3166849d1de7fc8ec3c1eda8f15bb4aaa' into develop
github-actions[bot] May 13, 2025
8c1682d
Split 'projects/rocprim/' into commit '84286ab36c61c216e853d5a03429c7…
jayhawk-commits May 14, 2025
28b5236
Merge commit '303bd8ab840eaef73768161dad5f3444063b4003' into develop
assistant-librarian[bot] May 14, 2025
4569b61
Merge branch 'develop' of https://github.com/ROCm/rocm-libraries into…
jayhawk-commits May 14, 2025
e940e58
[GitHub Actions] Speed up push flow (#37)
jayhawk-commits May 14, 2025
8e9fa71
[GitHub Actions] Fix concurrency name for workflow
jayhawk-commits May 14, 2025
a1ae0cd
[GitHub Actions] More workflow adjustments for auto-push
jayhawk-commits May 14, 2025
325cebb
[GitHub Actions] Import PR preserve draft status (#36)
jayhawk-commits May 14, 2025
193b08a
implemented size reader
NguyenNhuDi May 14, 2025
70393f1
fixed memory access faults
NguyenNhuDi May 14, 2025
b2aa830
made the cast to be unsigned long long always
NguyenNhuDi May 14, 2025
2e95b83
pulled from upstream develop
NguyenNhuDi May 16, 2025
39b33eb
removed extra newline and added comments
NguyenNhuDi May 20, 2025
04817a9
Merge branch 'develop' into zenguyen/make-device-batch-offset-compati…
NguyenNhuDi May 29, 2025
20cc443
casted to buffer_size_type instead of unsigned long long
NguyenNhuDi May 29, 2025
7ff9782
Merge branch 'zenguyen/make-device-batch-offset-compatible-with-thrus…
NguyenNhuDi May 29, 2025
@@ -545,8 +545,14 @@ struct batch_memcpy_impl
 if(blev_buffer_offset < num_blev_buffers)
 {
     auto tile_buffer_id = buffer_by_size_class[blev_buffer_offset].buffer_id;
+    /* In the case that buffer_size_type is rocthrust::device_reference<T> a static cast to
+    /  buffer_size_type is needed so that the type passed into ceiling_div is not
+    /  rocthrust::device_reference<T>. This is possible since rocthrust::device_reference<T>
+    /  can be implicitly cast to type T.
+    */
+    buffer_size_type size = static_cast<buffer_size_type>(buffers.sizes[tile_buffer_id]);
     tile_offsets[i]
-        = rocprim::detail::ceiling_div(buffers.sizes[tile_buffer_id],
+        = rocprim::detail::ceiling_div(size,
                                        blev_block_size * blev_bytes_per_thread);
 }
 else
@@ -620,10 +626,15 @@ struct batch_memcpy_impl
     buffer_offset += warps_per_block)
 {
     const auto buffer_id = buffers_by_size_class[buffer_offset].buffer_id;
-
+    /* In the case that buffer_size_type is rocthrust::device_reference<T> a static cast to
+    /  buffer_size_type is needed so that the type passed into copy_items is not
+    /  rocthrust::device_reference<T>. This is possible since rocthrust::device_reference<T>
+    /  can be implicitly cast to type T.
+    */
+    buffer_size_type size = static_cast<buffer_size_type>(tile_buffers.sizes[buffer_id]);
     batch_memcpy::copy_items<IsMemCpy>(tile_buffers.srcs[buffer_id],
-                                       tile_buffers.dsts[buffer_id],
-                                       tile_buffers.sizes[buffer_id]);
+                                       tile_buffers.dsts[buffer_id],
+                                       size);
 }
 }
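
The comments in both hunks describe one idea: indexing the sizes range may yield a proxy object rather than a plain integer, and a function template called with that proxy deduces the proxy type. The sketch below illustrates this outside of rocPRIM. Everything in it is an assumption made for illustration: proxy_ref is a hypothetical stand-in for rocthrust::device_reference<T>, ceiling_div_sketch is a simplified ceiling division and not rocPRIM's ceiling_div, and size_type, the 1000-byte size and the 256-byte tile are made-up values.

```cpp
#include <cstddef>
#include <type_traits>

// Hypothetical stand-in for rocthrust::device_reference<T>: it is not a T,
// but it converts implicitly to one.
template <class T>
struct proxy_ref
{
    T value;
    constexpr operator T() const { return value; }
};

// Reports which kind of type template deduction picked for the argument.
template <class T>
constexpr bool deduced_integral(T)
{
    return std::is_integral<T>::value;
}

// Simplified ceiling division (an assumption, not rocPRIM's implementation).
template <class T>
constexpr T ceiling_div_sketch(T a, T b)
{
    return a / b + (a % b != 0 ? 1 : 0);
}

using size_type = std::size_t; // plays the role of buffer_size_type

constexpr proxy_ref<size_type> stored_size{1000}; // what sizes[id] might yield
constexpr size_type tile_bytes = 256; // block size * bytes per thread

// Passed directly, the argument is deduced as the proxy type.
static_assert(!deduced_integral(stored_size), "deduced as proxy_ref");

// After the explicit cast, callees only ever see a plain integer.
constexpr size_type size = static_cast<size_type>(stored_size);
static_assert(deduced_integral(size), "deduced as size_type");
static_assert(ceiling_div_sketch(size, tile_bytes) == 4, "1000 bytes need 4 tiles");
```

The checks are static_assert statements, so compiling the file with a C++11 or newer compiler is enough to confirm them. The cast into a named local, as done in the diff, keeps the conversion in one place and leaves the callee signatures untouched.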
