Bloom filter optimizations (1/5): Less noisy benchmarks #669

sleeepyjack · 2025-02-12T16:05:39Z

This PR is part 1/5 of the Bloom filter optimization project and must be merged in the correct order.

This PR introduces two changes that reduce the noise level of the benchmark measurements drastically:

Generate input data on-the-fly rather than loading the keys from global memory.
Increase the number of input keys to make the kernels run longer (former runtimes were in the single-digit ms range which was too noisy).

PointKernel · 2025-02-12T17:52:25Z

benchmarks/bloom_filter/defaults.hpp

 using BF_WORD = nvbench::uint32_t;

-static constexpr auto BF_N               = 400'000'000;
+static constexpr auto BF_N               = 1'000'000'000;


copyright years to be updated otherwise looks good

sleeepyjack added 4 commits February 6, 2025 18:39

Eliminate IO from bloom_filter::add benchmark

9e97c67

Don't read benchmark input data from gmem

e2bb179

Increase benchmark input to reduce noise in measurements

15ecc93

Merge remote-tracking branch 'upstream' into bf-bench

cfcd09f

sleeepyjack added type: improvement Improvement / enhancement to an existing function topic: bloom_filter Issues related to bloom_filter labels Feb 12, 2025

sleeepyjack self-assigned this Feb 12, 2025

sleeepyjack requested a review from PointKernel as a code owner February 12, 2025 16:05

PointKernel approved these changes Feb 12, 2025

View reviewed changes

mhaseeb123 approved these changes Feb 12, 2025

View reviewed changes

Update copyright year

a51e7e6

sleeepyjack mentioned this pull request Feb 13, 2025

Bloom filter optimizations (2/5): Eliminate lmem access during salt lookup in arrow policy #670

Merged

sleeepyjack merged commit ac4ba6b into NVIDIA:dev Feb 13, 2025
18 checks passed

sleeepyjack deleted the bf-bench branch February 13, 2025 00:01

sleeepyjack mentioned this pull request Feb 13, 2025

Bloom filter optimizations (4/5): Implement device bulk add #672

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bloom filter optimizations (1/5): Less noisy benchmarks #669

Bloom filter optimizations (1/5): Less noisy benchmarks #669

Uh oh!

sleeepyjack commented Feb 12, 2025

Uh oh!

PointKernel Feb 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Bloom filter optimizations (1/5): Less noisy benchmarks #669

Bloom filter optimizations (1/5): Less noisy benchmarks #669

Uh oh!

Conversation

sleeepyjack commented Feb 12, 2025

Uh oh!

PointKernel Feb 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants