Skip to content

Commit

Permalink
Merge pull request #4920 from martin-frbg/issue4917
Browse files Browse the repository at this point in the history
Fix potential inaccuracy in multithreaded level3 related to SWITCH_RATIO
  • Loading branch information
martin-frbg authored Oct 7, 2024
2 parents 0dfe42d + 8a1710d commit bc0691a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion driver/level3/level3_thread.c
Original file line number Diff line number Diff line change
Expand Up @@ -742,7 +742,7 @@ static int gemm_driver(blas_arg_t *args, BLASLONG *range_m, BLASLONG
num_parts = 0;
while (n > 0){
width = blas_quickdivide(n + nthreads - num_parts - 1, nthreads - num_parts);
if (width < switch_ratio) {
if (width < switch_ratio && width > 1) {
width = switch_ratio;
}
width = round_up(n, width, GEMM_PREFERED_SIZE);
Expand Down

0 comments on commit bc0691a

Please sign in to comment.