Skip to content

Commit

Permalink
#13621: enable default fp32 acc for reduce
Browse files Browse the repository at this point in the history
  • Loading branch information
bbradelTT committed Dec 4, 2024
1 parent b756309 commit 1dfa9cd
Showing 1 changed file with 6 additions and 2 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -191,8 +191,12 @@ Tensor reduce(
auto is_multicore_hw = parallelization_strategy == ReduceOpParallelizationStrategy::MULTI_CORE_HW;
float pad_value = reduce_math == ReduceOpMath::MAX ? -std::numeric_limits<float>::infinity() : 0;

ttnn::DeviceComputeKernelConfig config = compute_kernel_config.value_or(
ttnn::init_device_compute_kernel_config(input_tensor.device()->arch(), std::nullopt, MathFidelity::HiFi4));
ttnn::DeviceComputeKernelConfig config = compute_kernel_config.value_or(ttnn::init_device_compute_kernel_config(
input_tensor.device()->arch(),
std::nullopt,
MathFidelity::HiFi4,
/*default_approx_mode=*/false,
/*default_fp32_acc=*/true));

std::vector<Tensor> output_tensors = {Tensor(operation::get_workers_for_op_output({input_tensor}))};
if (is_multicore_hw) {
Expand Down

0 comments on commit 1dfa9cd

Please sign in to comment.