[Perf] Windows/arm64: 3 Regressions on 2/9/2023 7:28:55 PM #13046

performanceautofiler · 2023-02-16T11:14:11Z

Run Information

Architecture	arm64
OS	Windows 10.0.25094
Baseline	3ff80e90e828bac0370c1930c9950c9650ae61b9
Compare	46cb4eddbb1f4bebdea970964d7449bacde8df1a
Diff	Diff

Regressions in System.MathBenchmarks.Single

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio	Baseline ETL	Compare ETL
Min - Duration of single invocation	3.36 μs	6.79 μs	2.02	0.70	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

Payloads

Baseline
Compare

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.MathBenchmarks.Single*'

Payloads

Baseline
Compare

Histogram

System.MathBenchmarks.Single.Min

Description of detection logic

IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsRegressionWindowed: Marked as regression because 6.789579067730634 > 4.6191408849201805.
IsChangePoint: Marked as a change because one of 11/16/2022 4:37:16 PM, 11/18/2022 7:31:41 AM, 2/9/2023 1:41:04 PM, 2/16/2023 5:39:51 AM falls between 2/7/2023 3:56:56 PM and 2/16/2023 5:39:51 AM.
IsRegressionStdDev: Marked as regression because -13.134728502088736 (T) = (0 -6389.731027976929) / Math.Sqrt((217559.20690277216 / (15)) + (212374.7931338129 / (22))) is less than -2.0301079282477414 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (15) + (22) - 2, .025) and -0.4694954470860071 = (4348.248264836542 - 6389.731027976929) / 4348.248264836542 is less than -0.05.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsChangeEdgeDetector: Marked not as a regression because Edge Detector said so.

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Architecture	arm64
OS	Windows 10.0.25094
Baseline	3ff80e90e828bac0370c1930c9950c9650ae61b9
Compare	46cb4eddbb1f4bebdea970964d7449bacde8df1a
Diff	Diff

Regressions in System.Collections.CreateAddAndClear<Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio	Baseline ETL	Compare ETL
ConcurrentDictionary - Duration of single invocation	157.60 μs	202.67 μs	1.29	0.65	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

Payloads

Baseline
Compare

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.CreateAddAndClear&lt;Int32&gt;*'

Payloads

Baseline
Compare

Histogram

System.Collections.CreateAddAndClear<Int32>.ConcurrentDictionary(Size: 512)

Description of detection logic

IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsRegressionWindowed: Marked as regression because 202.669130952381 > 194.30586389280376.
IsChangePoint: Marked as a change because one of 2/9/2023 1:41:04 PM, 2/16/2023 5:39:51 AM falls between 2/7/2023 3:56:56 PM and 2/16/2023 5:39:51 AM.
IsRegressionStdDev: Marked as regression because -10.084408793310754 (T) = (0 -223509.0795869315) / Math.Sqrt((172223009.61375564 / (15)) + (183281893.00650138 / (22))) is less than -2.0301079282477414 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (15) + (22) - 2, .025) and -0.25129579121036716 = (178622.09811377467 - 223509.0795869315) / 178622.09811377467 is less than -0.05.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsChangeEdgeDetector: Marked not as a regression because Edge Detector said so.

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Architecture	arm64
OS	Windows 10.0.25094
Baseline	3ff80e90e828bac0370c1930c9950c9650ae61b9
Compare	46cb4eddbb1f4bebdea970964d7449bacde8df1a
Diff	Diff

Regressions in System.Collections.TryAddDefaultSize<Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio	Baseline ETL	Compare ETL
ConcurrentDictionary - Duration of single invocation	148.97 μs	196.75 μs	1.32	0.69	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

Payloads

Baseline
Compare

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.TryAddDefaultSize&lt;Int32&gt;*'

Payloads

Baseline
Compare

Histogram

System.Collections.TryAddDefaultSize<Int32>.ConcurrentDictionary(Count: 512)

Description of detection logic

IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsRegressionWindowed: Marked as regression because 196.7454220779221 > 156.83800762773058.
IsChangePoint: Marked as a change because one of 2/9/2023 1:41:04 PM, 2/16/2023 5:39:51 AM falls between 2/7/2023 3:56:56 PM and 2/16/2023 5:39:51 AM.
IsRegressionStdDev: Marked as regression because -11.208636392415473 (T) = (0 -191322.57909932686) / Math.Sqrt((154967953.42477354 / (15)) + (194641512.6552477 / (22))) is less than -2.0301079282477414 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (15) + (22) - 2, .025) and -0.34510419054354413 = (142236.25236199374 - 191322.57909932686) / 142236.25236199374 is less than -0.05.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsChangeEdgeDetector: Marked not as a regression because Edge Detector said so.

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

The text was updated successfully, but these errors were encountered:

performanceautofiler bot assigned EgorBo Feb 16, 2023

performanceautofiler bot added ampere arm64 untriaged labels Feb 16, 2023

cincuranet mentioned this issue Feb 16, 2023

Improve ConcurrentDictionary performance, in particular for strings dotnet/runtime#81557

Merged

cincuranet removed the untriaged label Feb 16, 2023

performanceautofiler bot added perf-regression os-windows branch-refs/heads/main arch-arm64 runtime-coreclr kind-micro and removed refs/heads/main labels Mar 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Perf] Windows/arm64: 3 Regressions on 2/9/2023 7:28:55 PM #13046

[Perf] Windows/arm64: 3 Regressions on 2/9/2023 7:28:55 PM #13046

performanceautofiler bot commented Feb 16, 2023 •

edited

Loading

Payloads

Histogram

System.MathBenchmarks.Single.Min

Description of detection logic

Docs

Payloads

Histogram

System.Collections.CreateAddAndClear<Int32>.ConcurrentDictionary(Size: 512)

Description of detection logic

Docs

Payloads

Histogram

System.Collections.TryAddDefaultSize<Int32>.ConcurrentDictionary(Count: 512)

Description of detection logic

Docs

[Perf] Windows/arm64: 3 Regressions on 2/9/2023 7:28:55 PM #13046

[Perf] Windows/arm64: 3 Regressions on 2/9/2023 7:28:55 PM #13046

Comments

performanceautofiler bot commented Feb 16, 2023 • edited Loading

Run Information

Regressions in System.MathBenchmarks.Single

Repro

Payloads

Payloads

Histogram

System.MathBenchmarks.Single.Min

Description of detection logic

Docs

Run Information

Regressions in System.Collections.CreateAddAndClear<Int32>

Repro

Payloads

Payloads

Histogram

System.Collections.CreateAddAndClear<Int32>.ConcurrentDictionary(Size: 512)

Description of detection logic

Docs

Run Information

Regressions in System.Collections.TryAddDefaultSize<Int32>

Repro

Payloads

Payloads

Histogram

System.Collections.TryAddDefaultSize<Int32>.ConcurrentDictionary(Count: 512)

Description of detection logic

Docs

performanceautofiler bot commented Feb 16, 2023 •

edited

Loading