[Perf] Windows/x64: 5 Regressions on 2/3/2024 12:19:35 AM #98044

performanceautofiler · 2024-02-06T08:28:28Z

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Globalization.Tests.StringEquality

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
Compare_Same - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	761.52 ns	973.83 ns	1.28	0.00	True
Compare_Same_Upper - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	1.19 μs	1.28 μs	1.08	0.01	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Globalization.Tests.StringEquality*'

Payloads

Baseline
Compare

System.Globalization.Tests.StringEquality.Compare_Same(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

System.Globalization.Tests.StringEquality.Compare_Same_Upper(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Benchmark.GetChildKeysTests

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
AddChainedConfigurationEmpty - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	14.99 ms	16.20 ms	1.08	0.02	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Benchmark.GetChildKeysTests*'

Payloads

Baseline
Compare

Benchmark.GetChildKeysTests.AddChainedConfigurationEmpty

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Span.Sorting

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
QuickSortArray - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	8.54 μs	16.21 μs	1.90	0.45	True

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Span.Sorting*'

Payloads

Baseline
Compare

Span.Sorting.QuickSortArray(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Benchstone.BenchI.EightQueens

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
Test - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	1.80 μs	2.04 μs	1.13	0.03	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Benchstone.BenchI.EightQueens*'

Payloads

Baseline
Compare

Benchstone.BenchI.EightQueens.Test

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

DrewScoggins · 2024-02-06T18:06:30Z

Diff here: 207e1fb...df0778d

Nothing is jumping out as the culprit, but there were a few JIT changes.

DrewScoggins · 2024-02-06T18:21:57Z

Linux related regressions: dotnet/perf-autofiling-issues#28564

ghost · 2024-02-07T16:59:59Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Globalization.Tests.StringEquality

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
Compare_Same - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	761.52 ns	973.83 ns	1.28	0.00	True
Compare_Same_Upper - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	1.19 μs	1.28 μs	1.08	0.01	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Globalization.Tests.StringEquality*'

Payloads

Baseline
Compare

System.Globalization.Tests.StringEquality.Compare_Same(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

System.Globalization.Tests.StringEquality.Compare_Same_Upper(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Benchmark.GetChildKeysTests

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
AddChainedConfigurationEmpty - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	14.99 ms	16.20 ms	1.08	0.02	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Benchmark.GetChildKeysTests*'

Payloads

Baseline
Compare

Benchmark.GetChildKeysTests.AddChainedConfigurationEmpty

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Span.Sorting

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
QuickSortArray - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	8.54 μs	16.21 μs	1.90	0.45	True

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Span.Sorting*'

Payloads

Baseline
Compare

Span.Sorting.QuickSortArray(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	1a2f095fb212dcbf394f01122b9f317b7cc70fdb
Compare	2361c00717a54a5dd9b0cf727102d64f783855b9
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in Benchstone.BenchI.EightQueens

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
Test - Duration of single invocation 📝 - Benchmark Source 📈 - ADX Test Multi Config Graph	1.80 μs	2.04 μs	1.13	0.03	False

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'Benchstone.BenchI.EightQueens*'

Payloads

Baseline
Compare

Benchstone.BenchI.EightQueens.Test

ETL Files

Histogram

JIT Disasms

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Author:	performanceautofiler[bot]
Assignees:	-
Labels:	`os-windows`, `arch-x64`, `area-CodeGen-coreclr`, `untriaged`, `runtime-coreclr`, `needs-area-label`
Milestone:	-

BruceForstall · 2024-02-13T17:26:10Z

Maybe #97722?

AndyAyersMS · 2024-07-22T18:55:24Z

EightQueens seems to be an intel-only regression, and then only on some cases, and two other regressions since.

Most all the time is in TryMe.

Codegen from baseline to latest shows RBO did one jump thread (from #97722), different layout, and an IV widening.

There are a lot of spilled CSEs here in both baseline and latest codegen, but more spill occurrences in latest. Possibly the one extra jump thread by RBO has created more critical edges and so made life more difficult for LSRA.

Final flow graphs. You can clearly see the impact of RPO layout at least...

MAIN	BASELINE

AndyAyersMS · 2024-08-06T21:52:27Z

Span.Sorting.QuickSortArray(Size: 512)

Regressions here were fixed by RPO layout:

AndyAyersMS · 2024-08-06T22:13:34Z

System.Globalization.Tests.StringEquality.Compare_Same(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

Ditto for this benchmark

AndyAyersMS · 2024-08-06T22:39:46Z

System.Globalization.Tests.StringEquality.Compare_Same_Upper(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

Same as the two above, recovers with later changes.

AndyAyersMS · 2024-08-06T22:45:01Z

Benchmark.GetChildKeysTests.AddChainedConfigurationEmpty

Ditto like the above

AndyAyersMS · 2024-08-06T22:46:23Z

So the only persisted regression is in 8 queens, and that one seems to be the increase in resolution moves by the allocator.

Going to move this to .NET 10 as there's no simple fix available now.

performanceautofiler bot added arch-x64 os-windows runtime-coreclr specific to the CoreCLR runtime untriaged New issue has not been triaged by the area owner labels Feb 6, 2024

performanceautofiler bot mentioned this issue Feb 6, 2024

[SENTINEL] Autofile run complete at 2/6/2024 8:30:52 AM. 20 issues filed. dotnet/perf-autofiling-issues#28634

Closed

DrewScoggins removed the untriaged New issue has not been triaged by the area owner label Feb 6, 2024

DrewScoggins transferred this issue from dotnet/perf-autofiling-issues Feb 6, 2024

dotnet-issue-labeler bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Feb 6, 2024

ghost added the untriaged New issue has not been triaged by the area owner label Feb 6, 2024

jeffschwMSFT added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Feb 7, 2024

vcsjones removed the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Feb 13, 2024

BruceForstall assigned AndyAyersMS Feb 13, 2024

BruceForstall added this to the 9.0.0 milestone Feb 13, 2024

ghost removed the untriaged New issue has not been triaged by the area owner label Feb 13, 2024

AndyAyersMS added the Priority:2 Work that is important, but not critical for the release label May 8, 2024

AndyAyersMS added the tenet-performance-benchmarks Issue from performance benchmark label Jul 27, 2024

AndyAyersMS modified the milestones: 9.0.0, 10.0.0 Aug 6, 2024

AndyAyersMS removed the Priority:2 Work that is important, but not critical for the release label Aug 6, 2024

DrewScoggins mentioned this issue Aug 22, 2024

Performance Fundamental: Net 8 -> Net 9 Manual Comparison Report #106824

Closed

[Perf] Windows/x64: 5 Regressions on 2/3/2024 12:19:35 AM #98044

[Perf] Windows/x64: 5 Regressions on 2/3/2024 12:19:35 AM #98044

Comments

performanceautofiler bot commented Feb 6, 2024 • edited Loading

Run Information

Regressions in System.Globalization.Tests.StringEquality

Repro

Payloads

System.Globalization.Tests.StringEquality.Compare_Same(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

System.Globalization.Tests.StringEquality.Compare_Same_Upper(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Benchmark.GetChildKeysTests

Repro

Payloads

Benchmark.GetChildKeysTests.AddChainedConfigurationEmpty

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Span.Sorting

Repro

Payloads

Span.Sorting.QuickSortArray(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Benchstone.BenchI.EightQueens

Repro

Payloads

Benchstone.BenchI.EightQueens.Test

ETL Files

Histogram

JIT Disasms

Docs

DrewScoggins commented Feb 6, 2024

DrewScoggins commented Feb 6, 2024

ghost commented Feb 7, 2024

Run Information

Regressions in System.Globalization.Tests.StringEquality

Repro

Payloads

System.Globalization.Tests.StringEquality.Compare_Same(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

System.Globalization.Tests.StringEquality.Compare_Same_Upper(Count: 1024, Options: (en-US, OrdinalIgnoreCase))

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Benchmark.GetChildKeysTests

Repro

Payloads

Benchmark.GetChildKeysTests.AddChainedConfigurationEmpty

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Span.Sorting

Repro

Payloads

Span.Sorting.QuickSortArray(Size: 512)

ETL Files

Histogram

JIT Disasms

Docs

Run Information

Regressions in Benchstone.BenchI.EightQueens

performanceautofiler bot commented Feb 6, 2024 •

edited

Loading