compile composite with avx2 on x64 #55057

mangod9 · 2021-07-02T03:25:02Z

Compile Fx composite images using avx2.

mangod9 · 2021-07-02T03:26:18Z

src/installer/pkg/sfx/Microsoft.NETCore.App/Microsoft.NETCore.App.Runtime.Composite.sfxproj

@@ -32,6 +32,9 @@

  <ItemGroup>
    <PublishReadyToRunCrossgen2ExtraArgsList Include="--compositekeyfile:$(AssemblyOriginatorKeyFile)"/>
+    <!-- Compile with avx2 on x64 -->
+    <PublishReadyToRunCrossgen2ExtraArgsList Condition="'$(TargetArchitecture)' == 'x64'" Include="--inputbubble"/>


currently inputbubble is required with instruction-set. Shouldnt composite just imply that? @AntonLapounov @trylek ?

Yes, it implies; I think the check in Crossgen2 should be relaxed accordingly. @davidwrighton has filed dotnet/sdk#17760, but I am not sure why it is filed on the SDK side.

yeah this needs to be fixed in cg2. perhaps we should just move the bug over.

src/installer/pkg/sfx/Microsoft.NETCore.App/Microsoft.NETCore.App.Runtime.Composite.sfxproj

AntonLapounov

Thanks!

mangod9 · 2021-07-02T05:01:00Z

wonder whether we should also compile with -Ot based on #52708 (comment) ?

AntonLapounov · 2021-07-02T06:24:13Z

That comment was about "workloads that use R2R with tiered JIT compilation disabled", which is not the normal configuration.

mangod9 · 2021-07-02T06:26:43Z

yeah true, that's unlikely for containers.

tannergooding · 2021-07-02T14:36:53Z

src/installer/pkg/sfx/Microsoft.NETCore.App/Microsoft.NETCore.App.Runtime.Composite.sfxproj

@@ -32,6 +32,9 @@

  <ItemGroup>
    <PublishReadyToRunCrossgen2ExtraArgsList Include="--compositekeyfile:$(AssemblyOriginatorKeyFile)"/>
+    <!-- Compile with avx2 on x64 -->


What's the reasoning for "just AVX2"?

That is, specifying avx2 will imply anything in the direct heirarchy, so it will include:

X86Base

SSE

SSE2

SSE3

SSSE3

SSE4.1

SSE4.2

AVX

However, there are certain branch instruction sets that aren't part of the direct hierarchy. These are generally considered to exist due to age or vendor (AMD vs Intel) differences in how things are exposed and so should be reasonable to also include:

AES (2010+)

PCLMULQDQ (2010+)

POPCNT (2008+)

Likewise, there are instruction sets that were introduced alongside AVX2 (which was introduced in Haswell, 2013+) and are realistically provided SxS in machines that provide AVX2 (only missing in machines like newer Atom processors that are also missing AVX2):

BMI1

BMI2

FMA

LZCNT

avx2 is what we have been running our benchmarks with, which showed decent startup improvements. We could subsequently add others as you suggest, if there is measurable perf improvement when enabled.

POPCNT and LZCNT are both used in various hot paths (e.g. in Span or UTF8<->UTF16 conversion).
Intel technically considers POPCNT part of SSE4.2 and LZCNT part of BMI1, they are split out as not all vendors (such as AMD) consider them this way (but they are functionally present in the same scenarios on the other vendors).

FMA and BMI2 aren't used by any of our own startup paths but may be called from user code if they are doing anything around Math.BigMul or Math/MathF.FusedMultiplyAdd

AES and PCLMULQDQ are likely fine to not enable for R2R as they are edge case scenarios, but they should also be harmless to enable on any machine that supports AVX2.

ok thanks for the info.

POPCNT and LZCNT are both used in various hot paths (e.g. in Span or UTF8<->UTF16 conversion).

Will add these to current benchmarks to measure the perf improvement and then enable for composite, assuming most modern CPUs support these.

assuming most modern CPUs support these.

They should. POPCNT will be in basically any CPU that supports SSE4.2 (2008+) and LZCNT/BMI1 in basically any CPU that supports AVX2 (2013+).
Since we're enabling AVX2 here, both should effectively be available in the same scenarios.

There are minor nuances here for pre-AVX2 CPUs between AMD and Intel, but that isn't particularly relevant if we are saying AVX2 is "safe" to be the baseline/default

compile composite with avx2 on x64

052b34e

mangod9 added the area-crossgen2-coreclr label Jul 2, 2021

mangod9 requested review from richlander and davidwrighton July 2, 2021 03:25

mangod9 commented Jul 2, 2021

View reviewed changes

AntonLapounov reviewed Jul 2, 2021

View reviewed changes

src/installer/pkg/sfx/Microsoft.NETCore.App/Microsoft.NETCore.App.Runtime.Composite.sfxproj Outdated Show resolved Hide resolved

AntonLapounov approved these changes Jul 2, 2021

View reviewed changes

fix xml coding style

f0d3937

tannergooding reviewed Jul 2, 2021

View reviewed changes

mangod9 merged commit aeb467e into dotnet:main Jul 6, 2021

mangod9 deleted the useavx2 branch July 6, 2021 18:32

ghost locked as resolved and limited conversation to collaborators Aug 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

compile composite with avx2 on x64 #55057

compile composite with avx2 on x64 #55057

mangod9 commented Jul 2, 2021

mangod9 Jul 2, 2021

AntonLapounov Jul 2, 2021

mangod9 Jul 2, 2021

AntonLapounov left a comment

mangod9 commented Jul 2, 2021

AntonLapounov commented Jul 2, 2021

mangod9 commented Jul 2, 2021

tannergooding Jul 2, 2021 •

edited

Loading

mangod9 Jul 2, 2021

tannergooding Jul 2, 2021

mangod9 Jul 2, 2021

tannergooding Jul 2, 2021

compile composite with avx2 on x64 #55057

compile composite with avx2 on x64 #55057

Conversation

mangod9 commented Jul 2, 2021

mangod9 Jul 2, 2021

Choose a reason for hiding this comment

AntonLapounov Jul 2, 2021

Choose a reason for hiding this comment

mangod9 Jul 2, 2021

Choose a reason for hiding this comment

AntonLapounov left a comment

Choose a reason for hiding this comment

mangod9 commented Jul 2, 2021

AntonLapounov commented Jul 2, 2021

mangod9 commented Jul 2, 2021

tannergooding Jul 2, 2021 • edited Loading

Choose a reason for hiding this comment

mangod9 Jul 2, 2021

Choose a reason for hiding this comment

tannergooding Jul 2, 2021

Choose a reason for hiding this comment

mangod9 Jul 2, 2021

Choose a reason for hiding this comment

tannergooding Jul 2, 2021

Choose a reason for hiding this comment

tannergooding Jul 2, 2021 •

edited

Loading