Fix return address hijacking with CET #109074

janvorli · 2024-10-21T13:21:02Z

There is a problematic case when return address is hijacked while in a managed method that tail calls a GC write barrier and when CET is enabled. The write barrier code can change while the handler for the hijacked address is executed from the vectored exception handler. When the vectored exception handler then returns to the write barrier to re-execute the ret instruction that has triggered the vectored exception handler due to the main stack containing a different address than the shadow stack (now with the main stack fixed), the instruction may no longer be ret due to the change of the write barrier change.

This change fixes it by setting the context to return to from the vectored exception handler to point to the caller and setting the Rsp and SSP to match that. That way, the write barrier code no longer matters.

There is a problematic case when return address is hijacked while in a managed method that tail calls a GC write barrier and when CET is enabled. The write barrier code can change while the handler for the hijacked address is executed from the vectored exception handler. When the vectored exception handler then returns to the write barrier to re-execute the `ret` instruction that has triggered the vectored exception handler due to the main stack containing a different address than the shadow stack (now with the main stack fixed), the instruction may no longer be `ret` due to the change of the write barrier change. This change fixes it by setting the context to return to from the vectored exception handler to point to the caller and setting the Rsp and SSP to match that. That way, the write barrier code no longer matters.

jkotas · 2024-10-21T14:05:01Z

Should the same change be made in naot to keep the two implementations in sync?

runtime/src/coreclr/nativeaot/Runtime/EHHelpers.cpp

Lines 551 to 556 in 061941a

    
           if (areShadowStacksEnabled) 
        
           { 
        
               // Undo the "pop", so that the ret could now succeed. 
        
               interruptedContext->SetSp(interruptedContext->GetSp() - 8); 
        
               interruptedContext->SetIp(origIp); 
        
           }

VSadov · 2024-10-21T14:57:36Z

NativeAOT does not require the fix, but it would be better if implementations match.

LGTM, otherwise. Thanks!!

janvorli · 2024-10-22T11:36:25Z

I've just added commit with the nativeaot change. But I would like to know your opinion on where should the newly added GetSSP and SetSSP be.and whether they should be made a PAL API or not. Their implementation needs the windows headers to be included or the new types / API it uses redefined.

VSadov · 2024-10-22T17:17:57Z

I would like to know your opinion on where should the newly added GetSSP and SetSSP be.and whether they should be made a PAL API or not.

That seems fine to me. That is where I'd put the helpers as well.

janvorli · 2024-11-05T14:33:14Z

/backport to release/9.0-staging

github-actions · 2024-11-05T14:33:28Z

Started backporting to release/9.0-staging: https://github.com/dotnet/runtime/actions/runs/11686309680

janvorli added the area-VM-coreclr label Oct 21, 2024

janvorli added this to the 9.0.0 milestone Oct 21, 2024

janvorli requested a review from VSadov October 21, 2024 13:21

janvorli self-assigned this Oct 21, 2024

VSadov approved these changes Oct 21, 2024

View reviewed changes

Add equivalent change to nativeaot

c26547c

janvorli requested a review from MichalStrehovsky as a code owner October 22, 2024 11:32

Add missing ifdef

5fe9c89

VSadov approved these changes Oct 22, 2024

View reviewed changes

This was referenced Oct 22, 2024

slow macOS - "##[error]The job running on agent Azure Pipelines 9 ran longer than the maximum time of 60 minutes." dotnet/dnceng#1883

Open

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

janvorli merged commit 4794a57 into dotnet:main Oct 22, 2024
90 checks passed

VSadov mentioned this pull request Oct 23, 2024

SortedDictionary<BigInteger, MyObject> System.AccessViolationException #108763

Closed

github-actions bot mentioned this pull request Nov 5, 2024

[release/9.0-staging] Fix return address hijacking with CET #109548

Merged

4 tasks

stephentoub mentioned this pull request Dec 6, 2024

[Win11] [Bug] [ConcurrentBag regression in .Net 9] System.AccessViolationException: 'Attempted to read or write protected memory. This is often an indication that other memory is corrupt.' #110355

Closed

github-actions bot locked and limited conversation to collaborators Dec 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix return address hijacking with CET #109074

Fix return address hijacking with CET #109074

Uh oh!

janvorli commented Oct 21, 2024

Uh oh!

jkotas commented Oct 21, 2024

Uh oh!

VSadov commented Oct 21, 2024

Uh oh!

janvorli commented Oct 22, 2024

Uh oh!

VSadov commented Oct 22, 2024

Uh oh!

Uh oh!

janvorli commented Nov 5, 2024

Uh oh!

github-actions bot commented Nov 5, 2024

Uh oh!

Uh oh!

Fix return address hijacking with CET #109074

Fix return address hijacking with CET #109074

Uh oh!

Conversation

janvorli commented Oct 21, 2024

Uh oh!

jkotas commented Oct 21, 2024

Uh oh!

VSadov commented Oct 21, 2024

Uh oh!

janvorli commented Oct 22, 2024

Uh oh!

VSadov commented Oct 22, 2024

Uh oh!

Uh oh!

janvorli commented Nov 5, 2024

Uh oh!

github-actions bot commented Nov 5, 2024

Uh oh!

Uh oh!