Skip to content

Conversation

@huoyaoyuan
Copy link
Member

@huoyaoyuan huoyaoyuan commented Jun 9, 2025

As discussed in #112419, there should not be blocker for adopting C++ 17 in development environment. SuperPMI is I/O intensive and can be simplified significantly with std::filesystem.

It's non-starter to introduce C++ 17. Instead, using std::string can also simplify string manipulation a lot.

The end goal is to eliminate PAL simulation of Windows I/O functions. This PR acts as a first step to adopt STL in the dll shim entries.

@github-actions github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jun 9, 2025
@dotnet-policy-service dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Jun 9, 2025
Comment on lines 19 to 24
std::filesystem::path g_realJitPath{""}; // Destructable objects will be cleaned up and won't leak
std::filesystem::path g_logPath{""};
std::filesystem::path g_HomeDirectory{""};
std::filesystem::path g_DefaultRealJitPath{""};

void SetDefaultPaths()
std::unique_ptr<MethodCallSummarizer> g_globalContext = nullptr;
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

According to https://learn.microsoft.com/windows/win32/dlls/dllmain and other documentations I can find, constructors/destructors for global objects with be respected by dll/so unloading. I haven't test it though.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Global destructors are a bug farm. The problem is that they run while the other threads are stopped at random spot (on Windows) and while other threads are running (on non-Windows). Every time we happen to introduce one by mistake, we end up deleting it later to fix intermittent hangs and crashes.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How does it compare to DllMain (DETACH)? The counter shim writes statistics on unload, which is simulated by PAL. Is there better approach?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

DllMain (DETACH)

It has the same problems

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does is mean that things won't be worse if we replace existing DllMain with global objects? Currently the DllMain of JIT is doing shutdown.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Currently the DllMain of JIT is doing shutdown.

If I am reading the code correctly, DllMain of JIT is only called on Windows. It is never called on non-Windows.

Does is mean that things won't be worse if we replace existing DllMain with global objects?

I would expect it to be fine for superpmi. I am not sure about regular JIT.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If I am reading the code correctly, DllMain of JIT is only called on Windows. It is never called on non-Windows.

JIT is loaded by CLRLoadLibrary in LoadAndInitializeJIT, which is implemented in PAL with LoadLibraryExW and does DllMain simulation. It's never unloaded though, but if it is, the DllMain will also be called by PAL.

@huoyaoyuan
Copy link
Member Author

It seems that our linux glibc x64/x86/arm64 build environment aren't ready for C++ 17 STL. Is it because we are targeting the runtime version of lowest supported OS?

@janvorli
Copy link
Member

janvorli commented Jun 9, 2025

We use Ubuntu 22.04 18.04 as cross build target for arm64. That one doesn't have support for c++17. So unless we decided to raise the minimum supported OSes in .NET, we are blocked.

@huoyaoyuan
Copy link
Member Author

Is it OK to use some polyfill for superPMI only? We don't need strict compatibility or reliability here.

If this is too flaky, I can remove C++ 17 usage and convert to C++ 11 STL instad.

@jkotas
Copy link
Member

jkotas commented Jun 10, 2025

I can remove C++ 17 usage and convert to C++ 11 STL instad.

What would you have to give up on by converting to C++ 11?

@huoyaoyuan
Copy link
Member Author

What would you have to give up on by converting to C++ 11?

Everything in std::filesystem, currently just path that automatically converts to platform-native encoding. In superpmi it should be fine to use ANSI variants on Windows.

@AaronRobinsonMSFT
Copy link
Member

AaronRobinsonMSFT commented Jul 7, 2025

What would you have to give up on by converting to C++ 11?

Everything in std::filesystem, currently just path that automatically converts to platform-native encoding. In superpmi it should be fine to use ANSI variants on Windows.

std::filesystem is a fine API, but we shouldn't really need it. It does help abstract away a lot, but it leveages C++ isms that people tend to balk at (for example, operator overloads). My preference would be to avoid std::filesystem in all cases and focus on collections and other language affordances that actually make the code simpler and explicit.

C++11 and C++14 are the biggest wins for the runtime repo from an API perspective.

@AaronRobinsonMSFT
Copy link
Member

@huoyaoyuan In fact, I called this API out specifically in the referenced issue #112419 (comment). We shouldn't be using this without a real benefit. I don't see that in this PR.

@JulieLeeMSFT JulieLeeMSFT added the needs-author-action An issue or pull request that requires more info or actions from the author. label Jul 7, 2025
@JulieLeeMSFT
Copy link
Member

My preference would be to avoid std::filesystem in all cases

@huoyaoyuan, Given Aaron's comment to avoid std::filesystem, there isn't much point in pursuing this PR without it. What do you think?

@huoyaoyuan
Copy link
Member Author

The major benefit of std::filesystem in superpmi is actually encoding handling of file names. Given its non-shipping tool, we can really avoid completed support.

I'll convert this PR to std::string and fstream based. They are still beneficial.

@dotnet-policy-service dotnet-policy-service bot removed the needs-author-action An issue or pull request that requires more info or actions from the author. label Jul 8, 2025
@huoyaoyuan huoyaoyuan changed the title Start to use C++ 17 STL in superpmi Start to use STL and stdio in superpmi Aug 12, 2025
@dotnet-policy-service dotnet-policy-service bot removed the needs-author-action An issue or pull request that requires more info or actions from the author. label Aug 12, 2025
if (jitLib == NULL)
{
LogError("LoadRealJitLib - LoadLibrary failed to load '%ws' (0x%08x)", jitLibPath, ::GetLastError());
LogError("LoadRealJitLib - LoadLibrary failed to load '%s' (%d)", jitLibPath.c_str(), ::GetLastError());
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
LogError("LoadRealJitLib - LoadLibrary failed to load '%s' (%d)", jitLibPath.c_str(), ::GetLastError());
LogError("LoadRealJitLib - LoadLibrary failed to load '%s' (0x%08x)", jitLibPath.c_str(), ::GetLastError());

Win32 error codes are best expressed as hex.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Win32 error codes are best expressed as hex.

Nit: Win32 error code != HRESULTs. I believe Win32 error codes are typically displayed in decimal. For example, our own Win32Exception prints them as decimal - Console.WriteLine(new Win32Exception(3 /* ERROR_PATH_NOT_FOUND */)); gives System.ComponentModel.Win32Exception (3): The system cannot find the path specified..

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This particular case was using hex format, while there are other places using decimal. I tend to keep it as-is for now.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: Win32 error code != HRESULTs.

Agree. That is why I stated Win32 error code :)

I believe Win32 error codes are typically displayed in decimal.

Sadly, yes. That has been the most common experience with user facing scenarios and I continue to believe it isn't a helpful display format. The most common Win32 error codes (for example, 3, 5 ,etc) are sub 10, which display the same. The higher ones are less common and often also used with HRESULT and LSTATUS. Consistently displaying them as hex avoids the common signed issue (that is, using %d instead of %u) and allows for easier pattern recognition in logs.

In a better world, our error system would have something as simple as dlerror() as opposed to the painfully baroque FormatMessage().

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it isn't a helpful display format.

I believe that it is the common established display format for Windows error code. For example, compare these search results - the first returns good answer for me, the second one returns confusing garbage.

https://www.bing.com/search?q=windows+error+code+3

https://www.bing.com/search?q=windows+error+code+0x00000003

The higher ones are less common and often also used with HRESULT and LSTATUS.

Right, these are different enums that are not interchangeable. I do not have a problem with using hex for HRESULT.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are ~100 places that print GetLastError in superpmi. ~85% print it as decimal.

@AaronRobinsonMSFT
Copy link
Member

/cc @dotnet/jit-contrib PTAL. I'm signed off.

@JulieLeeMSFT
Copy link
Member

@EgorBo, PTAL.

@JulieLeeMSFT JulieLeeMSFT requested a review from EgorBo September 8, 2025 16:26
Copy link
Member

@JulieLeeMSFT JulieLeeMSFT left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI community-contribution Indicates that the PR has been added by a community member

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants