Fix "Cleanup some string operating functions" PR #100451

AaronRobinsonMSFT · 2024-03-29T20:20:49Z

Unrevert #96099

The revert of #96099 was done in #97264. Commits after the first fix troublesome locations.

See #97264 for details on what the original PR impacted.

/cc @tommcdon @hoyosjs @huoyaoyuan

This reverts commit b8114f9.

dotnet-policy-service · 2024-03-29T20:21:16Z

Tagging subscribers to this area: @mangod9
See info in area-owners.md if you want to be subscribed.

src/coreclr/inc/corhlprpriv.h

Use correct error code mechanism.

src/coreclr/vm/methodtablebuilder.cpp

src/coreclr/vm/classcompat.cpp

jkotas · 2024-03-30T22:52:53Z

src/coreclr/utilcode/sstring.cpp

-    if (SUCCEEDED(hr))
-    {
-        s.Resize(length, REPRESENTATION_UTF8);
+    COUNT_T length = WszWideCharToMultiByte(CP_UTF8, 0, GetRawUnicode(), GetRawCount()+1, NULL, 0, NULL, NULL);


I assume that this used FString::Unicode_Utf8_Length to avoid some perf problems in WszWideCharToMultiByte. Is switching to WszWideCharToMultiByte going to introduce performance regression? (I am particularly worried about Windows.)

I will check here.

Regardless, in principle I would prefer to defer to system APIs that we can/should assume are optimized. Having these narrowly defined "optimizations" is a non-trivial tax for an already covoluted space like SString.

I agree that we should get rid of FString::*. We may want to replace it with a call to the minipal UTF8 methods for perf reasons like what the original change tried to do. I would expect that minipal UTF8 convertors are going to have significantly better perf than Windows OS WideCharToMultiByte.

system APIs that we can/should assume are optimized

Historically, Windows OS WideCharToMultiByte has been significantly slower for UTF8 than one would expect from a reasonably optimized implementation.

Would static linking simdutf to the VM for such conversions make sense here?

We may want to replace it with a call to the minipal UTF8 methods for perf reasons like what the original change tried to do.

So that I can get behind. There is a small performance regression here. Let me replace this with the minipal APIs. I chose this direction to match the other API. I'd prefer symmetrical API usage in this case as it avoids confusion.

Would static linking simdutf to the VM for such conversions make sense here?

That is something we can discuss, but it not for this PR.

we can push the hotpaths to managed

That seems unlikely in this case. The use of SString is litered throughout the system and there are many places where it is going to be unnatural and difficult to call into managed code. We can try jumping out in some cases, but that isn't something I am inclined to do in this PR.

My bar for these types of changes is to avoid perf regressions. (Of course, perf improvements are nice - but it is better to evaluate them separately if they come with tradeoffs.)

Neither the existing FString fast path code nor the WideCharToMultiByte implementation built into Windows use any vectorization tricks. It suggests that we should not need vectorization tricks to avoid the perf regression here.

The shape of the code in WideCharToMultiByte built into Windows is actually very similar to the corefx implementation and the minipal. It seems that the minipal implementation picked up some cruft alone the way that makes it quite a bit slower.

It is interesting that we picked up minipal implementation for Windows + Unix in mono (and Unix in coreclr as before) in .NET 8 and haven't seen any report of regression. Maybe there is some low-hanging alignment issue which can be tweaked via cl switch?

picked up minipal implementation for Windows + Unix in mono

Mono does not get as much perf scrutiny as CoreCLR.

low-hanging alignment issue

I do not think it is that.

From a cursory look, I see two problems:

For smaller lengths, there is a fixed overhead. Adding a simple FString-like loop that deals with small ASCII-only strings as the first thing in the minipal_* methods should fix that.

For longer lengths, the code of the core look does not look great (e.g. it uses more registers than necessary). For example, it may help to change

runtime/src/native/minipal/utf8.c

Lines 1613 to 1618 in 7b18be5

*pTarget = (unsigned char)ch;

*(pTarget + 1) = (unsigned char)(ch >> 16);

pSrc += 4;

*(pTarget + 2) = (unsigned char)chc;

*(pTarget + 3) = (unsigned char)(chc >> 16);

pTarget += 4;

to

*pTarget = (unsigned char)ch; *(pTarget + 2) = (unsigned char)(chc); pSrc += 4; *(pTarget + 1) = (unsigned char)(ch >> 16); *(pTarget + 3) = (unsigned char)(chc >> 16); pTarget += 4;

dotnet-policy-service · 2024-06-02T07:17:01Z

Draft Pull Request was automatically closed for 30 days of inactivity. Please let us know if you'd like to reopen it.

AaronRobinsonMSFT added 2 commits March 29, 2024 13:10

Revert "Revert pr#96099 (dotnet#97264)"

6c640f5

This reverts commit b8114f9.

Fix up API usage issues

0c81a7f

AaronRobinsonMSFT added the area-VM-coreclr label Mar 29, 2024

AaronRobinsonMSFT added this to the 9.0.0 milestone Mar 29, 2024

dotnet-policy-service bot assigned AaronRobinsonMSFT Mar 29, 2024

AaronRobinsonMSFT requested review from tommcdon and jkotas March 30, 2024 15:21

AaronRobinsonMSFT marked this pull request as ready for review March 30, 2024 15:21

jkotas reviewed Mar 30, 2024

View reviewed changes

src/coreclr/inc/corhlprpriv.h Outdated Show resolved Hide resolved

AaronRobinsonMSFT commented Mar 30, 2024

View reviewed changes

src/coreclr/inc/corhlprpriv.h Outdated Show resolved Hide resolved

AaronRobinsonMSFT commented Mar 30, 2024

View reviewed changes

src/coreclr/inc/corhlprpriv.h Outdated Show resolved Hide resolved

Apply suggestions from code review

5412a53

Use correct error code mechanism.

jkotas reviewed Mar 30, 2024

View reviewed changes

AaronRobinsonMSFT marked this pull request as draft May 3, 2024 04:10

am11 mentioned this pull request May 21, 2024

Simplify utf8.c #102424

Closed

dotnet-policy-service bot removed this from the 9.0.0 milestone Jun 2, 2024

dotnet-policy-service bot closed this Jun 2, 2024

github-actions bot locked and limited conversation to collaborators Jul 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix "Cleanup some string operating functions" PR #100451

Fix "Cleanup some string operating functions" PR #100451

AaronRobinsonMSFT commented Mar 29, 2024 •

edited

Loading

dotnet-policy-service bot commented Mar 29, 2024

jkotas Mar 30, 2024

AaronRobinsonMSFT Apr 1, 2024

jkotas Apr 1, 2024

MichalPetryka Apr 1, 2024

AaronRobinsonMSFT Apr 1, 2024

AaronRobinsonMSFT Apr 12, 2024

jkotas Apr 12, 2024

jkotas Apr 12, 2024

am11 Apr 12, 2024

jkotas Apr 12, 2024 •

edited

Loading

dotnet-policy-service bot commented Jun 2, 2024

	*pTarget = (unsigned char)ch;
	*(pTarget + 1) = (unsigned char)(ch >> 16);
	pSrc += 4;
	*(pTarget + 2) = (unsigned char)chc;
	*(pTarget + 3) = (unsigned char)(chc >> 16);
	pTarget += 4;

Fix "Cleanup some string operating functions" PR #100451

Fix "Cleanup some string operating functions" PR #100451

Conversation

AaronRobinsonMSFT commented Mar 29, 2024 • edited Loading

dotnet-policy-service bot commented Mar 29, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jkotas Apr 12, 2024 • edited Loading

Choose a reason for hiding this comment

dotnet-policy-service bot commented Jun 2, 2024

AaronRobinsonMSFT commented Mar 29, 2024 •

edited

Loading

jkotas Apr 12, 2024 •

edited

Loading