HTTP/2 has poor throughput with content larger than initial receive window size #43086

brporter · 2020-10-06T13:45:33Z

Description

SocketsHttpHandler request performance is roughly 5X slower when using HTTP/2.0 than when using HTTP/1.1 to the same destination. There are no meaningful variations in flow control characteristics at the transport (TCP) protocol layers. It appears to be an issue in the HTTP/2.0 implementation itself.

It does not appear to be a server-side issue, as the equivalent HTTP/2.0 request issued using WinHttpHandler does not exhibit the performance reduction against the same HTTP/2.0 server.

Minimal repro:

using System;
using System.Threading.Tasks;
using System.Diagnostics;
using System.Net.Http;

namespace sockinv
{
    class Program
    {
        const uint BYTE_LENGTH = 26_214_400; // 25MB

        static void Main(string[] args)
        {
            var timer = new Stopwatch();

            timer.Start();
            using (var handler = new SocketsHttpHandler())
            {
                var result = TestHandler(handler, new Version(1, 1));
                result.Wait();
                timer.Stop();

                Console.WriteLine($"SocketsHttpHandler (Success: {result.Result}) HTTP/1.1 in {timer.ElapsedMilliseconds}ms ({BYTE_LENGTH / timer.ElapsedMilliseconds / 1000:N3} MB/s)");
            }

            timer.Restart();
            using (var handler = new SocketsHttpHandler())
            {
                var result = TestHandler(handler, new Version(2,0));
                result.Wait();
                timer.Stop();

                Console.WriteLine($"SocketsHttpHandler (Success: {result.Result}) HTTP/2.0 in {timer.ElapsedMilliseconds}ms ({BYTE_LENGTH / timer.ElapsedMilliseconds / 1000:N3} MB/s)");
            }

            timer.Restart();
            using (var handler = new WinHttpHandler())
            {
                var result = TestHandler(handler, new Version(1, 1));
                result.Wait();
                timer.Stop();

                Console.WriteLine($"WinHttpHandler (Success: {result.Result}) HTTP/1.1 in {timer.ElapsedMilliseconds}ms ({BYTE_LENGTH / timer.ElapsedMilliseconds / 1000:N3} MB/s)");
            }

            timer.Restart();
            using (var handler = new WinHttpHandler())
            {
                var result = TestHandler(handler, new Version(2, 0));
                result.Wait();
                timer.Stop();

                Console.WriteLine($"WinHttpHandler (Success: {result.Result}) HTTP/2.0 in {timer.ElapsedMilliseconds}ms ({BYTE_LENGTH / timer.ElapsedMilliseconds / 1000:N3} MB/s)");
            }
        }

        static HttpRequestMessage GenerateRequestMessage(Version httpVersion, uint bytes)
        {
            // Replace the URL below with the URL of server that can generate an arbitrary number of bytes
            return new HttpRequestMessage(HttpMethod.Get, $"<YOUR_URL_HERE>&length={bytes}")
            {
                Version = httpVersion
            };
        }

        static async Task<bool> TestHandler(HttpMessageHandler handler, Version httpVersion)
        {
            using (var client = new HttpClient(handler, false))
            {
                var message = GenerateRequestMessage(httpVersion, BYTE_LENGTH);
                var response = await client.SendAsync(message);

                return response.IsSuccessStatusCode;
            }
        }
    }
}

Configuration

Windows 20H2. .NET Core 3.1 & .NET 5.0 were both tested with the same results.

Regression?

Not certain, I didn't test back before .NET Core 3.1.

Data

Example run of the above repro to my test server at 4ms RTT:

SocketsHttpHandler (Success: True) HTTP/1.1 in 622ms (42.000 MB/s)
SocketsHttpHandler (Success: True) HTTP/2.0 in 2220ms (11.000 MB/s)
WinHttpHandler (Success: True) HTTP/1.1 in 465ms (56.000 MB/s)
WinHttpHandler (Success: True) HTTP/2.0 in 489ms (53.000 MB/s)

Analysis

I took a packet capture while exercising the above repro and noted a number of things. First and foremost, SocketsHttpHandler (in the HTTP/1.1 case) and WinHttpHandler (in both the HTTP/1.1 and HTTP/2 cases) appear to be exercising the congestive limit of the network I am on. Additionally, my server is capable of extracting TCP ESTAT data for each request, and the results are unambiguous (and confirmed via packet capture) - congestive loss conspired to reduce the congestion window and cause slower throughput overall.

There does appear to be an issue in read rate in the SocketsHttpHandler HTTP/1.1 case that causes the receive window to grow more slowly (accounting for the very repeatable pullback of ~10MB/s at 4ms RTT between SocketsHttpHandler and WinHttpHandler). That's not the issue in this bug, though.

The real issue is the enormous pullback for HTTP/2 in SocketsHttpHandler. That transfer rate is not explained by any activity at the TCP layer. Receive window space is ample, no congestive loss is observed (and in fact at the server there is ample congestion window space available). TCP is mostly not sending data because it has not been provided data to send. This would indicate an issue at the server, except alternative HTTP/2 client implementations do not exhibit this behavior to this same server (e.g. WinHttpHandler, curl, etc.). Note that in the above dataset, WinHttpHandler using HTTP/2 can hit 53MB/s (again, the congestive limit of this network confirmed with server-side TCP ESTATS and a packet capture).

Interestingly, a packet capture at the client shows that the segments follow a transmission pattern where a small number of bytes are sent punctuated by an RTT-based delay. This indicates a buffering issue.

Given the lack of TCP receive window flow-control impact and the lack of congestive loss in the slow SocketsHttpHandler HTTP/2 case, I suspect an issue in the implementation of HTTP/2 flow control in SocketsHttpHandler is causing the server to starve the TCP connection for bytes.

ghost · 2020-10-06T13:45:47Z

Tagging subscribers to this area: @dotnet/ncl
See info in area-owners.md if you want to be subscribed.

scalablecory · 2020-10-06T19:32:07Z

This is almost certainly due to our using a fixed size receive window rather than growing it based on utilization/latency. I verified our bandwidth utilization for HTTP/2 is affected -- the result is that if we're receiving something larger than the per-stream receive window, the bandwidth is negatively affected as latency grows.

Kestrel was affected by this too, when I last checked, though they used a larger buffer size and so they would be a little less affected. Regardless, we should try to share this window management code as part of this fix.

brporter · 2020-10-06T20:50:20Z

@scalablecory I ran my repro with tracing enabled for the Microsoft-System-Net-Http trace provider, and I see a ton of window increments being generated for very small sizes, typically the exact size of the previously received frame. Example:

[38:706] [ProcessIncomingFramesAsync]: Frame 48: StreamId=1; Type=Data; Flags=None; Length=8192.
[38:706] [ExtendWindow]: amount=8192
[38:706] [ExtendWindow]: _pendingWindowUpdate 289008 < 8388608.
[38:706] [ReadFrameAsync]: initialFrame=False
[38:706] [EnsureIncomingBytesAsync]: minReadBytes=9
[38:706] [StartWriteAsync]: writeBytes=13
[38:706] [SendWindowUpdateAsync]: Started writing. amount=8192
[38:706] [FinishWrite]: flush=Now

Examining an HTTP/2 transaction with (fumbles about for any other implementation I can lay hands on, grabs Firefox) and I noticed two things:

Firefox updates the window significantly less frequently than SocketsHttpHandler.
Those updates are of sizes that seem to equal large coalesced reads (e.g. Firefox increments the window by megabytes at a time)

I can share the full log + decrypted Firefox capture should you be so interested.

geoffkizer · 2020-10-06T21:06:50Z

There's code to defer window updates until they meet a minimum threshold. The threshold is 1/8th of window size. For the connection window, the window size is large -- 64MB. For the stream window, it's 64K, which explains why you see updates each time when you receive 8K.

So it's really the same problem here -- stream receive window is too small.

See #1587.

We either need to
(a) dynamically size the stream receive window
(b) or alternatively, let you configure it yourself, i.e. make it your problem instead of ours.

If Kestrel already has an implementation for (a) then we should just steal it.

scalablecory · 2020-10-10T16:07:12Z

One might look at compatibly-licensed MsQuic to find a good way to dynamically size the receive window.

halter73 · 2020-10-12T21:00:02Z

If Kestrel already has an implementation for (a) then we should just steal it.

Kestrel connection and streams window sizes are not dynamic, but they are configurable. The default sizes are 128KB for the connection window and 96KB for the stream window.

Kestrel does send less window updates because Kestrel's threshold is half of the window size instead of 1/8th of the window size for both the connection and stream. This hasn't been tuned. I do worry that sending window updates too infrequently causes stalls in high-latency scenarios. We should work together to improve this in Kestrel as well.

kamronbatman · 2020-12-04T19:30:34Z

Any updates on this? This affects quite a bit of run-of-the-mill application logic that reaches out to 3rd party APIs. I haven't tested it with large downloads (which is another use-case I have), but I assume that is negatively affected considerably.

scalablecory · 2020-12-04T23:58:04Z

@kamronbatman the issue being discussed in this PR will only affect downloads >64KB. If you've got a perf case with content smaller than that, we'd be very interested -- please file a separate issue so we can keep each one focused on a single problem.

kamronbatman · 2020-12-05T16:15:51Z

I have downloads ranging from 135 bytes to 600MB. So in general I am interested to know the status since it definitely affects my use cases one way or another. :)

DenSmoke · 2021-03-11T21:55:08Z

Is there any workaround until .NET 6 release? We are facing this problem since .NET Core 3.

EatonZ · 2021-03-14T03:16:53Z

@DenSmoke My solution is basically this:

If you know the response body will be larger than 64 KB, then use HTTP 1.1. You can do so like this:
var request = new HttpRequestMessage(RequestMethod, URI) { Version = HttpVersion.Version11, VersionPolicy = ForceHTTP11 ? HttpVersionPolicy.RequestVersionExact : HttpVersionPolicy.RequestVersionOrHigher };

It looks like the .NET team will improve things for .NET 6, but until then, this seems to be the best workaround.

JamesNK · 2021-03-18T05:16:14Z

This impacts gRPC clients that send/receive large messages. Unfortunatly switching to HTTP/1.1 isn't an option with gRPC.

erikmav · 2021-03-19T20:45:52Z

+1 to @JamesNK's comment, a switch from the Google gRPC client to the .NET client in a system that does ~100MB downloads results in a download speed of 4.5MB/sec over WAN, as opposed to link saturation (1Gbps, 125MB/sec max theoretical) like the Google client. If I park the .NET client on the same datacenter backbone as the server it goes back to link saturation.

SartorialOffense · 2021-03-19T22:10:58Z

@Erikma I see very similar results.

karelz · 2021-03-23T09:06:02Z

We have plans to address the issue in .NET 6 -- we have proposal for a proper fix and we also started working last week on initial window update size settings to enable advanced custom configuration (see #49897).
Sadly, the proper fix seems to be rather larger feature. It may not be eligible for backporting to servicing branches of .NET 5.

kamronbatman · 2021-03-23T19:49:20Z

That's pretty unfortunate especially if there will be large changes needed to go from .NET 5 to .NET 6. Damn.

karelz · 2021-03-23T21:17:43Z

Why would you expect large changes to go from 5 to 6?

kamronbatman · 2021-03-23T21:24:42Z

I have no expectations that there will be, and I am also hoping there won't. :)

Fixes #43086 by introducing automatic scaling of the HTTP/2 stream receive window based on measuring RTT of PING frames.

erikmav · 2021-07-15T18:54:43Z

@antonfirsov @karelz Did this make it into net6-preview6 or do we need to wait for preview7 to retest?

karelz · 2021-07-15T20:07:04Z

It is part of Preview 7, you can check daily builds, or preview branch early builds to check it out.
We would welcome feedback early in case we have to adjust things based on the feedback.
Thanks!

brporter added the tenet-performance Performance related issue label Oct 6, 2020

Dotnet-GitSync-Bot added area-System.Net.Http untriaged New issue has not been triaged by the area owner labels Oct 6, 2020

karelz added this to the 6.0.0 milestone Oct 6, 2020

karelz added bug and removed untriaged New issue has not been triaged by the area owner labels Oct 6, 2020

CarnaViire self-assigned this Oct 26, 2020

scalablecory changed the title ~~SocketsHttpHandler HTTP/2.0 Performance vs. HTTP/1.1 Performance~~ HTTP/2 Receive window has poor throughput with content large than initial window size Dec 4, 2020

scalablecory changed the title ~~HTTP/2 Receive window has poor throughput with content large than initial window size~~ HTTP/2 has poor throughput with content larger than initial receive window size Dec 4, 2020

scalablecory added enhancement Product code improvement that does NOT require public API changes/additions and removed bug labels Dec 4, 2020

karelz mentioned this issue Jan 12, 2021

Developers get high-performant HTTP stack #46850

Closed

2 tasks

ManickaP mentioned this issue Jan 18, 2021

HTTP/2 - Large download speed is dramatically slower than HTTP/1.1 #47092

Closed

This was referenced Feb 25, 2021

Http/2 Post With Large Message Size Hangs Until Timeout #48736

Closed

Http/2 Significantly Slower Than Http/1 With Large Content and High Latency #48739

Closed

davidfowl mentioned this issue Mar 15, 2021

Low throughput with large messages and high latency (25ms RTT) grpc/grpc-dotnet#1238

Closed

CarnaViire mentioned this issue Mar 19, 2021

Configure HTTP/2 receive window size #49897

Closed

JamesNK mentioned this issue Mar 30, 2021

Tune Window Update batching algorithm dotnet/aspnetcore#4734

Open

antonfirsov self-assigned this Apr 15, 2021

karelz unassigned CarnaViire May 6, 2021

antonfirsov mentioned this issue May 17, 2021

HTTP2 Dynamic Window (prototype/early draft) #52862

Closed

antonfirsov mentioned this issue May 27, 2021

Minimalistic HTTP2 flow control API #53372

Closed

antonfirsov mentioned this issue Jun 25, 2021

Implement dynamic HTTP/2 window scaling #54755

Merged

ghost added the in-pr There is an active PR which will close this issue when it is merged label Jun 25, 2021

antonfirsov closed this as completed in #54755 Jul 8, 2021

antonfirsov added a commit that referenced this issue Jul 8, 2021

Implement dynamic HTTP/2 window scaling (#54755)

c7ffa32

Fixes #43086 by introducing automatic scaling of the HTTP/2 stream receive window based on measuring RTT of PING frames.

ghost removed the in-pr There is an active PR which will close this issue when it is merged label Jul 8, 2021

ghost locked as resolved and limited conversation to collaborators Aug 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HTTP/2 has poor throughput with content larger than initial receive window size #43086

HTTP/2 has poor throughput with content larger than initial receive window size #43086

brporter commented Oct 6, 2020 •

edited by karelz

Loading

ghost commented Oct 6, 2020

scalablecory commented Oct 6, 2020 •

edited

Loading

brporter commented Oct 6, 2020

geoffkizer commented Oct 6, 2020 •

edited

Loading

scalablecory commented Oct 10, 2020

halter73 commented Oct 12, 2020

kamronbatman commented Dec 4, 2020

scalablecory commented Dec 4, 2020

kamronbatman commented Dec 5, 2020 •

edited

Loading

DenSmoke commented Mar 11, 2021

EatonZ commented Mar 14, 2021

JamesNK commented Mar 18, 2021 •

edited

Loading

erikmav commented Mar 19, 2021

SartorialOffense commented Mar 19, 2021

karelz commented Mar 23, 2021

kamronbatman commented Mar 23, 2021

karelz commented Mar 23, 2021

kamronbatman commented Mar 23, 2021

erikmav commented Jul 15, 2021

karelz commented Jul 15, 2021

HTTP/2 has poor throughput with content larger than initial receive window size #43086

HTTP/2 has poor throughput with content larger than initial receive window size #43086

Comments

brporter commented Oct 6, 2020 • edited by karelz Loading

Description

Configuration

Regression?

Data

Analysis

ghost commented Oct 6, 2020

scalablecory commented Oct 6, 2020 • edited Loading

brporter commented Oct 6, 2020

geoffkizer commented Oct 6, 2020 • edited Loading

scalablecory commented Oct 10, 2020

halter73 commented Oct 12, 2020

kamronbatman commented Dec 4, 2020

scalablecory commented Dec 4, 2020

kamronbatman commented Dec 5, 2020 • edited Loading

DenSmoke commented Mar 11, 2021

EatonZ commented Mar 14, 2021

JamesNK commented Mar 18, 2021 • edited Loading

erikmav commented Mar 19, 2021

SartorialOffense commented Mar 19, 2021

karelz commented Mar 23, 2021

kamronbatman commented Mar 23, 2021

karelz commented Mar 23, 2021

kamronbatman commented Mar 23, 2021

erikmav commented Jul 15, 2021

karelz commented Jul 15, 2021

brporter commented Oct 6, 2020 •

edited by karelz

Loading

scalablecory commented Oct 6, 2020 •

edited

Loading

geoffkizer commented Oct 6, 2020 •

edited

Loading

kamronbatman commented Dec 5, 2020 •

edited

Loading

JamesNK commented Mar 18, 2021 •

edited

Loading