Watchdog in AsyncTCP when sending very long chunked response #165

Levak · 2024-12-02T22:48:50Z

Levak
Dec 2, 2024

Hi there!

Description

When the ESPAsyncWebServer sends a very long chunked response (over 30 seconds), the ESP32 crashes on a watchdog timeout in AsyncTCP. I can see in Chrome DevTools that the chunked answer is being sent correctly, one chunk at a time, but when the 30 seconds mark hits, the ESP32 resets.

In my code, I am trying to traverse a list of files from an SD card. After roughly 300 files (700 in this particular test) sent, aka 30 seconds, the problem appears.
Pagination can be implemented, but we are trying to stay retro-compatible with an official WebUI that lacks such feature. This is sadly a regression compared to a basic WebServer app, where the timeout can be as long as one wants, as long as the client doesn't disconnect.

Link: https://github.com/Levak/sdwifi/blob/async/sdwifi.ino#L714

Board: esp32-pico-d4 (Fysetc SD WIFI PRO)

Stack trace

E (809882) task_wdt: Task watchdog got triggered. The following tasks/users did not reset the watchdog in time:
E (809882) task_wdt: - async_tcp (CPU 0/1)
E (809882) task_wdt: Tasks currently running:
E (809882) task_wdt: CPU 0: IDLE0
E (809882) task_wdt: CPU 1: loopTask

Additional notes

Old discussion about this code

mathieucarbou · 2024-12-02T23:06:34Z

mathieucarbou
Dec 2, 2024
Maintainer

That's not a bug, that is exactly what is expected.

async_tcp is the async com callback which executed in the context of tcp network callbacks. Code executing within these callbacks should be fast
the TWDT is a safety measure telling you that you are stalling all the cpu and core tasks like BLE, WiFi, etc cannot safely execute anymore.

You need to design you code to work another way, or use SSE or websockets.

9 replies

mathieucarbou Dec 9, 2024
Maintainer

EDIT: I just tried the 160k dummy request, I see now that it's way too fast, so it does spit the 160k in no time. I added while (millis() - start < 1500); to match my usecase where it's slow to retrieve data. I can reproduce the timeout.

Like I said earlier, and also on another issue, you must not stall the async_tcp response callback so much. The callback at this point is is the context of a tcp com, called to fill the pcb client space and send the buffer. If you write blocking code there, you will stall the ESP execution of remaining tasks and the watchdog triggers hopefully to tell you that.

You need to re-design the way you communicate the data.

The problem is that a SD card is really slow.

You could do that through websocket, or SSE, this would be more efficient and won't stall. The idea is to receive the list request from the UI, display a spinning icon, and trigger a task somewhere in the ESP (or a flag read from the loop) to start listing and each time you have a batch of files to send, you send it.

Decouple the heavy processing of the request from the request itself by using async responses (ws or sse).

Levak Dec 9, 2024
Author

Mathieu,

I totally am on board with you regarding list of files, but am still clueless for downloading files. See, if the download takes more than 30sec, because the SD is slow, the esp crashes on the watchdog. That's what a slowed down 160k dummy chunk handler looks like.

I find it weird that the whole process hangs when the watchdog is disabled, the chunk callback isn't called anymore, after 30 seconds, even when the client disconnects. While before those 30sec, I can handle any other requests just fine, simply because each chunk takes about 1.5ms and yields.

Am gonna repeat myself, but I expect the async library to do that for me. After reading its code yesterday, I found out that there is no busy loop under the hood, each chunk is treated as one small bit of the request, disables the watchdog and moves on, as far as I could tell. To me, something else is cutting the wire.

I don't have full understanding yet as to where things operate between asynctcp and espasync. The way I was testing was with the 160k dummy chunked handler and commented out the code that enables the wdt in asynctcp.

I really hope it's something that can be addressed, and I hope you understand my POV.

mathieucarbou Dec 9, 2024
Maintainer

Yes, I do understand, but do not have any solution either for now.

What do you mean by that ?

the chunk callback isn't called anymore, after 30 seconds, even when the client disconnects.

To me this is normal if the client disconnects to stop the processing ?

mathieucarbou Dec 9, 2024
Maintainer

Also did you try to put the ESP in verbose mode to see if you have some logs from espasyncWs or async tcp ?

mathieucarbou Dec 9, 2024
Maintainer

I pushed an example in main:

  // time curl -N -v -X GET http://192.168.4.1/slow.html --output -
  server.on("/slow.html", HTTP_GET, [](AsyncWebServerRequest* request) {
    request->client()->setRxTimeout(2000);
    AsyncWebServerResponse* response = request->beginChunkedResponse("text/html", [](uint8_t* buffer, size_t maxLen, size_t index) -> size_t {
      Serial.printf("%u\n", index);
      // finished ?
      if (index >= 160000)
        return 0;

      // slow down the task by 2 seconds
      // to simulate some heavy processing, like SD card reading
      delay(100);

      memset(buffer, characters[charactersIndex], 256);
      charactersIndex = (charactersIndex + 1) % sizeof(characters);
      return 256;
    });

    request->send(response);
  });

called with:

time curl -N -v -X GET http://192.168.4.1/slow.html --output -

The request works fine and lasts 1 min:

real	1m3.814s
user	0m0.027s
sys	0m0.122s

Are you sure this is not the browser that is timing out your request ?

Levak · 2024-12-09T09:02:53Z

Levak
Dec 9, 2024
Author

What do you mean by that ?

the chunk callback isn't called anymore, after 30 seconds, even when the client disconnects.

I meant that, after 30 seconds, the ESP stalls indefinitely, even if the client (chrome or curl) disconnects afterward. It does not accept new requests.

Also did you try to put the ESP in verbose mode to see if you have some logs from espasyncWs or async tcp ?

I did, nothing new is printed at the point of the stall.

I pushed an example in main:
called with:
time curl -N -v -X GET http://192.168.4.1/slow.html --output -

Yeah! That's exactly what I'm doing.
I'm a bit confused that it works on your end. I hope it's not something wrong/different in my libraries.
Just FYI (because I cannot test rn, maybe tonight), I am using Arduino IDE with esp32 3.1.0-RC1 lib.

Are you sure this is not the browser that is timing out your request ?

I am testing with curl without any option.

Thank you for trying on your end!

14 replies

Levak Dec 10, 2024
Author

If I put #define CONFIG_ASYNC_TCP_QUEUE_SIZE 128 in AsyncTCP.h then the 160k reaches 120k, and if I set it to 256, I reach the 160k.

I added a dummy counter when pushing/popping elements from _async_queue, and I see that it's rising at a rate of 2/sec reaching 133 queued elements at the last chunk.

If I increase the delay from 100 to 1500ms (closer to my usecase with a slow SD), it still rises at 3.2/1.5sec ~ 2/sec, except it's 10 times slower, hence never reaches the end. Crashes when index is 20k.

To play around, I added a GET parameter to change the time each chunk takes, and see how the number of elements in the _async_queue rises.
3000 ms => 6 every tick => 2/sec
2000 ms => 4 every tick => 2/sec
1000 ms => 2 every tick => 2/sec
500 ms => 1 every tick => 2/sec
250 ms => 1 every 2 tick => 2/sec
125 ms => 1 every 4 tick => 2/sec BUT, we see it fighting and sometimes consuming an element
62 ms => 1 every 7 tick
48 ms => slowly fighting but diverging eventually
47 ms => slowly fighting but always converging

Surprisingly, the default value of CONFIG_ASYNC_TCP_QUEUE_SIZE is 64. 64 / 32sec = 2 seconds. As long as the chunk response is longer than the time to send a packet, every 2 seconds, 3 new elements are added to the queue and never pulled out in time.

I found this post that looks very similar to this issue I'm having.

mathieucarbou Dec 10, 2024
Maintainer

@Levak : wow! Thanks for all this deeper analysis this will definitely help!

Did you try also playing with the ack timeout of async tcp and increase it ? It defaults to 5 sec but maybe it can have an effect here ? I don't know if you have reduced it to 3 sec like I used to in my app.

Maybe try to increase it to 10 sec just to see ?

Levak Dec 10, 2024
Author

Maybe try to increase it to 10 sec just to see ?

You mean this?

request->client()->setRxTimeout(200000);

If yes, then, yes, I was testing all along with 200sec.

mathieucarbou Dec 10, 2024
Maintainer

You mean this?

No I was talking about this one: -D CONFIG_ASYNC_TCP_MAX_ACK_TIME=3000

It defaults to 5 sec... I don't know if the ack timeout can have an effect.

The setRxTimeout should be removed: this is a copy paste leftover - I was surprised to see it there this evening when looking back at the code lol

Levak Dec 10, 2024
Author

No I was talking about this one: -D CONFIG_ASYNC_TCP_MAX_ACK_TIME=3000

Tested with 10 sec, no difference in the rate at which the queue fills up.

vortigont · 2024-12-10T00:34:01Z

vortigont
Dec 10, 2024
Collaborator

interesting.
I join the party. Was able to reproduce same on my side on some NodeMCU clone board.
Used /slow from example. Strange, it simply stops sending any data after around 33 secs.

Chip Info:                                                                                                                                                    
------------------------------------------                                                                                                                    
  Model             : ESP32                                                                                                                                   
  Package           : D0WD-Q5                                                                                                                                 
  Revision          : 3.01                                                                                                                                    
  Cores             : 2                                                                                                                                       
  CPU Frequency     : 240 MHz
  XTAL Frequency    : 40 MHz
  Features Bitfield : 0x00000032
  Embedded Flash    : No
  Embedded PSRAM    : No

Flash Info:
------------------------------------------
  Chip Size         :  4194304 B (4 MB) 
  Block Size        :    65536 B (  64.0 KB)
  Sector Size       :     4096 B (   4.0 KB)
  Page Size         :      256 B (   0.2 KB)
  Bus Speed         : 40 MHz
  Bus Mode          : DIO

0 replies

vortigont · 2024-12-10T14:03:33Z

vortigont
Dec 10, 2024
Collaborator

So... I was able to fix this. Or more like enforce some safety, anyone dare to try?
https://github.com/vortigont/AsyncTCP/tree/queue_ctrl

do not consider this as a final solution, but at least it allows me to pass this slow test with default values

time curl -N -v -X GET http://192.168.1.26/slow.html --output -

real	1m5,767s
user	0m0,030s
sys	0m0,058s

4 replies

mathieucarbou Dec 10, 2024
Maintainer

@vortigont : the problem is more the ack that is not coming faster, right ? So by doing that, you give more time to receive the onAck event ?

vortigont Dec 10, 2024
Collaborator

I'm still digging this code. It's so low level and error prone. Everything goes via a single queue of messages and all code is blocking on this queue, so it's just a matter of chance and time when it will gets stuck. We might need some heavy refactoring there.
So in this particular case with slow chunked response - this is not about the speed and time at all. The amount of data generated is really small and it has plenty of time to be sent over air. So how it stuck here - on response a client code generates first chunk of data then sends it to lwip. lwip pulls callbacks in two cases (simplified) - on network acknowledge and on periodic polling. When acked - an event is posted to queue and then this event again triggers a new user callback which in turn sends another portion on data (and posts another event to the queue). Literally it could autochain this chunked response this way. But periodic polls triggers sending additional messages to the queue and additional user code callbacks. I have not tested it, but it appears that this algo when applied to long lived chunked connections makes user code to fill the socket buff with a smaller and smaller chunks of data each time but the number of chunks in-flight increases with each poll event. On each onPoll we have one more message in the queue that triggers consecutive callbacks. And eventually it ends up with a queue filled with events in flight that triggers their own successors until it self locks. It's a bit similar queue management that I was experimenting on with SSEs, like a deja vu :)
I would rather introduce here some probability-based throttling for polling events or make two queues or event priorities. Not sure. Let's see if helps with that SD card case.

mathieucarbou Dec 10, 2024
Maintainer

Everything goes via a single queue of messages and all code is blocking on this queue, so it's just a matter of chance and time when it will gets stuck. We might need some heavy refactoring there.

That was also a complain from @yubox-node-org and one of the reason why he created AsyncTCPSock.

But strangely, AsyncTCPSock sometime stalls also on a heavy loaded MCU. I suppose there is a task priority issue somewhere - I didn't check deeper. Also AsyncTCPSock has lower perf for SSE, but supports a higher concurrent number of http requests.

@Levak : did you also try swapping AsyncTCP by AsyncTCPSock as described in the README ?

Levak Dec 10, 2024
Author

Nice one @vortigont !

So, I confirm that this commit does prevent the queue from filling up from poll requests on my end as long as there is less than 1/4 of space left in it.

Now, I understand that this fix isn't really pretty. I see one immediate effect to it: Now, AsyncTCP does not like any new connection when it reaches that far (I see the message is queued) until the full chunk response finished. When another "slow" request is queued, it seems to perform 1 or 2 chunks, and just drops when the first "slow" request finishes (meaning, we don't get the full answer of the second "slow" request).

@Levak : did you also try swapping AsyncTCP by AsyncTCPSock as described in the README ?

Well, with 3 "slow" in parallel, it was doing pretty fine (slowed down, of course, but still processing all requests), up until one of them finished and made the board crash on "watchdog timeout" 🤣.

I may be wrong, but I don't see any limit to his write queue, so it was probably allocating too much RAM. I also tried with 1 "fast" 160k, which also crashes at the end but for a different reason:

 assert failed: xQueueSemaphoreTake queue.c:1709 (( pxQueue ))

mathieucarbou · 2024-12-10T20:14:41Z

mathieucarbou
Dec 10, 2024
Maintainer

@vortigont @Levak : fyi i have updated the sample to be able to control the delay + total payload length:

time curl -N -v -G -d 'd=3000' -d 'l=10000' http://192.168.4.1/slow.html --output -

I am finally able to reproduce.

1 reply

mathieucarbou Dec 10, 2024
Maintainer

@Levak : fyi, I use to run AsyncTCP with -D CONFIG_ASYNC_TCP_RUNNING_CORE=1 to control the core used for its async task.

using CONFIG_ASYNC_TCP_RUNNING_CORE automatically disables the TWDT, that is why I was not able to reproduce easily (I had to bump delays a lot and reach a deadlock state without TWDT triggering)

using -D CONFIG_ASYNC_TCP_RUNNING_CORE=1 also avoids having the task pinned to core 0 where all the arduino core code is, and avoids troubles when reading shared states from core 1 (app).

Levak · 2024-12-10T21:09:00Z

Levak
Dec 10, 2024
Author

I guess I have a solution. Separate the _async_queue from poll requests. It works like charm. I went back to 64 messages. I reserve 16 messages for poll events. For every async event processed, I try to process a poll event. Everything is wonderful and it can handle multiple chunked requests at once like a charm 😍

I will need to make a fork to commit the changes a bit later tonight. For now, here is a preview:

 static void _async_service_task(void *pvParameters){
     lwip_event_packet_t * packet = NULL;
     for (;;) {
-        if(_get_async_event(&packet)){
+        if(_get_async_event(&packet) || _get_async_poll(&packet)){
 #if CONFIG_ASYNC_TCP_USE_WDT

QueueHandle_t _async_queue;
+QueueHandle_t _poll_queue;
+static inline bool _init_poll_queue(){
+    if(!_poll_queue){
+        _poll_queue = xQueueCreate(CONFIG_ASYNC_TCP_POLL_QUEUE_SIZE, sizeof(lwip_event_packet_t *));
+        if(!_poll_queue){
+            return false;
+        }
+    }
+    return true;
+}
+static inline bool _send_async_poll(lwip_event_packet_t ** e){
+    return _async_queue && xQueueSend(_poll_queue, e, portMAX_DELAY) == pdPASS;
+}
+static inline bool _get_async_poll(lwip_event_packet_t ** e){
+    return _async_queue && xQueueReceive(_poll_queue, e, portMAX_DELAY) == pdPASS;
+}

    } else if(e->event == LWIP_TCP_CLEAR){
-        _remove_events_with_arg(e->arg);
+        _remove_events_with_arg(_async_queue, e->arg);
+        _remove_events_with_arg(_poll_queue, e->arg);

 static int8_t _tcp_poll(void * arg, struct tcp_pcb * pcb) {
+  // inhibit polling when event queue is getting filled up, let it handle _onack's
+    if (uxQueueMessagesWaiting(_poll_queue) > CONFIG_ASYNC_TCP_POLL_QUEUE_SIZE - 1)
+        return ERR_OK;

     //ets_printf("+P: 0x%08x\n", pcb);
     lwip_event_packet_t * e = (lwip_event_packet_t *)malloc(sizeof(lwip_event_packet_t));
     e->event = LWIP_TCP_POLL;
     e->arg = arg;
     e->poll.pcb = pcb;
-    if (!_send_async_event(&e)) {
+    if (!_send_async_poll(&e)) {
         free((void*)(e));
     }
     return ERR_OK;
 }

3 replies

mathieucarbou Dec 10, 2024
Maintainer

Wow! thanks ! I am applying them to check in both projects the tests and perf tests

mathieucarbou Dec 10, 2024
Maintainer

@Levak :

I've reproduced and adapted your fix in PR: mathieucarbou/AsyncTCP#31

I tested it in the project.

The SSE perf test is 12% faster at now around 530 events / second with 10 concurrent connections

But the HTTP tests is horribly slow:

compared to without the patch:

I suspect that the change helps improves the data flow on long lived connections but something broke concurrency for http requests.

Levak Dec 10, 2024
Author

I've opened it to run some perf test and even if SSE is 12% faster, HTTP does not support concurrent requests anymore with the fix. I don't know why yet but spawning 10 workers with autocanon for 20 sec, I only reach 1 req / sec.

That is not good news. On my end I was able to fire 3 parallel chunk requests, started with a small interval. Tho I noticed the poll queue was full, but not affecting my test.

There is still some digging to do!

mathieucarbou · 2024-12-10T23:48:00Z

mathieucarbou
Dec 10, 2024
Maintainer

@Levak :

Well, you were faster than me :)
I was trying to test with the full application with SD, just found few bugs in it (not AsyncTCP, don't worry) :D
For instance, I am getting another Watchdog timeout BUT NO WORRY, this time, I sent 2 parallel SD listing of 700ish files and I see the time of the handler is rising up to 5 seconds before crashing. I would assume it's reading the SD that is that slow, for now.

I definitely think you should use websocket or sse for that because your SD reading would be done outside of the async_tcp callbacks and you would just need to send events / ws messages after having read a batch or files, folder or whatever.

Also, if you have a TWDT in your app (like i use to) you can apply a TWDT to only some specific tasks., not the SD reading task.

That's what I do in my apps with my TaskManager lib (https://github.com/mathieucarbou/MycilaTaskManager). I am splitting my app parts in tasks (once or recurrent) associate them to managers, which can be started async or not, linked to a TWDT or not. Typically for long running tasks like MQTT publishing I do not activate the TWDT.

8 replies

Levak Dec 11, 2024
Author

equivalent as saying we decrease the priority of polling checks

That was my goal. Did you try swapping them, instead of splitting them?

if(_get_async_poll(&packet) || _get_async_event(&packet)){

mathieucarbou Dec 11, 2024
Maintainer

equivalent as saying we decrease the priority of polling checks

That was my goal. Did you try swapping them, instead of splitting them?

yes, swapping them in the if is worse: html page do not open... no data.

mathieucarbou Dec 11, 2024
Maintainer

By splitting i meant:

        if(_get_async_event(&packet)){
#if CONFIG_ASYNC_TCP_USE_WDT
            if(esp_task_wdt_add(NULL) != ESP_OK){
                log_e("Failed to add async task to WDT");
            }
#endif
            _handle_async_event(packet);
#if CONFIG_ASYNC_TCP_USE_WDT
            if(esp_task_wdt_delete(NULL) != ESP_OK){
                log_e("Failed to remove loop task from WDT");
            }
#endif
        }
        if(_get_async_poll(&packet)){
#if CONFIG_ASYNC_TCP_USE_WDT
            if(esp_task_wdt_add(NULL) != ESP_OK){
                log_e("Failed to add async task to WDT");
            }
#endif
            _handle_async_event(packet);
#if CONFIG_ASYNC_TCP_USE_WDT
            if(esp_task_wdt_delete(NULL) != ESP_OK){
                log_e("Failed to remove loop task from WDT");
            }
#endif
        }

Levak Dec 11, 2024
Author

I am trying to download 4 files in parallel from SD using this command (specific to my project, but you should get the idea of what it's doing) :

# curl -s "http://192.168.1.27/list?path=" | jq '.[].name' | xargs -t -I % -P 4 curl -s "http://192.168.1.27/download?path=%" --output %

I see no issue, all files downloaded, no crash. Windows reports I'm downloading at 5Mbps average. Each file ranges from 1MB to 40MB. Those requests are with request->beginResponse with fixed size. While this is running, I can launch requests that use request->beginChunkedResponse and request->beginStreamResponse.

I don't know how to reproduce your case then.

mathieucarbou Dec 11, 2024
Maintainer

@Levak : with which flavor ? The original one in the PR ?
Did you try the http benchmark like I did ? Serving small requests concurrently on 10 workers for 20 seconds ?

vortigont · 2024-12-11T04:42:53Z

vortigont
Dec 11, 2024
Collaborator

Wow, you had quite productive discussion here, guys.

I understand that this fix isn't really pretty

yeah, that was just a PoC as I told

Separate the _async_queue from poll requests. It works like charm. I went back to 64 messages. I reserve 16 messages for poll events. > For every async event processed, I try to process a poll event.

that could be a solution but not that simple, having two queues brings you an execution ordering problem that might pop out randomly. Some of the messages should be processed in order for a specific connection or a special care should be taken for cases like closing/timeouting etc.. Another side effect is that high polling rate could give you a very low latency but quite slow connections. Which is what Mathieu hit with his benchmark as I see later on.

A very simple, but not perfect, solution is to use probability-based throttling as I mentioned. I've updated my [branch](https://
github.com/vortigont/AsyncTCP/tree/queue_ctrl) for you to test.

    if (uxQueueMessagesWaiting(_async_queue) > (rand() % CONFIG_ASYNC_TCP_QUEUE_SIZE / 2 + CONFIG_ASYNC_TCP_QUEUE_SIZE / 4) )
        return ERR_OK;

Very simple and cheap in terms of memory/resources but it works OK for multiple slow connections too, 'cause it allows poll events from other connections to enter the queue. It distributes unfair and make new connections to start really slow but it would catch up eventually.

In general it needs a more deep refactoring here. I'm not that good at this level but some old networking skills working with traffic shapers rings some bells in my head :) Will try to optimize it a bit more (with a help of testing) :)

Here what it gives me with Mathieu's cannon (I had to comment out rate-limit midware, btw)

autocannon -c 10 -d 20 http://192.168.1.26
Running 20s test @ http://192.168.1.26
10 connections

┌─────────┬────────┬─────────┬──────────┬──────────┬────────────┬────────────┬──────────┐
│ Stat    │ 2.5%   │ 50%     │ 97.5%    │ 99%      │ Avg        │ Stdev      │ Max      │
├─────────┼────────┼─────────┼──────────┼──────────┼────────────┼────────────┼──────────┤
│ Latency │ 378 ms │ 4994 ms │ 10659 ms │ 11076 ms │ 5035.91 ms │ 2817.65 ms │ 11533 ms │
└─────────┴────────┴─────────┴──────────┴──────────┴────────────┴────────────┴──────────┘
┌───────────┬─────────┬─────────┬─────────┬─────────┬─────────┬─────────┬─────────┐
│ Stat      │ 1%      │ 2.5%    │ 50%     │ 97.5%   │ Avg     │ Stdev   │ Min     │
├───────────┼─────────┼─────────┼─────────┼─────────┼─────────┼─────────┼─────────┤
│ Req/Sec   │ 10      │ 10      │ 12      │ 15      │ 12.25   │ 2       │ 10      │
├───────────┼─────────┼─────────┼─────────┼─────────┼─────────┼─────────┼─────────┤
│ Bytes/Sec │ 43.8 kB │ 43.8 kB │ 52.5 kB │ 65.7 kB │ 53.6 kB │ 8.75 kB │ 43.8 kB │
└───────────┴─────────┴─────────┴─────────┴─────────┴─────────┴─────────┴─────────┘

Req/Bytes counts sampled once per second.
# of samples: 20

493 requests in 20.02s, 1.07 MB read

3 replies

mathieucarbou Dec 11, 2024
Maintainer

Here what it gives me with Mathieu's cannon (I had to comment out rate-limit midware, btw)

There is a perf environment in the Pio file: one for AsyncTCPSock and one for AsyncTCP

mathieucarbou Dec 11, 2024
Maintainer

@Levak : could you please test this branch also ?
mathieucarbou/AsyncTCP#32

mathieucarbou Dec 11, 2024
Maintainer

@vortigont : I updated your code to add a debug line:

    if (uxQueueMessagesWaiting(_async_queue) > (rand() % CONFIG_ASYNC_TCP_QUEUE_SIZE / 2 + CONFIG_ASYNC_TCP_QUEUE_SIZE / 4) ) {
        log_d("throttling _tcp_poll");
        return ERR_OK;
    }

I am running with:

  -D CONFIG_ASYNC_TCP_MAX_ACK_TIME=5000 // (keep default)
  -D CONFIG_ASYNC_TCP_PRIORITY=10 // (keep default)
  -D CONFIG_ASYNC_TCP_QUEUE_SIZE=64 // (keep default)
  -D CONFIG_ASYNC_TCP_RUNNING_CORE=1 // force async_tcp task to be on same core as the app (default is core 0)
  -D CONFIG_ASYNC_TCP_STACK_SIZE=4096 // reduce the stack size (default is 16K)

http perf test is what you saw and also match the one from main:

SSE perf tests:

  // With AsyncTCP, with 16 workers: some "Event message queue overflow: discard message", but no crash with a queue size of 64 (128 is too big - especially crashes)
  //
  // Total: 1711 events, 427.75 events / second
  // Total: 1711 events, 427.75 events / second
  // Total: 1626 events, 406.50 events / second
  // Total: 1562 events, 390.50 events / second
  // Total: 1706 events, 426.50 events / second
  // Total: 1659 events, 414.75 events / second
  // Total: 1624 events, 406.00 events / second
  // Total: 1706 events, 426.50 events / second
  // Total: 1487 events, 371.75 events / second
  // Total: 1573 events, 393.25 events / second
  // Total: 1569 events, 392.25 events / second
  // Total: 1559 events, 389.75 events / second
  // Total: 1560 events, 390.00 events / second
  // Total: 1562 events, 390.50 events / second
  // Total: 1626 events, 406.50 events / second
  //
  // With AsyncTCP, with 10 workers:
  //
  // Total: 2038 events, 509.50 events / second
  // Total: 2120 events, 530.00 events / second
  // Total: 2119 events, 529.75 events / second
  // Total: 2038 events, 509.50 events / second
  // Total: 2037 events, 509.25 events / second
  // Total: 2119 events, 529.75 events / second
  // Total: 2119 events, 529.75 events / second
  // Total: 2120 events, 530.00 events / second
  // Total: 2038 events, 509.50 events / second
  // Total: 2038 events, 509.50 events / second

Slow test: time curl -N -v -G -d 'd=3000' -d 'l=10000' http://192.168.4.1/slow.html --output -

Finishes in 2 minutes

real	2m0.157s
user	0m0.011s
sys	0m0.026s

In the output, we can see the random throttling

d = 3000, l = 10000
0
256
512
768
[1043389][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1043896][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
1024
[1046903][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
1280
[1048410][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1049917][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
1536
[1051924][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1052931][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
1792
[1053438][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1054445][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
2048
[1056452][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1057459][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1057966][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1058973][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
2304
[1059480][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1059987][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1060994][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1061501][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
2560
[1062509][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1063516][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1064023][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1064530][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1065036][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
2816
[1066043][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1066550][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1067057][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1067563][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1068070][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
3072
[1068577][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1069084][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1069591][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1070098][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1071105][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
3328
[1071612][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1072119][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1072626][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1073133][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1073640][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1074147][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
3584
[1074654][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1075161][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1075668][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1076175][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1076682][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1077189][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
3840
[1077696][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1078203][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1078710][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1079217][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1079724][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1080231][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
4096
[1080738][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1081245][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1081752][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1082259][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1082766][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1083273][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
4352
[1083780][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1084287][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1084794][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1085301][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1085808][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1086315][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
4608
[1087322][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1087829][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1088336][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1088843][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1089350][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
4864
[1089857][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1090364][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1090871][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1091378][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1091885][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
5120
[1092892][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1093399][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1093906][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1094413][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1094920][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1095427][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
5376
[1095935][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1096442][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1096949][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1097456][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1097963][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
5632
[1098470][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1098977][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1099484][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1099991][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1100498][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1101005][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
5888
[1101511][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1102018][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1102525][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1103032][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1103539][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
6144
[1104546][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1105053][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1105560][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1106067][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1106573][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1107080][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
6400
[1107587][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1108094][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1108601][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1109108][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1109615][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1110122][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
6656
[1110629][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1111136][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1111643][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1112150][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1112657][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1113164][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
6912
[1113671][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1114178][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1114685][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1115192][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1115699][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1116205][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
7168
[1116712][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1117219][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1117726][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1118233][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1118740][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1119247][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
7424
[1119754][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1120261][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1120768][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1121275][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1121782][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1122289][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
7680
[1122796][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1123303][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1123810][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1124317][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1124823][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1125329][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
7936
[1125836][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1126343][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1126849][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1127355][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1127861][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1128368][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
8192
[1128875][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1129382][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1129889][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1130396][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1130903][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1131410][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
8448
[1131917][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1132424][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1132931][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1133438][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1133945][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1134452][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
8704
[1134959][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1135466][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1135973][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1136480][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1136987][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
8960
[1137494][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1138001][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1138508][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1139015][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1139522][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1140029][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
9216
[1140536][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1141043][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1141550][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1142057][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1142564][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1143071][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
9472
[1143578][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1144085][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1144592][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1145099][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1145606][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1146113][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
9728
[1146620][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1147127][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1147634][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1148141][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1148648][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1149155][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
9984
[1149662][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1150169][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1150676][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1151183][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1151690][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
[1152197][D][AsyncTCP.cpp:307] _tcp_poll(): throttling _tcp_poll
10240

In conclusion, so far so good, thanks @vortigont !

I will wait for some more test from @Levak of this PR mathieucarbou/AsyncTCP#32 and once I have your GO, I merge and release.

Note: I also tested Websocket, no issue.

mathieucarbou · 2024-12-11T10:42:07Z

mathieucarbou
Dec 11, 2024
Maintainer

Kudos for all of us participating in this discussion!
This is a good demonstration of remote OSS collaboration on a complex issue!
👏

0 replies

vortigont · 2024-12-11T13:51:59Z

vortigont
Dec 11, 2024
Collaborator

looks not bad as for me. Could be considered as minimal acceptable solution, other more deeper changes (if required) we could suggest/estimate separately?

4 replies

mathieucarbou Dec 11, 2024
Maintainer

Yes, definitely.
I am waiting for @Levak to confirm that your PR works :-)

Levak Dec 11, 2024
Author

I tested few things,

1 long SD list. Works
2 long SD list in parallel > timeout but because the handler itself exceeds 5 seconds. I consider this a bug of my handler, tho I don't understand why the processing time is gradually increased.
2 parallel "slow" (3 sec per chunk) all good
2 parallel "slow" (3sec per chunck) and when it reaches the "throttle", I sent small requests but they would not connect at all. I noticed that if I disconnected and connected several times it would accumulate in the queue, until eventually the watchdog timeouted.

Overall, it's better than nothing. But as @vortigont said, it's not a definitive solution.

Thanks all 👍

mathieucarbou Dec 11, 2024
Maintainer

Thanks! I will issue releases late this evening then.

mathieucarbou Dec 11, 2024
Maintainer

2 parallel "slow" (3sec per chunck) and when it reaches the "throttle", I sent small requests but they would not connect at all. I noticed that if I disconnected and connected several times it would accumulate in the queue, until eventually the watchdog timeouted.

@Levak : I think for this use case you could implement a middleware to allow some download permits: there is an exemple in the project of a rate limiting middleware, so this could be something similar, making sure you limit the number of concurrent downloads with at the same time the number of requests.

mathieucarbou · 2024-12-11T19:16:38Z

mathieucarbou
Dec 11, 2024
Maintainer

Released!

mathieucarbou/AsyncTCP @ 3.2.15
mathieucarbou/ESPAsyncWebServer @ 3.4.0

0 replies

vortigont · 2024-12-12T02:58:09Z

vortigont
Dec 12, 2024
Collaborator

Nice done, team!
Without your inputs it would not be so easy to get into this.

Overall, it's better than nothing. But as @vortigont said, it's not a definitive solution.

Not at all, actually it's not a proper fix for chunked responses as I think, but more of a things that must be there for other cases, like massive SSE's or overloaded websockets which relies on polls a lot. For chunked responses it should be other approach and I'm already have some ideas where to look into. Will keep you in touch in this thread.

And another thing - AsycnTCPSock's implementation is definitely worth a closer look in the light of modern IDF's API. I like its being more abstracted. Let's keep it in mind for upcoming no-8266 option.

1 reply

mathieucarbou Dec 12, 2024
Maintainer

About esp-idf: I am also one of the collaborator in Psychichttp project which is based on esp-idf.

The perf is definitely better when using the esp-idf API, but some features are not possible, like getting all headers.

On my spare time I try to improve Psychic also and do a ESPAsyncWebServer - psychic bridge, but this is very low priority for me since I only use ESPAsyncWS.

For next version 4 of ESPAsyncWS, we also have the choice of dropping AsyncTCP if we want because it was done in the purpose of being swapped by another lib depending on the platform.

But something I would like is that we try to keep the API as close as possible as today, so that users could easily migrate by just doing some search and replace if needs to.

And if we do a v4, I will consider detaching the fork ;-)

vortigont · 2024-12-13T09:02:35Z

vortigont
Dec 13, 2024
Collaborator

I've found out how deep this rabbit hole is :)
Now I have a much more adequate solution and improvement for both AsyncTCP and AsyncSwebServer. It needs a combined efforts to mitigate similar side effects as much as possible.
So I implemented a couple of things that could drastically improve all long lived / high volume responses processing and at least reduce a negative effects of poorly designed user callbacks and provide much more robustness and reliability for multiple simultaneous connections.

One thing is in-flight buffer credits - it intended to throttle refill (user callback) calls for long lived connections for a constantly incoming poll events if there is already enough chunks being transmitted.

Another is buffer refill moderation - it tries to reduce refill (user callback) calls from being called too often when socket buff space is small compared to in-flight data size. This effectively drops ack events and reduces queue size while making sure to have a buffer at leat 50% loaded.

And from the AsyncTCP side it tries to coalesce consecutive poll events into one, it evicts the queue from excessive polls and allows other connections to get more chance to get task's time.
You need both #171 and mathieucarbou/AsyncTCP#33 to work properly.

I've done some basic testing with downloading static file from LittleFS and running that ugly /slow request from test example.
LittleFS file size is 921654 bytes.
Here what I got:

Downloading 10 files in parallel from LittleFS with autocannon -d 60 -t 60 -c 10 -M 1 http://192.168.1.26/splash.bmp
Origin: crash
New code: passed
running /slow.html and downloading one static file in parallel
curl -N -v -G -d 'd=2000' -d 'l=10000' http://192.168.1.26/slow.html --output -
curl -v -X GET http://192.168.1.26/splash.bmp > /dev/null
Origin: file download aborted with 898860 bytes remaining
New code: passed
running /slow.html and downloading two static files in parallel
curl -N -v -G -d 'd=2000' -d 'l=10000' http://192.168.1.26/slow.html --output -
curl -v -X GET http://192.168.1.26/splash.bmp > /dev/null
Origin: both file downloads aborted somewhere in the begining
New code: passed
Downloading 10 files sequentially from LittleFS with autocannon -d 60 -t 60 -c 1 -M 10 http://192.168.1.26/splash.bmp
Origin: passed
New code: passed
simultaneously downloading large file from LittleFS, 7 connections, + running consecutive short static calls, 7 connections

autocannon -d 60 -t 60 -c 7 -M 1 http://192.168.1.26/splash.bmp
autocannon -d 60 -t 60 -c 7 -M 10 http://192.168.1.26/index.html

Origin: crash by watchdog
New code: passed

I do not have SD card setup currently to test, so would appreciate some feedback from @Levak
Not sure how long are SD-CARD operations, but the callbacks there are soooo slow it needs some other approach to drive.
The main thing to conclude here is that the only slow callback task would poison ALL other parallel connections by design no matter how we try.

4 replies

mathieucarbou Dec 13, 2024
Maintainer

Thanks! Let me know when this is ready to review, I will do some perf tests.

For what I understand from te AsyncTCP change, any way, it could be merged as-is right ?

The queue cleanup that you do is already beneficial, even without the ESAsyncWS change right ?

vortigont Dec 14, 2024
Collaborator

Any kind of tests are welcome.

The queue cleanup that you do is already beneficial, even without the ESAsyncWS change right ?

yes, it could be beneficial but in some corner cases like one ugly slow callback or some spike delays. I'm thinking about improving it to also handle interleaved events. But on the other hand I do not want to make it a chase of mitigating all kind of improperly designed code.

Levak Dec 15, 2024
Author

Hey! Thank you for your solution @vortigont !
In the end, I'm happy that it wasn't just a "bad design" of the user code, but rather, a limitation in the library that could be lifted with your work !

Putting that aside, this allowed me to focus more on a more serious problem on the "user code". TL;DR, listing files from an SD card formatted with FAT makes every single open() longer and longer. We're talking about adding 3 more milliseconds for every file. File 1 opens in no time, next one 3ms, next one opens in 6ms, etc. Up to a point where opening a single file takes a second or 2 :D
And once again, it's a known issue in ESP-IDF since 2022, which got supposedly fixed in the next upcoming major release 5.4.

So, thanks to the limitation in ESP-IDF about SD/FAT, we were able to trigger very long chunk handlers in ESPAsyncWebserver and find a limitation in AsyncTCP, what a journey :D

The way I'll be doing the Chunk handler for now, is to put both a limit on size (max_len) but also a limit on how much time each chunk takes, capped at 2 sec. That way, the watchdog that seems to trigger at 5 sec will never trigger, and it's also a good middle ground between browsing lots of files and not being too "hangy".

Thanks all!

mathieucarbou Dec 15, 2024
Maintainer

Note that you also can control the watchdog timeout ;-)

bool Mycila::TaskManager::configureWDT(uint32_t timeoutSeconds, bool panic) {
  LOGI(TAG, "Configuring Task Watchdog Timer (TWDT) to %" PRIu32 " seconds", timeoutSeconds);
#if ESP_IDF_VERSION_MAJOR < 5
  return esp_task_wdt_init(timeoutSeconds, panic) == ESP_OK;
#else
  esp_task_wdt_config_t config = {
    .timeout_ms = timeoutSeconds * 1000,
    .idle_core_mask = (1 << SOC_CPU_CORES_NUM) - 1, // Bitmask of all cores
    .trigger_panic = panic,
  };
  return esp_task_wdt_init(&config) == ESP_OK || esp_task_wdt_reconfigure(&config) == ESP_OK;
#endif
}

vortigont · 2024-12-15T05:39:37Z

vortigont
Dec 15, 2024
Collaborator

another testcase:

simultaneously downloading large file from LittleFS, 7 connections, + running consecutive short static calls, 7 connections

autocannon -d 60 -t 60 -c 7 -M 1 http://192.168.1.26/splash.bmp
autocannon -d 60 -t 60 -c 7 -M 10 http://192.168.1.26/index.html

Origin: crash by watchdog
New code: passed

2 replies

mathieucarbou Dec 15, 2024
Maintainer

Awesome! I will test that today!
Thanks a lot!

mathieucarbou Dec 15, 2024
Maintainer

@vortigont here are some results:

SSE Perf test:

for i in {1..10}; do ( count=$(gtimeout 30 curl -s -N -H "Accept: text/event-stream" http://192.168.4.1/events 2>&1 | grep -c "^data:"); echo "Total: $count events, $(echo "$count / 4" | bc -l) events / second" ) & done;

=> OK, same as main

for i in {1..16}; do ( count=$(gtimeout 30 curl -s -N -H "Accept: text/event-stream" http://192.168.4.1/events 2>&1 | grep -c "^data:"); echo "Total: $count events, $(echo "$count / 4" | bc -l) events / second" ) & done;

=> Wow !!! No messages discarded anymore! This is better!

Total: 1751 events, 437.75000000000000000000 events / second
Total: 1750 events, 437.50000000000000000000 events / second
Total: 1667 events, 416.75000000000000000000 events / second
Total: 1666 events, 416.50000000000000000000 events / second
Total: 1666 events, 416.50000000000000000000 events / second
Total: 1750 events, 437.50000000000000000000 events / second
Total: 1751 events, 437.75000000000000000000 events / second
Total: 1667 events, 416.75000000000000000000 events / second
Total: 1667 events, 416.75000000000000000000 events / second
Total: 1605 events, 401.25000000000000000000 events / second
Total: 1605 events, 401.25000000000000000000 events / second
Total: 1606 events, 401.50000000000000000000 events / second
Total: 1605 events, 401.25000000000000000000 events / second
Total: 1752 events, 438.00000000000000000000 events / second
Total: 1668 events, 417.00000000000000000000 events / second

WS tests

=> OK

HTTP perf test

10 workers: same as main (autocannon -c 10 -w 10 -d 20 http://192.168.4.1)

16 workers (autocannon -c 16 -w 16 -d 20 http://192.168.4.1)

slow test

time curl -N -v -G -d 'd=3000' -d 'l=10000' http://192.168.4.1/slow.html --output -

ends correctly with:

real	2m2.823s
user	0m0.010s
sys	0m0.021s

I see a bunch of debug logs (expected):

649559][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[649571][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[649583][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[649595][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[649607][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[649619][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[649631][D][WebResponses.cpp:362] _ack(): (chunk) out of in-flight credits

Now, I am combining:

time curl -N -v -G -d 'd=3000' -d 'l=10000' http://192.168.4.1/slow.html --output -
with
autocannon -c 10 -w 10 -d 20 http://192.168.4.1

I see a lot of :

890246][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890252][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890257][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890263][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890269][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890275][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890280][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890286][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890292][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890298][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890304][D][AsyncTCP.cpp:379] _tcp_poll(): throttling
[890618][D][AsyncTCP.cpp:187] _get_async_event(): coalescing polls, network congestion or async callbacks might be too slow!
[890630][D][WebResponses.cpp:362] _ack(): (chunk) out of in-flight credits

Maybe too much...

So I am now trying:

time curl -N -v -G -d 'd=3000' -d 'l=10000' http://192.168.4.1/slow.html --output -
with
autocannon -c 2 -w 2 -d 20 http://192.168.4.1

So concurrent requests are still served, but slower. And the more we have concurrent req, the more the req/sec number increasses.

But what I do not understand is why it takes about 5 sec on average to serve 2 requests... The delay in my slow test is 3 sec, which is pausing the chunk process for a very long time.

But I a wondering if there is a way to give a priority over serving slower requests... This might be a little complicated because it would require probably to save some states (counters) and be able to re-order thee queue to process events with lower counter values...

Any way... I will merge all that and release.

Thanks!

vortigont · 2024-12-16T03:26:26Z

vortigont
Dec 16, 2024
Collaborator

thanks for that issues links, @Levak. I never know that this problem exists in fatfs code actially. Maybe because I have never used it yet? :)
Would appreciate if you could test large files (a dozens of MiBs) downloads from an SD-card with a new code.

The way I'll be doing the Chunk handler for now, is to put both a limit on size (max_len) but also a limit on how much time each chunk takes, capped at 2 sec.

sorry to say but that is not the right way to do this also :( By doing this you would suffocate all other webserver operations also including websockets if you would switch to this approach.
Putting aside that this is a problem of fatsfs for now any other similar scenarios should be done in a different way.

Something like:

a request is received by webserver to reply with a long chunked data where each iteration could take tons of time
a request should trigger creating another separate task that would do the heavy lifting and postpone fs operation to this task, bind this task to arduino's core 1 so not to affect all other stuff running on core 0
immediately reply with empty chunked response message to "come later"
a task should do fs reading and put the result into some shared buffer or queue
a chunked callback should periodically check the buffer for a new data, if found then send a next chunk of data, otherwise immediately reply with "come later"

That's not simple piece of code but "this is the way" :)
Maybe I should create an example code for asyncwebserver on how to do this for future reference.

6 replies

vortigont Dec 16, 2024
Collaborator

File downloads (even very large ones) should no longer be affected with a recent in-flight credits control.
FileResponse handler would use it too.
I need to get a board with sd-card and test it.

Levak Dec 17, 2024
Author

You would not believe me, but I found a way to speed up SD directory listing by more than 300 times!

I looked up a lot of existing threads of ppl sad that SD listing is slow on ESP32. The root of the issue is first because FATFS is a linked list of sectors and blocks, so listing a directory is sure straightforward, but using functions like open() or stat() would rewind that linked list from the beginning. I measured stat() took less than 1ms on the first file of the directory, but 25ms for file ID 655. That's 655*25ms/2 = 8 seconds just for calling stat() on all files of that directory. And here is the second problem: open() calls stat() twice, and I was doing it 3 times too (one for get_sfn(), one for f.size() and one for f.getLastWrite()). 5*8 = 40! That's precisely the time it took for the full directory traversal in my code.

One "optimisation" found by some other community members was to introduce getNextFileName() instead of openNextFile(), but since it was only running readdir(), this would solely spit file names. To get more information, one would need to stat() the file name and get the same slowdowns (in my example, +8 secs).

Then I dug at the lowlevel implementation of readdir() trying to find if the information I needed (size, last modified time and ShortFileName format) was available somewhere. readdir() returning a struct dirent, this was a deadend. Then I found about f_readdir() which returns a FILINFO, exactly the type I request in my get_sfn() function that calls f_stat().

The implementation becomes trivial:

FRESULT res = f_opendir(&states->dir, path.c_str());
FILINFO fno;
while ( FR_OK == f_readdir(&states->dir, &fno) && fno.fname[0] != 0) {
  isDir = fno.fattrib & AM_DIR;
  // fno.altname (SFN)
  // fno.fname (LFN)
  // fno.fdate & fno.ftime (last modified time)
  // fno.fsize (Size)
  [ ... ]
}

Before:

# time curl -s "http://192.168.1.27/list?path=heavydir" > /dev/null
real    0m44.313s
user    0m0.000s
sys     0m0.016s

After:

# time curl -s "http://192.168.1.27/list?path=heavydir" > /dev/null
real    0m0.481s
user    0m0.000s
sys     0m0.016s

Logs:

[141679][I][sdwifi.ino:364] mountSD(): SD Card mount
[141777][I][sdwifi.ino:836] operator()(): index:0, max_len:5511, AQ:0
[141792][I][sdwifi.ino:877] operator()(): handler took 15ms
[141807][I][sdwifi.ino:836] operator()(): index:5511, max_len:4300, AQ:0
[141818][I][sdwifi.ino:877] operator()(): handler took 11ms
[141831][I][sdwifi.ino:836] operator()(): index:9811, max_len:2880, AQ:0
[141841][I][sdwifi.ino:877] operator()(): handler took 10ms
[141854][I][sdwifi.ino:836] operator()(): index:12691, max_len:4300, AQ:0
[141865][I][sdwifi.ino:877] operator()(): handler took 11ms
[141878][I][sdwifi.ino:836] operator()(): index:16991, max_len:2880, AQ:0
[141887][I][sdwifi.ino:877] operator()(): handler took 9ms
[141899][I][sdwifi.ino:836] operator()(): index:19871, max_len:4300, AQ:0
[141910][I][sdwifi.ino:877] operator()(): handler took 11ms
[141921][I][sdwifi.ino:836] operator()(): index:24171, max_len:2880, AQ:0
[141931][I][sdwifi.ino:877] operator()(): handler took 10ms
[141942][I][sdwifi.ino:836] operator()(): index:27051, max_len:4300, AQ:0
[141952][I][sdwifi.ino:877] operator()(): handler took 11ms
[141966][I][sdwifi.ino:836] operator()(): index:31351, max_len:4316, AQ:0
[141977][I][sdwifi.ino:877] operator()(): handler took 11ms
[141992][I][sdwifi.ino:836] operator()(): index:35667, max_len:5752, AQ:0
[142005][I][sdwifi.ino:877] operator()(): handler took 13ms
[142018][I][sdwifi.ino:836] operator()(): index:41419, max_len:5752, AQ:0
[142030][I][sdwifi.ino:877] operator()(): handler took 12ms
[142043][I][sdwifi.ino:836] operator()(): index:47171, max_len:5752, AQ:0
[142055][I][sdwifi.ino:877] operator()(): handler took 12ms
[142068][I][sdwifi.ino:836] operator()(): index:52923, max_len:5736, AQ:0
[142080][I][sdwifi.ino:397] umountSD(): In SD Card Unmount
[142086][I][sdwifi.ino:888] operator()(): handler took 18ms
[142112][I][sdwifi.ino:814] operator()(): LIST disconnect

So, out of the 481ms, really the handler ran for 155ms! I'm shocked.

mathieucarbou Dec 17, 2024
Maintainer

Wow! Nice improvement!

Is that similar to what was proposing @chipweinberger who opened espressif/esp-idf#10220 ?

He seems to also use f_readdir

Levak Dec 17, 2024
Author

Is that similar to what was proposing @chipweinberger who opened espressif/esp-idf#10220 ?

He seems to also use f_readdir

Yes, I only found about it when closing all my tabs :p
From my understanding, he did not go through with f_readdir in Arduino ESP32 core in the pull request that later added getNextFileName(). He probably is still using f_readdir in his project tho.

Furthermore, from my understanding, the IDF v5.4 may add caching to stat() but will still be slower (8 seconds in my case) than a directory traversal with f_readdir. Sure 40 sec -> 8 sec is good, but why not 150ms? :P

Levak Dec 17, 2024
Author

Would appreciate if you could test large files (a dozens of MiBs) downloads from an SD-card with a new code.

Did just now. I tried to download the same file of 41MB.

75 sec alone. ~4.5Mbps
10 times with 3 workers in parallel: 115 sec per file avg. ~10Mbps
10 times with 4 workers in parallel: 130 sec per file avg. ~12Mbps
=> BUT Sometimes, 9-10th gets "rejected" (no logs sorry).

# seq 10 | xargs -t -I % -P 4 time curl -s "http://192.168.1.27/download
?path=heavy/2.zip" --output /dev/null

vortigont · 2024-12-16T04:46:01Z

vortigont
Dec 16, 2024
Collaborator

thanks for the tests @mathieucarbou

The delay in my slow test is 3 sec, which is pausing the chunk process for a very long time.

yes, that is exactly what 'd=3000' in slow does - it stalls AsyncTCP task in user callback and does not allow any other connections to be served at all

I a wondering if there is a way to give a priority over serving slower requests...

all my changes so far to asynctcp was to try minimize the impact of long callbacks to the queue itself were pure math, so very cheap, costs no memory and does not do any intermediate mallocs. But in the end all does come to the same thing - if a single event takes the task for too long - all other events (i.e. connections) are blocked. Even if the q could reorder it somehow it will anyway stall eventually on slowest ones. The only way here is to create a pool of worker threads and somehow manage it. Won't be that easy and cheap and also won't work for single core boards at all.

Have some other idea in mind though...

1 reply

vortigont Dec 16, 2024
Collaborator

sharing other ideas in #173

vortigont · 2024-12-16T13:43:57Z

vortigont
Dec 16, 2024
Collaborator

I've tested downloading of files from SDCARD. Work perfectly.
5x15MiB files in parallel completed in less then 5 mins.
Looks like 5 files is a max limit from ffatfs.

autocannon -d 600 -t 600 -c 5 -M 1 http://192.168.1.29/15MiB.mp4
Running 600s test @ http://192.168.1.29/15MiB.mp4
5 connections

running [=========           ] 43%
┌─────────┬───────────┬───────────┬───────────┬───────────┬───────────┬───────────┬───────────┐
│ Stat    │ 2.5%      │ 50%       │ 97.5%     │ 99%       │ Avg       │ Stdev     │ Max       │
├─────────┼───────────┼───────────┼───────────┼───────────┼───────────┼───────────┼───────────┤
│ Latency │ 257208 ms │ 257441 ms │ 257563 ms │ 257563 ms │ 257407 ms │ 126.85 ms │ 257563 ms │
└─────────┴───────────┴───────────┴───────────┴───────────┴───────────┴───────────┴───────────┘
┌───────────┬─────┬──────┬─────┬───────┬────────┬─────────┬─────────┐
│ Stat      │ 1%  │ 2.5% │ 50% │ 97.5% │ Avg    │ Stdev   │ Min     │
├───────────┼─────┼──────┼─────┼───────┼────────┼─────────┼─────────┤
│ Req/Sec   │ 0   │ 0    │ 0   │ 0     │ 0.02   │ 0.32    │ 5       │
├───────────┼─────┼──────┼─────┼───────┼────────┼─────────┼─────────┤
│ Bytes/Sec │ 0 B │ 0 B  │ 0 B │ 0 B   │ 312 kB │ 4.99 MB │ 80.4 MB │
└───────────┴─────┴──────┴─────┴───────┴────────┴─────────┴─────────┘

Req/Bytes counts sampled once per second.
# of samples: 258

5 requests in 258.08s, 80.4 MB read

Two large files in parallel with multiple small reqs is also fine. Large files are not affecting small downloads at all.

30 requests in 3.02s, 3.81 kB read

1 reply

mathieucarbou Dec 16, 2024
Maintainer

Right, this is because the chunk callback executes fast.
I think the "slow" use case (d=1000) is worsen by the delay added in the callback, so it does not really represent the reality.

Probably that file listing is slower than downloading in terms of chunk callback execution time because sequential reading is faster than querying the file system.

Which means that @Levak should definitely think about moving the listing part outside of the async_tcp task, by using SSE or websocket, and only keep file download through the chunk mechanism.

vortigont · 2024-12-18T01:47:56Z

vortigont
Dec 18, 2024
Collaborator

that's a nice catch with f_readdir @Levak! I was really surprised to see fopen just to get file's size, etc... but never worked close with fatfs driver on esp, so can't comment this. Glad that you found a proper way. Linked lists could be dangerous things if traversed from anything than RAM :)

10 times with 3 workers in parallel: 115 sec per file avg. ~10Mbps
Wow! that is impressive! My board never goes above ~3-4 Mbps, dunno why, maybe it's an access-point device specific.
So we can consider file downloads as rock solid now. Great!

0 replies

Watchdog in AsyncTCP when sending very long chunked response #165

Levak Dec 2, 2024

Replies: 18 comments · 61 replies

mathieucarbou Dec 2, 2024 Maintainer

mathieucarbou Dec 9, 2024 Maintainer

Levak Dec 9, 2024 Author

mathieucarbou Dec 9, 2024 Maintainer

mathieucarbou Dec 9, 2024 Maintainer

mathieucarbou Dec 9, 2024 Maintainer

Levak Dec 9, 2024 Author

Levak Dec 10, 2024 Author

mathieucarbou Dec 10, 2024 Maintainer

Levak Dec 10, 2024 Author

mathieucarbou Dec 10, 2024 Maintainer

Levak Dec 10, 2024 Author

vortigont Dec 10, 2024 Collaborator

vortigont Dec 10, 2024 Collaborator

mathieucarbou Dec 10, 2024 Maintainer

vortigont Dec 10, 2024 Collaborator

mathieucarbou Dec 10, 2024 Maintainer

Levak Dec 10, 2024 Author

mathieucarbou Dec 10, 2024 Maintainer

mathieucarbou Dec 10, 2024 Maintainer

Levak Dec 10, 2024 Author

mathieucarbou Dec 10, 2024 Maintainer

mathieucarbou Dec 10, 2024 Maintainer

Levak Dec 10, 2024 Author

mathieucarbou Dec 10, 2024 Maintainer

Levak Dec 11, 2024 Author

mathieucarbou Dec 11, 2024 Maintainer

mathieucarbou Dec 11, 2024 Maintainer

Levak Dec 11, 2024 Author

mathieucarbou Dec 11, 2024 Maintainer

vortigont Dec 11, 2024 Collaborator

mathieucarbou Dec 11, 2024 Maintainer

mathieucarbou Dec 11, 2024 Maintainer

mathieucarbou Dec 11, 2024 Maintainer

mathieucarbou Dec 11, 2024 Maintainer

vortigont Dec 11, 2024 Collaborator

mathieucarbou Dec 11, 2024 Maintainer

Levak Dec 11, 2024 Author

mathieucarbou Dec 11, 2024 Maintainer

mathieucarbou Dec 11, 2024 Maintainer

mathieucarbou Dec 11, 2024 Maintainer

vortigont Dec 12, 2024 Collaborator

mathieucarbou Dec 12, 2024 Maintainer

vortigont Dec 13, 2024 Collaborator

mathieucarbou Dec 13, 2024 Maintainer

vortigont Dec 14, 2024 Collaborator

Levak Dec 15, 2024 Author

mathieucarbou Dec 15, 2024 Maintainer

vortigont Dec 15, 2024 Collaborator

mathieucarbou Dec 15, 2024 Maintainer

mathieucarbou Dec 15, 2024 Maintainer

vortigont Dec 16, 2024 Collaborator

vortigont Dec 16, 2024 Collaborator

Levak
Dec 2, 2024

Replies: 18 comments 61 replies

mathieucarbou
Dec 2, 2024
Maintainer

mathieucarbou Dec 9, 2024
Maintainer

Levak Dec 9, 2024
Author

mathieucarbou Dec 9, 2024
Maintainer

mathieucarbou Dec 9, 2024
Maintainer

mathieucarbou Dec 9, 2024
Maintainer

Levak
Dec 9, 2024
Author

Levak Dec 10, 2024
Author

mathieucarbou Dec 10, 2024
Maintainer

Levak Dec 10, 2024
Author

mathieucarbou Dec 10, 2024
Maintainer

Levak Dec 10, 2024
Author

vortigont
Dec 10, 2024
Collaborator

vortigont
Dec 10, 2024
Collaborator

mathieucarbou Dec 10, 2024
Maintainer

vortigont Dec 10, 2024
Collaborator

mathieucarbou Dec 10, 2024
Maintainer

Levak Dec 10, 2024
Author

mathieucarbou
Dec 10, 2024
Maintainer

mathieucarbou Dec 10, 2024
Maintainer

Levak
Dec 10, 2024
Author

mathieucarbou Dec 10, 2024
Maintainer

mathieucarbou Dec 10, 2024
Maintainer

Levak Dec 10, 2024
Author

mathieucarbou
Dec 10, 2024
Maintainer

Levak Dec 11, 2024
Author

mathieucarbou Dec 11, 2024
Maintainer

mathieucarbou Dec 11, 2024
Maintainer

Levak Dec 11, 2024
Author

mathieucarbou Dec 11, 2024
Maintainer

vortigont
Dec 11, 2024
Collaborator

mathieucarbou Dec 11, 2024
Maintainer

mathieucarbou Dec 11, 2024
Maintainer

mathieucarbou Dec 11, 2024
Maintainer

mathieucarbou
Dec 11, 2024
Maintainer

vortigont
Dec 11, 2024
Collaborator

mathieucarbou Dec 11, 2024
Maintainer

Levak Dec 11, 2024
Author

mathieucarbou Dec 11, 2024
Maintainer

mathieucarbou Dec 11, 2024
Maintainer

mathieucarbou
Dec 11, 2024
Maintainer

vortigont
Dec 12, 2024
Collaborator

mathieucarbou Dec 12, 2024
Maintainer

vortigont
Dec 13, 2024
Collaborator

mathieucarbou Dec 13, 2024
Maintainer

vortigont Dec 14, 2024
Collaborator

Levak Dec 15, 2024
Author

mathieucarbou Dec 15, 2024
Maintainer

vortigont
Dec 15, 2024
Collaborator

mathieucarbou Dec 15, 2024
Maintainer

mathieucarbou Dec 15, 2024
Maintainer

vortigont
Dec 16, 2024
Collaborator

vortigont Dec 16, 2024
Collaborator