
Make transaction status service multi-threaded. #4032

Open · wants to merge 1 commit into master
Conversation

@fkouteib

Problem

As part of an investigation into Agave OOM issues in internal private cluster tests, I found that the transaction status service (TSS) receiver channel would get severely backed up (80k+ pending messages) when the cluster was running at a sustained 40k TPS (bench-tps workload; 80:20 FD-to-Agave node ratio). This slowed down the whole system and built up memory usage until the node OOMed (it crashed a 256 GB Agave node, and the Agave tile on a 512 GB FD node, in my tests). The issue reproduces more prominently when running with `--enable-rpc-transaction-history` and `--enable-extended-tx-metadata-storage`.

Summary of Changes

  • Make the transaction status receiver multi-threaded, running 4 worker threads (see the sketch below).
  • With this change, the queue depth stays in the 1k–5k pending-message range.
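
A minimal sketch of the pattern this PR describes, assuming a crossbeam channel feeding the service; the type, function, and thread names here are hypothetical stand-ins, not the actual Agave code:

```rust
use std::thread;
use crossbeam_channel::Receiver;

// Hypothetical stand-ins; the real types live in the transaction status service.
struct TransactionStatusMessage;
fn write_transaction_status_batch(_msg: TransactionStatusMessage) {
    // ... blocking write of the batch to the ledger store ...
}

const NUM_WORKERS: usize = 4;

fn spawn_status_workers(
    receiver: Receiver<TransactionStatusMessage>,
) -> Vec<thread::JoinHandle<()>> {
    (0..NUM_WORKERS)
        .map(|i| {
            // crossbeam channels are MPMC: cloning the receiver lets
            // multiple workers compete for messages from one queue.
            let receiver = receiver.clone();
            thread::Builder::new()
                .name(format!("solTxStatusWrt{i:02}"))
                .spawn(move || {
                    while let Ok(msg) = receiver.recv() {
                        write_transaction_status_batch(msg);
                    }
                })
                .expect("failed to spawn status worker")
        })
        .collect()
}
```

Because the workers share one receiver, the slow blocking writes proceed on 4 threads concurrently instead of serializing behind a single consumer, which is what lets the queue drain faster than it fills.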

Original issue:

The FD node failures are the Agave tile OOMing.
[Screenshot: 2024-12-09 at 21:46:32]

Improved state:

tiv1 and tiv2 are Agave nodes running the fix; the other nodes are running the same FD code as before.
[Screenshot: 2024-12-09 at 21:27:03]

Original code (without the transaction history flags):

[Screenshot: 2024-12-09 at 21:58:04]

@alessandrod

Thanks for looking at this! I haven't done a proper review yet, but skimming through the code, it looks like this would parallelize well using rayon instead?

@fkouteib (Author)

Thanks for the feedback, Alessandro. That makes sense; spinning off just the write_transaction_status_batch() call into a rayon task after a message is dequeued would be cleaner and achieve the same outcome. One follow-up, mostly because I'm not super familiar with how we manage this at large in Agave: I think we should do it with a private rayon thread pool that's still capped, rather than the global rayon pool, since I'm worried about introducing other perf variations and resource starvation by tapping the global pool. Is that what you have in mind?
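
A minimal sketch of the private-pool variant being proposed here, again with hypothetical stand-in names; the pool size and thread-name prefix are assumptions, not values from the PR:

```rust
use crossbeam_channel::Receiver;
use rayon::ThreadPoolBuilder;

// Hypothetical stand-ins for the real service types.
struct TransactionStatusMessage;
fn write_transaction_status_batch(_msg: TransactionStatusMessage) {
    // ... blocking write of the batch to the ledger store ...
}

fn run_receiver_loop(receiver: Receiver<TransactionStatusMessage>) {
    // Private pool capped at 4 threads, so the status writes can't
    // starve (or be starved by) other rayon users in the process.
    let pool = ThreadPoolBuilder::new()
        .num_threads(4)
        .thread_name(|i| format!("solTxStWriter{i:02}"))
        .build()
        .expect("failed to build private rayon pool");

    while let Ok(msg) = receiver.recv() {
        // Dequeue on this thread; fan the blocking write out to the
        // pool and immediately go back to draining the channel.
        pool.spawn(move || write_transaction_status_batch(msg));
    }
}
```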

@alessandrod

> Thanks for the feedback, Alessandro. That makes sense; spinning off just the write_transaction_status_batch() call into a rayon task after a message is dequeued would be cleaner and achieve the same outcome. One follow-up, mostly because I'm not super familiar with how we manage this at large in Agave: I think we should do it with a private rayon thread pool that's still capped, rather than the global rayon pool, since I'm worried about introducing other perf variations and resource starvation by tapping the global pool. Is that what you have in mind?

This is a tricky one, because the global rayon pool is actually 99.9% unused. But it does have a bajillion threads (num_cpus() I think?), so we should be careful not to crank it too hard. Between adding another pool and using the global one, I'd vote for using the global one, and then hopefully someone makes that pool smaller.
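
For comparison, the global-pool variant suggested here would drop the pool-building step entirely; this is a sketch with the same hypothetical stand-in types as above, not the actual Agave code:

```rust
use crossbeam_channel::Receiver;

struct TransactionStatusMessage; // hypothetical stand-in
fn write_transaction_status_batch(_msg: TransactionStatusMessage) {}

fn run_receiver_loop(receiver: Receiver<TransactionStatusMessage>) {
    while let Ok(msg) = receiver.recv() {
        // rayon::spawn queues the task on rayon's global pool: no extra
        // pool to manage, at the cost of sharing its threads with every
        // other global-pool user in the process.
        rayon::spawn(move || write_transaction_status_batch(msg));
    }
}
```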
