Detach execution context scheduler from running thread during blocking syscall #15871
This changes the runtime behavior of threads: we no longer start a thread to run a specific scheduler run loop that never terminates in practice (except for the isolated context). Each thread now has its own inner loop that switches to a scheduler loop fiber (the scheduler's main fiber), then switches back to its inner loop (the thread's main fiber) to sleep for a while, and eventually terminates.

The benefit of the global thread pool is that threads are kept around instead of being created and thrown away. This is for example helpful for #15871, which will allow moving a scheduler to another thread, as well as for applications that regularly start an isolated fiber: they can keep reusing a pending thread instead of having to create one every time. Threads still eventually terminate after some configurable inactive time, except for the main thread, because we need to keep the main fiber's stack alive.

A future improvement could park MT threads back into the thread pool, instead of keeping them tied to the MT context. They could then be reused by any context that needs parallelism, or to boot a new isolated fiber or ST context, instead of sitting around.
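The inner loop described above could be sketched roughly as follows (a hedged sketch with invented names, not the PR's actual code):

```crystal
# Hypothetical sketch of a pool thread's inner loop (names are invented).
def thread_inner_loop(pool, keepalive : Time::Span)
  loop do
    if scheduler = pool.wait_for_scheduler(timeout: keepalive)
      # Switch to the scheduler's main fiber; it runs its run loop until
      # the scheduler terminates or is detached from this thread, then
      # switches back to this thread's main fiber.
      scheduler.main_fiber.resume
    else
      # No scheduler was attached within the keepalive delay:
      # let the thread terminate (except for the main thread).
      break
    end
  end
end
```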
Marks the scheduler as running a blocking syscall for the duration of the block. The monitor thread now ticks every 10ms to check whether any scheduler in any concurrent or parallel context is blocked on a syscall and, if so, tries to detach the scheduler from its thread. On success, the scheduler is moved to another thread, taken from the thread pool. The fiber doing the blocking syscall will still be blocked, but the other fibers can be resumed by the scheduler. When the blocking syscall returns, the thread tries to unmark the scheduler as running a blocking syscall. On success, the scheduler is still attached to the thread, so it simply continues. On failure, the scheduler has been moved to another thread, so the thread enqueues the current fiber into its execution context and returns itself to the thread pool.
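The mark/detach/unmark handshake can be modeled with a single atomic state per scheduler (an illustrative sketch with invented names, not the PR's actual implementation):

```crystal
# Illustrative model of the handshake (not the PR's code).
enum SyscallState
  None      # scheduler is running normally
  Syscall   # the thread is inside a (potentially) blocking syscall
  Detached  # SYSMON moved the scheduler to another thread
end

class SchedulerSketch
  @syscall_state = Atomic(SyscallState).new(SyscallState::None)

  # Called by the running thread around a potentially blocking syscall.
  def syscall(&)
    @syscall_state.set(SyscallState::Syscall) # atomic STORE
    ret = yield                               # the blocking syscall itself
    _, success = @syscall_state.compare_and_set(SyscallState::Syscall, SyscallState::None)
    unless success
      # SYSMON won the race: this scheduler now runs on another thread.
      # Re-enqueue the current fiber into the execution context and
      # return this thread to the pool (elided in this sketch).
    end
    ret
  end

  # Called by the monitor thread on each tick (every ~10ms).
  def try_detach? : Bool
    _, success = @syscall_state.compare_and_set(SyscallState::Syscall, SyscallState::Detached)
    # On success, move the scheduler to a thread taken from the pool.
    success
  end
end
```

This is where the "atomic STORE + atomic CAS per syscall" cost mentioned below comes from: the common case is that the syscall returns before SYSMON ticks, and the CAS succeeds.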
Rebased from master to bring #15885 along with a few fixes. Ready for review.
I finally got a trace in […]. The monitor thread detached the scheduler from the thread (it noticed it was doing a syscall) and the thread should now be a bare thread. We call […]. The thing is, the current thread is now a bare thread, so the condition […].

Theory A: […]

Theory B: the thread is being detached by the monitor at the same time the not-yet-bare thread tries to enqueue the fiber... and that's super dangerous: we must ALWAYS do a safe enqueue to the global queue 💣 💥 🤦
The thread may have long been detached, but it may also have just lost the atomic CAS against the monitor thread that hasn't detached it yet. Let's imagine the monitor thread gets preempted; the thread then checks that the current context + scheduler match, but is preempted before it can actually enqueue (ticking bomb); then the monitor moves the scheduler to another thread (oops); finally the thread tries a local enqueue (boom).
Theory B appears to have been correct. CI doesn't reproduce anymore.
…] (#16679) We must access `Errno.value` within the `Fiber.syscall(&)` block, not after the block has returned, because there can be a context switch before the method returns, and `errno` would no longer be valid.
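For illustration, this means capturing `errno` inside the block, along these lines (a hedged sketch; `path` and the returned tuple shape are invented for the example):

```crystal
# Sketch: errno must be read inside the syscall block. After the block
# returns there may have been a context switch, and errno could have
# been overwritten while another fiber was running.
fd, error = Fiber.syscall do
  fd = LibC.open(path, LibC::O_RDONLY)
  {fd, fd < 0 ? Errno.value : Errno::NONE}
end
```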
Some syscalls can block the current thread in certain circumstances, for example:

- `open(2)` when opening a FIFO, pipe or character device, until another end is connected (from another thread or process);
- `getaddrinfo(3)` until a DNS response (or error, or timeout) is received.

This patch introduces a mechanism to declare the scheduler as "doing a syscall", which the monitor thread (SYSMON) can detect on its next iteration; it will then try to move the scheduler to another thread, so that only the fiber doing the syscall is blocked, and the other fibers can be resumed.
Usually, the syscall terminates before the monitor thread notices (for example when opening a regular file), so the performance impact is an atomic STORE + an atomic CAS per syscall. At worst, a thread will be blocked for 10ms (the SYSMON frequency). For example, the updated FIFO-opening spec takes ~11ms to complete.
It works for the MT execution contexts and the ST context. It doesn't invalidate the ST guarantee that fibers in the context never run in parallel: the blocked fiber is blocked on a syscall and will be re-enqueued immediately after the syscall has completed; also, the syscalls don't invoke callbacks that would execute Crystal code, so AFAICT fibers still won't run in parallel (please correct me if I'm wrong).
NOTES
The isolated context expects to block, so the `#syscall(&)` method is a no-op there.

There are probably other blocking syscalls that we might want to consider. For example, reading from STDIN on Windows could be greatly simplified.
Another example is `flock`, which is currently retried every 100ms so that it doesn't block the current thread. We might want to be able to actively detach a scheduler when calling `#syscall(&)`, so we could try once (non-blocking), then on failure detach the scheduler and try again (blocking) without waiting for SYSMON to notice.

EXAMPLE
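The example code described in the next paragraph was not captured in this transcript; a hedged reconstruction might look like this (assuming the `Fiber.syscall(&)` API from this PR; the FIFO path and details are invented):

```crystal
# Hypothetical reconstruction, not the exact snippet from the PR.
spawn do
  loop do
    puts "tick"
    sleep 1.second
  end
end

# open(2) on a FIFO blocks until a writer connects. Wrapped in
# Fiber.syscall, SYSMON detaches the scheduler from this thread after
# ~10ms, so the ticking fiber keeps running despite the blocked thread.
fd = Fiber.syscall do
  LibC.open("my.fifo", LibC::O_RDONLY)
end
```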
The following example blocks the current thread, yet the spawned fiber keeps ticking every second. Remove the `Fiber.syscall` wrapper, and the fiber won't even start!

FOLLOW UP
We plan to use this in the future to rework and simplify use cases in the stdlib. For example:
- Polling event loops could support blocking file descriptors, so we could stop setting `O_NONBLOCK` on standard descriptors (shared), including pipes to spawned processes (Crystal's duplicated stdios are broken #16353).
- Same on Windows, where console streams don't support OVERLAPPED (Console streams are blocking on Windows #14576, Emulate non-blocking `STDIN` console on Windows #14947).

THREAD POOL
This PoC also introduces a pool of threads. It changes the behavior of threads: we don't start a thread to run a specific scheduler run loop; instead, each thread has its own inner loop that basically switches to a scheduler loop, then switches back to its inner loop to sleep.

The benefit of the global thread pool is that threads are kept around instead of being created and thrown away. If you regularly spawn an isolated fiber, it will likely keep reusing the same thread(s). Threads still eventually shut down after some inactive time (configurable), except for the main thread (we need to keep the main fiber alive).

A potential evolution would park MT threads into the thread pool, instead of keeping them tied to the MT context, so they can be reused by any context that needs parallelism, or to boot a new isolated fiber or ST context.

Extracted to #15885.
POTENTIAL ISSUES
I got one segfault in a GC call nested in a libxml2 callback in one early run of the std specs (with `-Dpreview_mt -Dexecution_context`), but I couldn't reproduce it after fixing different issues in the PR. Maybe it was a fluke (because of the bugs), or maybe it was just a regular MT issue with libxml2, or maybe SYSMON moved the scheduler from the main thread to another thread, then resumed a fiber doing something in libxml2, and the global thread-local state couldn't be found?

This is the already known MT issue we have with libxml2. What's new is that the segfault might start happening in a ST environment 😢
☝️ expected to have been fixed by #15899 (and #15906).