Fix: closing system fd is thread unsafe by ysbaddaden · Pull Request #16289 · crystal-lang/crystal

ysbaddaden · 2025-10-28T16:47:20Z

This patch implements a reference-counted lock that protects IO objects depending on a reusable system fd (IO::FileDescriptor, File and Socket) against thread-safety issues around close:

- Thread 1 wants to read from fd 123;
- The OS preempts Thread 1;
- Thread 2 closes fd 123;
- Thread 2 opens something else and the OS reuses fd 123;
- The OS resumes Thread 1;
- Thread 1 reads from the reused fd 123!!!

The same issue arises for any operation that would mutate the fd: write, fchown, ftruncate, setsockopt, ... as they risk affecting a reused fd.
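The idea of counting in-flight operations can be sketched as follows. This is a minimal illustration and not the lock added by the patch: `GuardedFd`, its `release` callback and the mutex-based bookkeeping are all hypothetical names and choices made for the example. Here the last in-flight operation releases the fd once a close has been requested.

```crystal
# Hypothetical sketch of a reference-counted guard around a system fd.
# `GuardedFd` is an illustrative name, not the API introduced by the patch.
class GuardedFd
  def initialize(@fd : Int32, @release : Proc(Int32, Nil))
    @mutex = Mutex.new
    @refs = 0
    @closed = false
  end

  # Runs the block while holding a reference, so the fd cannot be released
  # (and then reused by the OS) in the middle of the operation.
  def reference(&)
    @mutex.synchronize do
      raise IO::Error.new("Closed stream") if @closed
      @refs += 1
    end
    begin
      yield @fd
    ensure
      release_now = @mutex.synchronize do
        @refs -= 1
        @closed && @refs == 0
      end
      @release.call(@fd) if release_now
    end
  end

  # Marks the fd as closed: new references fail, and the fd is released as
  # soon as the last in-flight operation has returned its reference.
  def close : Nil
    release_now = @mutex.synchronize do
      return if @closed
      @closed = true
      @refs == 0
    end
    @release.call(@fd) if release_now
  end
end

released = [] of Int32
guard = GuardedFd.new(123, ->(fd : Int32) { released << fd; nil })

guard.reference do |fd|
  guard.close # a close request arrives while the operation is in flight
  puts "fd #{fd} released during the operation? #{!released.empty?}"
end
puts "released afterwards: #{released}"
```

In this sketch the release is deferred until the block returns, so the operation never sees a reused fd; any later call to `reference` raises instead of touching fd 123.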

NOTE: The lock is currently implemented on the UNIX target only, but we might want to use it on every target. Go uses its fdMutex on every target.

Extracted from #16209 (follow-up with single reader/writer)
Depends on #16288 (EventLoop#shutdown)
Closes #16127
Obsoletes #16128


Only operations that can affect the file descriptor are counted, for example read or write, truncating a file or changing file permissions. Mere queries with no side effects go through normally because at worst they will fail (they would have anyway).

ysbaddaden · 2025-11-14T14:28:43Z

Rebased on master to remove #16288 that has been merged + its fixup (#16366).

straight-shoota · 2025-11-14T18:20:07Z

Is there a particular reason why we're rolling this out only on Unix targets instead of globally?
There is merit in making smaller increments, but that's a bit offset by the extra method overrides only for the Unix implementations (system_read & co).

ysbaddaden · 2025-11-17T12:46:45Z

Because the issue is on UNIX.

I can move it out of Crystal::System if we believe there's value for every target's IO::FileDescriptor and Socket.

straight-shoota · 2025-11-17T14:01:01Z

It seems useful to share the same implementation across platforms. Even if it's not strictly necessary on Windows, it's easier to maintain if we only have to worry about one mechanism.
That's assuming there are no grave downsides to using this on Windows? I presume there might be some performance implications, but closing doesn't seem like a very contested operation.

ysbaddaden · 2025-11-17T17:02:29Z

Close doesn't create a contention point. The problem is concurrent access to the same stdio, file or socket, because we must atomically increment the reference count. Many fibers frantically writing to STDOUT will see an impact.

The next step, having a single reader and a single writer (#16209), could be useful on Windows to replace the custom thread communication used to read asynchronously from the console: we could merely detach the current thread (#15871) while making sure only one thread is blocked. We could also use that on UNIX to replace the TTY hack (#16353).

straight-shoota · 2025-11-17T17:15:05Z

> Many fibers frantically writing to STDOUT will see an impact.

That probably produces a big jumble anyway, so it doesn't seem like a very relevant use case.

ysbaddaden · 2025-11-17T17:59:35Z

If you're careful to buffer your message and to fit within PIPE_BUF, then writing to a stdio stream is atomic (POSIX requirement). In practice it appears to be fine for files.

The tracing feature heavily relies on this.

In practice you don't need to write as frantically as printing every malloc or writing something every few microseconds, and using a channel + fiber (as Log does) will completely remove the contention.
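The channel + fiber approach mentioned above could look roughly like this. It is a sketch under assumptions, not how Log is implemented: `write_through_channel` and its parameters are hypothetical, and the channel capacity of 32 is arbitrary. Producer fibers only send complete lines over a channel; a single writer fiber is the only one that touches the output IO, so the fd lock is never contended.

```crystal
# Hypothetical helper: every producer fiber sends complete lines over a
# channel, and one writer fiber is the only fiber touching `output`.
def write_through_channel(output : IO, producers : Int32, lines : Int32) : Nil
  messages = Channel(String).new(32)
  acks = Channel(Nil).new
  drained = Channel(Nil).new

  # Single writer: drains the channel until it is closed and empty.
  spawn do
    while message = messages.receive?
      output.print(message)
    end
    output.flush
    drained.send(nil)
  end

  # Producers never write to `output` directly.
  producers.times do |i|
    spawn do
      lines.times { |n| messages.send("fiber #{i}: line #{n}\n") }
      acks.send(nil)
    end
  end

  producers.times { acks.receive } # wait for every producer to finish
  messages.close                   # lets the writer fiber leave its loop
  drained.receive                  # wait until everything has been written
end

write_through_channel(STDOUT, producers: 4, lines: 3)
```

Each message is written whole by the writer fiber, so lines from different producers do not interleave, whatever order they arrive in.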

ysbaddaden · 2025-11-17T19:47:07Z

Anyway: I'll move @fd_lock out of Crystal::System 👍

ysbaddaden · 2025-11-20T17:58:10Z

I started moving @fd_lock out of Crystal::System and I don't like it 😢

The explicit relationship between the lock and the fd, for example @fd_lock.reference { LibC.fsync(fd) }, is replaced with a blind lock because the wrapped method might implicitly reference fd, for example @fd_lock.reference { system_fsync }.

That looks bad and feels brittle.

ysbaddaden · 2025-11-20T17:59:28Z

I'd prefer to duplicate the behavior in Crystal::System for Windows to protect the handle, and that could come as a follow-up.

straight-shoota · 2025-11-20T18:08:47Z

There are already a number of indirect references, where the locked block delegates to the event loop, that I'm concerned about.
For example, the wrappers at the end of unix/socket.cr.

The complexity of delegation is already quite high between the public API, system implementations and event loop.
Would be great if there was any chance to simplify that somehow.

This is totally not a stopper, though. Maybe we figure out something later (probably not, though 🤷).

ysbaddaden · 2025-11-21T12:10:19Z

Tried again, and from the point of view of "protecting the system_ methods" it feels better.

I hit a blocker though: we must implement Crystal::EventLoop::IOCP#shutdown, otherwise the refcount won't be decremented and, at worst, the files might never be closed and fibers could get stuck.

It's easy for Socket, but IO::FileDescriptor is another story: we must keep track of the pending overlapped ops for every file and actively cancel them (they may be in any IOCP instance, possibly several of them). We must also be careful with the STDIN console hack, as well as the blocking read/write calls: can they be canceled?

~~As for~~ Like the io_uring event loop, I believe we'll want to wait for the follow-up that serializes reads and writes so there can be only one reader and one writer at most.

straight-shoota · 2025-11-21T12:12:37Z

> As for the io_uring event loop, I believe we'll want to wait for the follow-up that serializes reads and writes so there can be only one reader and one writer at most.

Would that make it simpler for IOCP as well?

ysbaddaden · 2025-11-21T12:59:39Z

Yes, this is what I meant.

This patch extends the fdlock to **serialize reads and writes** by extending the reference counted lock with a read lock and a write lock, so taking a reference and locking acts as a single operation instead of two (1. acquire/release the lock; 2. take/return a reference).

This avoids a race condition in the polling event loops:

- Fiber 1 then Fiber 2 try to read from `fd`;
- Since `fd` isn't ready, both fibers start waiting;
- When `fd` becomes ready then Fiber 1 is resumed;
- Fiber 1 doesn't read everything and _returns_;
- Since events are edge-triggered, Fiber 2 won't be resumed!!!

With the read lock, fiber 2 will wait on the lock then be resumed by fiber 1 when it returns.

A concrete example is multiple fibers waiting to accept on a socket where fiber 1 would keep handling connections, while fiber 2 sits idle.

The other benefit is that it can help to simplify the evloops that will now only deal with a single reader + single writer per `IO` and is required for the io_uring evloop (the MT version requires it).

**NOTE**: While this patch only serializes reads/writes on UNIX at the `Crystal::System`, which is where the bugs are, we will move it into stdlib for all targets in a follow-up. See #16289 (comment)
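The effect of a read lock can be illustrated from user code with a small wrapper. This is only a sketch of the idea, not the stdlib mechanism (which lives inside the fd lock itself): `SingleReader` is a hypothetical class, and a plain `Mutex` stands in for the read lock. At most one fiber reads at a time; a second fiber queues on the lock and is woken when the first one returns, instead of waiting for an fd event that may never fire again.

```crystal
# Hypothetical wrapper: at most one fiber reads from the wrapped IO at a
# time; other readers wait on the mutex rather than on the event loop.
class SingleReader
  def initialize(@io : IO)
    @read_lock = Mutex.new
  end

  def read(slice : Bytes) : Int32
    @read_lock.synchronize { @io.read(slice) }
  end
end

reader, writer = IO.pipe
single = SingleReader.new(reader)
chunks = Channel(String).new

# Two fibers each try to read 2 bytes from the same pipe.
2.times do
  spawn do
    buffer = Bytes.new(2)
    count = single.read(buffer)
    chunks.send(String.new(buffer[0, count]))
  end
end

# 4 bytes become available: the first reader consumes only part of them,
# and releasing the lock lets the second reader pick up the rest.
writer.write("abcd".to_slice)
writer.flush

received = [chunks.receive, chunks.receive].sort
puts received.inspect

writer.close
reader.close
```

Neither fiber is left idle: whichever fiber reads first takes part of the data, and the other one continues as soon as the lock is released.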

crysbot · 2026-01-18T13:17:23Z

This pull request has been mentioned on Crystal Forum. There might be relevant details there:

https://forum.crystal-lang.org/t/there-is-a-way-to-optimize-this-program/6947/64

ysbaddaden added the kind:bug, topic:stdlib:runtime, topic:multithreading and platform:unix labels Oct 28, 2025

github-project-automation bot added this to Multi-threading Oct 28, 2025

github-project-automation bot moved this to Review in Multi-threading Oct 28, 2025

This was referenced Oct 28, 2025

Ensure single reader and writer to system fd on Unix #16209

Merged

Extract Crystal::EventLoop#shutdown from #close #16288

Merged

ysbaddaden force-pushed the feature/add-crystal-fd-lock branch from a0cb837 to c309e3c October 30, 2025 17:27

Sija reviewed Oct 30, 2025

src/crystal/event_loop/wasi.cr (outdated)

ysbaddaden added a commit to ysbaddaden/crystal that referenced this pull request Oct 30, 2025

Fix: closing system fd is thread unsafe (crystal-lang#16289)

409e6b7

ysbaddaden mentioned this pull request Nov 14, 2025

Fix: duplicated #shutdown methods in WASI event loop [fixup #16288] #16366

Merged

ysbaddaden force-pushed the feature/add-crystal-fd-lock branch from 2c17971 to ef5d08d November 14, 2025 14:26

ysbaddaden added 2 commits November 14, 2025 15:28

ysbaddaden force-pushed the feature/add-crystal-fd-lock branch from ef5d08d to 3150772 November 14, 2025 14:28

straight-shoota approved these changes Nov 21, 2025

Merge branch 'master' into feature/add-crystal-fd-lock

66e332c

ysbaddaden moved this from Review to Approved in Multi-threading Nov 24, 2025

ysbaddaden added this to the 1.19.0 milestone Nov 24, 2025

straight-shoota merged commit bf90884 into crystal-lang:master Nov 25, 2025
49 checks passed

github-project-automation bot moved this from Approved to Done in Multi-threading Nov 25, 2025

ysbaddaden deleted the feature/add-crystal-fd-lock branch November 26, 2025 13:59

zw963 mentioned this pull request Jan 18, 2026

BUG: transfering fd=3 to another evloop with pending reader/writer fibers (RuntimeError) Since #15658

Closed

Blacksmoke16 linked an issue Jan 18, 2026 that may be closed by this pull request

BUG: transfering fd=3 to another evloop with pending reader/writer fibers (RuntimeError) Since #15658

Closed


4 participants
