Fix deadlocks due to invalid handling of SSL sockets and select #504

curdbecker · 2022-01-08T20:41:35Z

SSL sockets can't be used with select() in the same way raw sockets would, since the OS socket readable/writable state is not always identical to the one of the SSL stack. See also:
https://docs.python.org/3/library/ssl.html#notes-on-non-blocking-sockets

For instance, websockify is currently unable to proxy a connection to a SSL target - unless the server-side sends data first.

In this case, the OS socket is reported as readable - due to SSL protocol data like sessions tickets being exchanged - while the SSL socket is not readable since application data hasn't been received from the server-side yet. As SSL sockets are read from (and written to) in a blocking fashion, the current implementation then simply stalls. For an example, see here:

Screen.Recording.2022-01-08.at.19.14.27.3.mov

The following patch tries to leave most of the existing socket handling logic intact while addressing the special handling required for SSL sockets. In particular, SSL sockets are now put in non-blocking mode, so that false-positive readable reports from select() don't lead to a deadlock but an exception. (Unfortunately, the SSLSocket.pending() API doesn't seem to work reliably to archive the same with blocking SSL sockets.. :-/)

In addition, the patch introduces an assertion to ensure the internal buffer size is equal or above 16kb so that all available data can be directly read from a SSL socket in a single call. Otherwise, there would be additional (annoying) handling required to make sure no data is accidentally left behind when a SSL socket would be only partially read. In this case, select() wouldn't report this socket as readable again - unless new data is received - since the rest of the data has already been read and buffered by the SSL socket layer.

CendioOssman

Thanks. I don't think this is a common use case, so this is probably been overlooked for some time.

This looks mostly correct, except for a few notes.

websockify/websocketproxy.py

CendioOssman · 2022-01-10T08:14:17Z

websockify/websocketproxy.py

+                    # Otherwise, there could be still data available for reading, but select
+                    # wouldn't report the socket as readable again, since all data has already
+                    # been read by the SSL socket from the OS.
+                    # see also: https://docs.python.org/3/library/ssl.html#notes-on-non-blocking-sockets


I don't think it guarantees to give you everything in a single recv(). For one thing, there might be multiple frames buffered. So you need to do what that link specifies and continue calling recv() until it is drained. You have SSLSocket.pending() to help you out.

Yes, it does. That's the good thing. Let me quote the documentation on OpenSSL's SSL_read():

The read functions work based on the SSL/TLS records. The data are received in records (with a maximum record size of 16kB). Only when a record has been completely received, can it be processed (decryption and check of integrity). Therefore, data that was not retrieved at the last read call can still be buffered inside the SSL layer and will be retrieved on the next read call. If num is higher than the number of bytes buffered then the read functions will return with the bytes buffered. If no more bytes are in the buffer, the read functions will trigger the processing of the next record. Only when the record has been received and processed completely will the read functions return reporting success. At most the contents of one record will be returned.

And unfortunately, SSLSocket.pending() doesn't seem to work at all on some platforms. For me, it always returns zero regardless if there's something pending or not.

The way I read that text it supports my initial assertion. If there are multiple records buffered then it states that only the first record will be returned. So you need to call it again.

CendioOssman requested changes Jan 10, 2022

View reviewed changes

Fix deadlocks due to invalid handling of SSL sockets and select

0cf2d9b

curdbecker force-pushed the master branch from 4322228 to 0cf2d9b Compare January 10, 2022 10:12

CendioOssman added the bug Something isn't working label Dec 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix deadlocks due to invalid handling of SSL sockets and select #504

Fix deadlocks due to invalid handling of SSL sockets and select #504

curdbecker commented Jan 8, 2022

CendioOssman left a comment

CendioOssman Jan 10, 2022

curdbecker Jan 10, 2022

CendioOssman Jan 27, 2022

Fix deadlocks due to invalid handling of SSL sockets and select #504

Are you sure you want to change the base?

Fix deadlocks due to invalid handling of SSL sockets and select #504

Conversation

curdbecker commented Jan 8, 2022

CendioOssman left a comment

Choose a reason for hiding this comment

CendioOssman Jan 10, 2022

Choose a reason for hiding this comment

curdbecker Jan 10, 2022

Choose a reason for hiding this comment

CendioOssman Jan 27, 2022

Choose a reason for hiding this comment