Pyfi fails when handling strings that are too long #3

Open
pswoodworth opened this issue Apr 17, 2018 · 1 comment


pswoodworth commented Apr 17, 2018

The Python JSON parser appears to run into issues with strings that are too long. Needs further investigation.

@pswoodworth pswoodworth changed the title Python appears to fail with strings that are too long inside JSON. Pyfi fails when handling strings that are too long Jun 14, 2018

pswoodworth commented Jun 14, 2018

On further investigation, this seems to be an issue with how the OS buffers stdout pipes. When the stdout buffer overflows, the OS flushes it to the pipe, splitting the data arbitrarily at the exact byte where the overflow occurs. The result is that the receiver breaks either when Python/JS tries to decode the bytes into a string or when parsing JSON from the string, depending on where the split lands (i.e., at a character boundary or in the middle of a multi-byte character).

There doesn't seem to be a straightforward way to increase the buffer size at the OS level, so we may be stuck with solving it in code.

Options there seem to be:

  1. Split long strings before piping them across, and somehow indicate that the string has been split. This has the advantage of being explicit, but the disadvantage that we'd essentially be guessing where the overflow occurs; i.e., if the buffer size changes across systems, or dynamically on a single system, we could still end up hosed.
  2. Implement our own buffering inside JS + Python. This has the advantage of working regardless of system conditions: the OS can flush the buffer whenever it wants, and our code can handle it. The disadvantage is that neither the sender nor the receiver knows for sure where a bytestring was or will be split, so the receiver essentially has to assume that data arrives in the order it was sent. That's a potentially brittle assumption, particularly given that we're multithreading Python operations.
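Option 2 could be made split-agnostic with message framing, so it no longer matters where the OS splits the stream. A minimal sketch of length-prefix framing on the receiving side (the `frame`/`FrameReader` names are hypothetical, not part of Pyfi, and this still assumes chunks from a single pipe arrive in order):

```python
import json
import struct

def frame(message: dict) -> bytes:
    """Encode a message as a 4-byte big-endian length prefix + JSON body."""
    body = json.dumps(message).encode("utf-8")
    return struct.pack(">I", len(body)) + body

class FrameReader:
    """Reassembles complete messages from arbitrarily-split byte chunks."""

    def __init__(self):
        self._buf = b""

    def feed(self, chunk: bytes) -> list:
        """Append a chunk; return any messages completed by it."""
        self._buf += chunk
        messages = []
        while len(self._buf) >= 4:
            (length,) = struct.unpack(">I", self._buf[:4])
            if len(self._buf) < 4 + length:
                break  # body not fully arrived yet; wait for more chunks
            body = self._buf[4:4 + length]
            self._buf = self._buf[4 + length:]
            messages.append(json.loads(body.decode("utf-8")))
        return messages
```

Because the length prefix tells the receiver exactly how many bytes belong to each message, a flush mid-character is harmless: the incomplete tail just sits in the buffer until the next chunk arrives. The in-order assumption holds per pipe, but interleaving writes from multiple Python threads onto one pipe would still need sender-side locking around each `frame()` write.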
