Allow limiting Sieve :regex execution time #4767

ksmurchison · 2023-12-08T18:56:26Z

This will about from regexec(), and return an error code so that execution of the script will fail (message still delivered to INBOX)
The lmtpd will be terminated after the message has been delivered to all recipients.

elliefm

My setjmp(3) man page says this about calling longjmp and siglongjmp from a signal handler, like we're doing here.

POSIX.1-2008 Technical Corrigendum 2 adds longjmp() and siglongjmp() to the list of async-signal-safe functions. However, the standard recommends avoiding the use of these functions from signal handlers and goes on to point out that if these functions are called from a signal handler that interrupted a call to a non-async-signal-safe function (or some equivalent, such as the steps equivalent to exit(3) that occur upon a return from the initial call to main()), the behavior is undefined if the program subsequently makes a call to a non-async-signal-safe function. The only way of avoiding undefined behavior is to ensure one of the following:

• After long jumping from the signal handler, the program does not call any non-async-signal-safe functions and does not return from the initial call to main().

• Any signal whose handler performs a long jump must be blocked during every call to a non-async-signal-safe function and no non-async-signal-safe functions are called after returning from the initial call to main().

It's not clear to me whether regcomp is a non-async-signal-safe function, and it probably varies depending on which regex library Cyrus is linked against anyway. If we suppose that regcomp is non-async-signal-safe, then it might not be safe to allow lmtpd to finish processing the rest of the recipients after one of them trips a regex timeout -- especially if any of the subsequent recipients also have regex in their sieve, which would definitely call the non-async-signal-safe regcomp again, which this documentation explicitly calls out as undefined.

sieve/comparator.c

elliefm · 2023-12-12T23:12:43Z

https://datatracker.ietf.org/doc/html/rfc2033.html#section-5:

The server SHOULD send each reply as soon as possible. If it is
going to spend a nontrivial amount of time handling delivery for the
next recipient, it SHOULD flush any outgoing LMTP buffer, so the
reply may be quickly received by the client.

The client SHOULD process the replies as they come in, instead of
waiting for all of the replies to arrive before processing any of
them. If the connection closes after replies for some, but not all,
recipients have arrived, the client MUST process the replies that
arrived and treat the rest as temporary failures.

I think this means that, as long as lmtpd makes sure to prot_flush() after writing each recipient reply, then at any time it should be okay for it to exit early, and anything it didn't get to will just queue and retry later.

I don't remember the lmtpd architecture in detail though. At the time we're processing some user's sieve regex, have we already sent the replies for any earlier recipients? If we have, then in the case of regexec timeout, it should be fine to just send (and flush) the failed reply for this recipient, and then shut_down(). That would save us worrying about any undefined behaviour while processing the remaining recipients.

It still leaves open a possibility for a user with a pathological regex in their sieve to occasionally delay mail for other users (if those other users are the queued-and-retried later recipients). We might want to recommend that deployments that allow user-supplied sieve and have the "regex" extension enabled should configure their MTA to send one recipient per message, like FM does. Though I'm not sure where in the documentation this recommendation would best fit.

rsto · 2024-10-15T06:49:33Z

@ksmurchison This currently fails on CI and it seems unclear if this is the approach we'd should be taking. I have converted this back to draft.

ksmurchison added the Do Not Merge label Dec 8, 2023

ksmurchison requested review from rsto and elliefm December 8, 2023 18:56

ksmurchison force-pushed the sieve_regex_timeout branch from 414d6f7 to f5e0491 Compare December 9, 2023 12:48

elliefm reviewed Dec 11, 2023

View reviewed changes

sieve/comparator.c Outdated Show resolved Hide resolved

sieve/comparator.c Show resolved Hide resolved

ksmurchison force-pushed the sieve_regex_timeout branch from f5e0491 to efab067 Compare December 11, 2023 13:58

rsto removed their request for review August 20, 2024 06:13

Allow limiting Sieve :regex execution time

4939c0d

rjbs force-pushed the sieve_regex_timeout branch from efab067 to 4939c0d Compare September 18, 2024 19:05

rsto self-assigned this Oct 14, 2024

rsto marked this pull request as draft October 15, 2024 06:49

rsto removed their assignment Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow limiting Sieve :regex execution time #4767

Allow limiting Sieve :regex execution time #4767

ksmurchison commented Dec 8, 2023 •

edited

Loading

elliefm left a comment

elliefm commented Dec 12, 2023

rsto commented Oct 15, 2024

Allow limiting Sieve :regex execution time #4767

Are you sure you want to change the base?

Allow limiting Sieve :regex execution time #4767

Conversation

ksmurchison commented Dec 8, 2023 • edited Loading

elliefm left a comment

Choose a reason for hiding this comment

elliefm commented Dec 12, 2023

rsto commented Oct 15, 2024

ksmurchison commented Dec 8, 2023 •

edited

Loading