feat: add log1p function #273

thisisnic · 2022-08-02T13:15:44Z

This PR adds the log1p function.

extensions/functions_logarithmic.yaml

cpcloud · 2022-08-12T14:35:37Z

extensions/functions_logarithmic.yaml

+            options: [ NAN, ERROR ]
+            required: false
+          - value: fp32
+        return: fp32


I realize the surrounding functions don't do this but do these arguments need a nullability parameter?

Which arguments; as in - value: fp32? And by the same logic, would we also want a lot of the arithmetic functions to have a nullability parameter then? Or did you mean something else?

@cpcloud why? The default is MIRROR (see table here), which is correct here.

Does this signature handle the case of NULL inputs? It looks like it only accepts non-nullable inputs.

Yes, because MIRROR and DECLARED_OUTPUT are IMO weird special cases simply because the return type derivation expressions lack the generality to describe them. See https://substrait.io/expressions/scalar_functions/#nullability-handling. MIRROR and DECLARED_OUTPUT ignore the nullability flag of the arguments and accept any combination of both nullable and non-nullable arguments, and MIRROR ignores and overrides the nullability flag of the return type with the boolean or of all argument nullability flags.

FWIW, for the validator I'll be implementing a generalization of the type derivation grammar that uses type?? to specify "either nullable or non-nullable" in a type pattern, and type?parameter to bind the nullability to the name parameter, so variadics aside, you could describe MIRROR and DECLARED_OUTPUT more flexibly and explicitly if you want. I want to support that if only to avoid adding code for all the special cases, even if Substrait itself won't ever support that syntax.

ETA: heck, I could do away with the special cases even for variadics if I add a type?parameter? syntax for patterns, where parameter is bound to true iff any matched patterns were nullable :D But I realize that the syntax is arcane at best. Ultimately I just want to be able to round-trip the grammar with my IR for these things while being 100% compatible with Substrait grammar (for as far as that's possible at all due to all the conflicting descriptions in Substrait itself).

jvanstraten

The function definition looks good to me, though I'm not sure I would consider this function sufficiently primitive to include in core. After all, it's literally just (a probably optimized form of) log(add(literal(1), x)). Then again... the same could be said about all but one of log2, log10, natural log, and log with a specified base though, since they can all be rewritten to any other using the right identities. And then cos(x) is just sin(add(literal(pi/2), x)), and tan(x) is just divide(sin(x), cos(x)), and so on...

I don't know.

julianhyde · 2022-08-16T19:11:32Z

@jvanstraten For the record,log1p(x) is the same as log(1 + x) in math but not in floating point arithmetic. For values of x that are close to 0, log1p(x) will give a much more accurate result than log(x + 1). There is a good reason that many numeric libraries (e.g. NumPy and java.math.Math) have log1p and expm1.

jvanstraten · 2022-08-16T20:05:14Z

I didn't know about log1p specifically, but figured as much. There's lots of functions like that to trade off performance, accuracy, and stability, like inverse sqrt to name one. I'm just not sure where we should draw the line for Substrait core. A calculator doesn't have a dedicated button for log1p or invsqrt either, because when you're thinking at the abstraction level of doing abstract math those are pretty meaningless operations. I doubt SQL has it either for the same reason, but correct me if I'm wrong. You can make the case that Substrait explicitly operates on fp32 or fp64 though, not on abstract real numbers. So again, I don't feel strongly either way in this case. I'm just worried this kind of thing will explode without bound if we don't draw the line somewhere.

cpcloud reviewed Aug 11, 2022

View reviewed changes

extensions/functions_logarithmic.yaml Show resolved Hide resolved

thisisnic force-pushed the log1p branch from d1561ef to 0b03f00 Compare August 12, 2022 10:49

cpcloud reviewed Aug 12, 2022

View reviewed changes

jvanstraten previously approved these changes Aug 15, 2022

View reviewed changes

thisisnic added 2 commits September 1, 2022 14:55

feat: add log1p function

e19923c

docs: add names to log1p input args

77711aa

thisisnic dismissed jvanstraten’s stale review via 77711aa September 1, 2022 13:57

thisisnic force-pushed the log1p branch from 0b03f00 to 77711aa Compare September 1, 2022 13:57

jvanstraten approved these changes Sep 2, 2022

View reviewed changes

jvanstraten merged commit 55e8275 into substrait-io:main Sep 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add log1p function #273

feat: add log1p function #273

thisisnic commented Aug 2, 2022

cpcloud Aug 12, 2022

thisisnic Aug 15, 2022

jvanstraten Aug 15, 2022

cpcloud Aug 15, 2022

jvanstraten Aug 15, 2022 •

edited

Loading

jvanstraten Aug 15, 2022 •

edited

Loading

jvanstraten left a comment

julianhyde commented Aug 16, 2022

jvanstraten commented Aug 16, 2022

feat: add log1p function #273

feat: add log1p function #273

Conversation

thisisnic commented Aug 2, 2022

cpcloud Aug 12, 2022

Choose a reason for hiding this comment

thisisnic Aug 15, 2022

Choose a reason for hiding this comment

jvanstraten Aug 15, 2022

Choose a reason for hiding this comment

cpcloud Aug 15, 2022

Choose a reason for hiding this comment

jvanstraten Aug 15, 2022 • edited Loading

Choose a reason for hiding this comment

jvanstraten Aug 15, 2022 • edited Loading

Choose a reason for hiding this comment

jvanstraten left a comment

Choose a reason for hiding this comment

julianhyde commented Aug 16, 2022

jvanstraten commented Aug 16, 2022

jvanstraten Aug 15, 2022 •

edited

Loading

jvanstraten Aug 15, 2022 •

edited

Loading