optimize min/max and related for floats

Someone on Discourse [recently posted](https://discourse.julialang.org/t/c-implementation-of-function-being-4-times-faster-even-absence-of-allocs/126495) about `findmax` on floats being 4x slower than the C version. We do a bunch of special handling that doesn't necessarily match what the fast native instructions do, but I did some poking around and I *think* we could do much better than we currently are:

- On ARM the `fmax` and `fmin` instructions already do the right thing (propagate NaNs), so we should make sure that we emit the simple code there.
- On x64, it seems like the native instructions actually happen to implement `isless` (all NaNs sort after all non-NaNs), and we should make sure that this is used where appropriate.
- For `maximum` and `findmax` this is actually already the right order! So we should make sure that we just use the native instruction there.
- For `minimum` and `findmin` this is not the right order, but we can just negate, take the max, and then negate again at the end and get the right answer.

Am I missing something here, or could we be doing much better?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

optimize min/max and related for floats #57647

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

optimize min/max and related for floats #57647

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions