Trajectory optimisation #1946

corinnebosley · 2016-02-22T15:48:45Z

The changes in this module are to improve the performance of the trajectory interpolation routines for both linear and nearest neighbour schemes. The linear scheme now calls a cached linear interpolator (which improves performance by ~25%), and the nearest neighbour scheme calls a faster method (improving performance by ~62%).

There are a few points to make about these changes, firstly that the routines that are now employed by this module calculate a cube for every sample point before putting the data from that into a new trajectory cube, which I am sure takes up a lot of time. Secondly, one test (test_trajectory.py) fails after implementation of the scheme changes. There is a statement within this test to check whether the original cube data remains lazy, and it evidently does not when using extract_nearest_neighbour rather than _nearest_neighbour_indices_ndcoords.

I am open to advice or ideas about how to proceed with this problem. I am not fond of the idea of changing the test to fit the failure, so I could use a copy of the cube in the interpolation routine, but this seems a little convoluted.

… tests

pelson · 2016-02-22T16:03:28Z

lib/iris/analysis/trajectory.py

    cache = {}

+    # Cache the linear interpolator
+    scheme = iris.analysis.Linear()


Did you measure the impact of using iris.analysis.Nearest for nearest neighbour interpolation?

No I didn't. That's probably worth a go.

This works fine, but it's not as quick (more like a 45% improvement), and the test still fails.

I'm happy with a 45% improvement for the cost of consistency.

pelson · 2016-02-22T17:00:45Z

There is a statement within this test to check whether the original cube data remains lazy, and it evidently does not when using extract_nearest_neighbour rather than _nearest_neighbour_indices_ndcoords.

I don't think it is reasonable to expect an interpolation to keep the data lazy (at this moment in time). We've talked about adding such a capability (particularly for regrid), but it is beyond anything we've actually needed. In that sense, I'm ok with trajectory interpolate not being lazy (even though before, it was for nearest).

pelson · 2016-02-22T17:01:07Z

To be clear. It does constitute a major change, and I'd want that to be documented clearly.

corinnebosley · 2016-02-23T09:58:06Z

@pelson
I have updated the pull request to include the commit in which the nearest neighbour scheme is consistent with the format of the linear scheme.

I am still a little unclear about what to do regarding the failed test, though. Are you suggesting that I do change the test, or just ignore the failure? And what exactly did you want me to document?

pelson · 2016-02-23T10:01:44Z

Are you suggesting that I do change the test, or just ignore the failure?

Certainly the former. Update the test as part of this PR. Whoever merges it (me maybe) is accepting the loss of functionality (lazy interpolation) in exchange for improved performance.

And what exactly did you want me to document?

A "what's new": http://scitools.org.uk/iris/docs/latest/developers_guide/documenting/whats_new_contributions.html
Any mention of the fact that interpolate for trajectory is lazy (I'm not sure there will be any, but you will have read those docs more recently than me)

…edup

marqh · 2016-02-23T12:05:29Z

Are you suggesting that I do change the test, or just ignore the failure?

Certainly the former. Update the test as part of this PR. Whoever merges it (me maybe) is accepting the loss of functionality (lazy interpolation) in exchange for improved performance.

I agree

corinnebosley · 2016-02-23T17:21:13Z

After running the tests again, I have noticed a problem that occurs when interpolation is performed on hybrid-height cubes, so before this goes any further I will be doing some more investigations into where this originates from and what we can do about it.

corinnebosley · 2016-02-24T16:15:20Z

Also in the hybrid height function of the trajectory test is this:

    self.assertCML([cube, xsec], ('trajectory', 'hybrid_height.cml'))

which also fails. Naively, I expected the results of two different nearest neighbour routines to be the same. The closest point is the closest point, whatever, right?

No.

I have now discovered that the nearest neighbour interpolation routine which I have removed uses a dramatically different calculation sequence than the standard scheme. The original routine calculates the closest point spherically using loads of coordinate transformations and trigonometry and so on and so forth, whereas the routine I have changed it to uses a super-quick, 4-line function to numerically deduce the nearest neighbour. Hence, the 'nearest neighbours' in the two tests are different.

It occurs to me that I may have sacrificed a small degree of accuracy for a fairly reasonable increase in performance. I don't want to take this too lightly because it looks like somebody put really quite a lot of work into the original nearest neighbour routine, but it does sadly seem somewhat extravagant considering its purpose.

I am, again, open to advice and discussion regarding this issue.

corinnebosley · 2016-02-26T11:49:27Z

lib/iris/tests/test_trajectory.py

+        traj = (('grid_latitude', [20.499, 21.501, 22.501, 23.501]),
                ('grid_longitude', [31, 32, 33, 34]))
-        xsec = iris.analysis.trajectory.interpolate(cube, traj, method='nearest')



I have altered the numbers here slightly so that the test case is not so unstable. In the reference, the nearest neighbour scheme rounded the grid latitudes to [20, 22, 23, 24]. This should ensure that the standard nearest neighbour scheme yields the same result.

Although I've just noticed that it doesn't and I need to work out why.

rhattersley · 2016-04-26T08:41:43Z

Suggestions for how to progress:

Rescue the linear interpolation performance enhancement (in this PR?)
Update the docs for iris.analysis.trajectory.interpolate to make it clear that method='nearest' switches to a "spherical" mode and is loads slower. Also add a signpost to the faster (but more limited - it doesn't handle 2D coords which are useful for tri-polar grids) Nearest() scheme way of doing things.
Figure out the API for providing fast (i.e. non-spherical) nearest-neighbour interpolation for a trajectory. (In the context of rationalising all our interpolation, regridding, and indexing APIs.) How can a user choose between cartesian and spherical?
Do we need a transition API before (3) is ready? e.g. Add method='nearest_cartesian' to iris.analysis.trajectory.interpolate. (Possibly deprecated from the start if (3) suggests a preferred API.)

cc: @pp-mo

corinnebosley added 8 commits February 18, 2016 14:52

Optimised linear interpolation process in trajectory

c8c3151

Performance improvement by redirecting trajectory to Linear

99d19f1

Performance improvement by redirecting trajectory to Linear

2ee298f

Optimisation of trajectory interpolation

a580f66

Trajectory interpolation optimisation

820c120

Trajectory interpolation optimisation, working but with failed tests.

64a691a

Trajectory Interpo0lation Optimisation, working but still with failed…

d53d3e2

… tests

Trajectory interpolation optimisation

c731fa6

pelson reviewed Feb 22, 2016
View reviewed changes

More consistent trajectory interpolation optimisation

6397f75

Test changed and documentation added for trajectory interpolation spe…

f8fdf28

…edup

Test updated

0cfee0a

Altered test case for trajectory interpolate

2c04b03

corinnebosley reviewed Feb 26, 2016
View reviewed changes

corinnebosley mentioned this pull request Apr 29, 2016

Cache linear interpolator #1997

Closed

corinnebosley closed this Oct 24, 2017

corinnebosley deleted the trajectory_optimisation branch October 24, 2017 09:28

Trajectory optimisation #1946

Trajectory optimisation #1946

Uh oh!

Conversation

corinnebosley commented Feb 22, 2016

Uh oh!

pelson Feb 22, 2016

Choose a reason for hiding this comment

Uh oh!

corinnebosley Feb 22, 2016

Choose a reason for hiding this comment

Uh oh!

corinnebosley Feb 22, 2016

Choose a reason for hiding this comment

Uh oh!

pelson Feb 22, 2016

Choose a reason for hiding this comment

Uh oh!

pelson commented Feb 22, 2016

Uh oh!

pelson commented Feb 22, 2016

Uh oh!

corinnebosley commented Feb 23, 2016

Uh oh!

pelson commented Feb 23, 2016

Uh oh!

marqh commented Feb 23, 2016

Uh oh!

corinnebosley commented Feb 23, 2016

Uh oh!

corinnebosley commented Feb 24, 2016

Uh oh!

corinnebosley Feb 26, 2016

Choose a reason for hiding this comment

Uh oh!

corinnebosley Feb 26, 2016

Choose a reason for hiding this comment

Uh oh!

rhattersley commented Apr 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rhattersley commented Apr 26, 2016 •

edited

Loading