biggus #900

rhattersley · 2013-12-20T13:11:21Z

This PR switches the handling of deferred data to biggus and gets rid of the data manager concept.

NB. To minimise the change, LazyArray is still used within AuxFactory and for deferred coordinate values.

shoyer · 2013-12-22T03:55:26Z

My favorite part about this PR is that it removes twice as many lines as it adds! 👍

Of course, biggus is a major dependency, but I think this removes significant complexity from Iris -- which is a major plus for people like me, who are excited about figuring out how to add new features.

shoyer · 2013-12-22T07:41:05Z

lib/iris/cube.py

Do you really want to switch from np.asarray to np.asanyarray? The advantage of asarray is that it only lets through direct instances of ndarray, which guarantees that there aren't any funny ndarray subclasses in use (like matrix), which might overload operators in unexpected ways. In my opinion, asarray is a safer way to go, because it means that people writing math operations on cubes know exactly what they're getting.

I used np.asanyarray because I wanted to let masked arrays through. But that could be handled with its own explicit clause.

Oh, okay. We usually use standard ndarrays using np.nan for masked values (because it's faster and more memory efficient), but I know Iris uses numpy's masked array module in many cases.
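
For readers following the asarray/asanyarray exchange above, here is a minimal sketch of the difference. The helper name `to_cube_data` and the `Tagged` subclass are hypothetical, invented for illustration. They are not Iris code, and the "explicit clause" shown is only one way the idea could be written.

```python
import numpy as np
import numpy.ma as ma

masked = ma.masked_array([1.0, 2.0, 3.0], mask=[False, True, False])

# np.asarray returns a base-class ndarray, so the mask is dropped.
plain = np.asarray(masked)

# np.asanyarray passes ndarray subclasses (including MaskedArray) through.
kept = np.asanyarray(masked)


class Tagged(np.ndarray):
    """Hypothetical ndarray subclass, standing in for e.g. np.matrix."""


def to_cube_data(data):
    """Hypothetical helper: let masked arrays through via an explicit
    clause, and coerce everything else to a plain ndarray."""
    if ma.isMaskedArray(data):
        return data
    return np.asarray(data)


# The NaN-as-missing alternative mentioned above, for float data only.
nan_filled = masked.filled(np.nan)
nan_mean = np.nanmean(nan_filled)
masked_mean = masked.mean()
```

With the explicit clause, masked arrays survive while other subclasses are reduced to plain ndarrays, which is the safety property argued for above.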

pelson · 2013-12-24T22:07:50Z

I've added several comments on top of b0034a6. Some are memory joggers or questions, others are review actions. Given the number of changes, please only append commits at this point.

In terms of getting this in, I wonder if we need more documentation of biggus (or at least an up-to-date pip install). What are your thoughts on that front?

Hope you have a good Christmas break.

esc24 · 2014-01-02T13:41:29Z

lib/iris/cube.py

I know you haven't changed this, but you've changed everything else so here goes. The beginning of this sentence now appears somewhat contradictory to the first paragraph. When would the object not contain phenomenon values? I'd also add that it can be a biggus array in here too.

I've tried to update this whilst keeping the benefits from #862. I hope @ajdawson and @jkettleb are happy with the result!

esc24 · 2014-01-02T17:09:12Z

I'm all done. This is very impressive work 👏 which should make the manipulation and aggregation of large data sets become so much simpler. @pelson raises a number of points that need addressing and I think I've spotted a few small ones too (in particular the fill_value and the logic in merge()), but this is close to being merged in my opinion.

rhattersley · 2014-01-23T16:40:17Z

I wonder if we need more documentation of biggus (or at least an up-to-date pip install). What are your thoughts on that front?

At a minimum I'd love to cut a new biggus version.

rhattersley · 2014-01-23T17:21:29Z

OK chaps. I've folded in the API experiments and a whole bunch of review feedback and cleaned up the net-null-change copyright nonsense. Sorry it's a fresh commit but it was all getting so complex I just wanted to get back to a comprehensible state.

rhattersley · 2014-01-24T13:50:28Z

I've pushed a new commit which avoids the need to define == on a biggus.NumpyArrayAdapter.

NB. It depends on SciTools/biggus#54, so it'll need updating if/when that makes it on to master.

rhattersley · 2014-01-24T15:58:05Z

(If anyone gets tempted to merge, perhaps it would be wise to tag biggus 0.3 first and update this PR to use that.)

bjlittle · 2014-01-27T15:19:09Z

lib/iris/cube.py

@rhattersley are you trying to avoid using self._data due to the old deferred data implementation? How's about self.__data instead?

Yes - I wanted to use a new attribute name to highlight any cases where code was being naughty and accessing the old private attribute. I'm certainly not fixated on the name _my_data but I don't want to use the double-underscore name-mangling.
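
As background to the naming discussion above, this is a small sketch of what Python's double-underscore name mangling does. The classes `Holder` and `SubHolder` are hypothetical and have nothing to do with the Cube implementation; only the attribute name `_my_data` is taken from the thread.

```python
class Holder:
    def __init__(self, data):
        # Inside the class body, __data is rewritten to _Holder__data.
        self.__data = data
        # A single leading underscore is stored under exactly this name.
        self._my_data = data


class SubHolder(Holder):
    def peek(self):
        # Here __data is rewritten to _SubHolder__data, which was never
        # set, so this lookup raises AttributeError.
        return self.__data


holder = Holder([1, 2, 3])
stored_names = sorted(vars(holder))
```

A single-underscore name stays reachable from subclasses and tests under the name that was written, which is one reason to prefer it when the goal is only to flag stale uses of an old attribute.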

bjlittle · 2014-01-28T11:10:11Z

lib/iris/cube.py

@rhattersley okay I'm splitting hairs here ...

There's been a couple of times now where I've come back to this method to remind myself what exactly has_data means. I guess I'm being all rather 👴, but for me a cube always has data. The real question is whether that data is lazy or concrete.

In test_cdm.py L923:928 you defined the convenience methods is_lazy and is_concrete, which make the testing crystal clear. Would you consider extending the Cube API to include such methods and removing has_data?

How about a single Cube.has_lazy_data() method?

Yup, works for me, and fits in quite neatly with the lazy_data method.

Nice. Do it! 😉

Will do! 🤘
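
A rough sketch of the agreed idea, under loud assumptions: `LazyStandIn` and `SketchCube` are hypothetical stand-ins written for this illustration. They are not the Iris Cube or a biggus array, and the real has_lazy_data() may be implemented differently.

```python
import numpy as np


class LazyStandIn:
    """Hypothetical deferred array: holds a loader and only calls it
    when the values are actually requested."""

    def __init__(self, shape, loader):
        self.shape = shape
        self._loader = loader

    def ndarray(self):
        return self._loader()


class SketchCube:
    """Hypothetical cube that always has data, lazy or concrete."""

    def __init__(self, data):
        self._my_data = data

    def has_lazy_data(self):
        # Data is treated as lazy until it is a real numpy array.
        return not isinstance(self._my_data, np.ndarray)

    @property
    def data(self):
        # First access realises the deferred data and caches the result.
        if self.has_lazy_data():
            self._my_data = self._my_data.ndarray()
        return self._my_data


load_calls = []


def load():
    load_calls.append(1)
    return np.zeros((2, 3))


cube = SketchCube(LazyStandIn((2, 3), load))
lazy_before = cube.has_lazy_data()
values = cube.data
lazy_after = cube.has_lazy_data()
```

The single predicate answers the question raised above (is the data lazy or concrete?) without implying that a cube can exist with no data at all.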

bjlittle · 2014-01-28T13:48:41Z

@rhattersley you can now update the .travis.yml to use biggus v0.3, which I've now tagged and pushed to PyPI.

rhattersley · 2014-01-28T13:51:54Z

update the .travis.yml to use biggus v0.3

Done.

bjlittle · 2014-01-28T14:02:09Z

👍 Top PR @rhattersley!

Here we go 😲 ... merge! (waiting on Travis)

rhattersley · 2014-01-28T15:02:26Z

(waiting on Travis)

Not any more. 😉

bjlittle · 2014-01-28T15:41:08Z

gulp ... what have I done 😉

rhattersley · 2014-01-28T15:42:13Z

🍻 😀

bblay · 2014-01-29T08:47:32Z

nice one

shoyer reviewed Dec 22, 2013

esc24 reviewed Jan 2, 2014

shoyer mentioned this pull request Jan 9, 2014

Cube indexing should use numpy views when possible #914

Closed

rhattersley mentioned this pull request Jan 21, 2014

Show when no masked elements in CML. #969

Merged

Switch to biggus for deferred loading.

9562ac9

rhattersley mentioned this pull request Jan 24, 2014

Array not hashable SciTools/biggus#53

Merged

Fix PPField equality

e8f0813

bjlittle reviewed Jan 27, 2014

Docstring for Cube.lazy_data()

dec3e5f

bjlittle reviewed Jan 28, 2014

Convert Cube.has_data() to Cube.has_lazy_data()

3595a93

Update travis to biggus v0.3

3ffaf1b

bjlittle added a commit that referenced this pull request Jan 28, 2014

Merge pull request #900 from rhattersley/biggus

5c46808

bjlittle merged commit 5c46808 into SciTools:master Jan 28, 2014

rhattersley deleted the biggus branch January 28, 2014 15:41

bjlittle mentioned this pull request Jan 29, 2014

Hybrid Pressure load and save support for PP #921

Merged

5 tasks

7 participants