@jsignell (Contributor) commented Dec 15, 2025

This PR adds a fast path for `open_dataset`: it uses the token that is passed into `_maybe_chunk` as-is, instead of also folding the chunks into the token.
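To make the idea concrete, here is a minimal sketch of the two naming strategies. It is not the actual diff: `_hash`, `name_with_chunks`, and `name_fast` are hypothetical helpers, `hashlib` stands in for dask's `tokenize`, and the `xarray-` prefix and example values are only illustrative.

```python
import hashlib


def _hash(*parts):
    """Hypothetical stand-in for dask's tokenize: a deterministic hash of the inputs."""
    h = hashlib.sha1()
    for part in parts:
        h.update(repr(part).encode())
    return h.hexdigest()[:16]


def name_with_chunks(name, token, chunks, prefix="xarray-"):
    # Regular path (sketch): hash the incoming token together with the
    # chunks, so a different chunking yields a different array name.
    return f"{prefix}{name}-{_hash(token, str(chunks))}"


def name_fast(name, token, prefix="xarray-"):
    # Fast path (sketch): reuse the token that was passed in and skip
    # hashing the chunks entirely.
    return f"{prefix}{name}-{token}"


# Illustrative inputs only; a real token would come from the open_dataset call.
token = _hash("store.zarr", "some-mtime")
chunks = {"time": 100, "x": 50}

slow = name_with_chunks("temperature", token, chunks)
fast = name_fast("temperature", token)
```

In this sketch the fast name no longer varies with `chunks`, so it assumes the token passed in already identifies the call well enough on its own.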

Before:
image

After:
image

This PR shaves ~30 sec off the previous runtime for the dataset from the original issue. I was still seeing pretty intense memory consumption (17.14 GB) for this `open_dataset` call, though. That is not new; I just wanted to flag it.


Development

Successfully merging this pull request may close these issues.

opening a zarr dataset taking so much time with dask
