-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Open
Labels
bugneeds mcvehttps://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reportshttps://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reportstopic-dask
Description
What is your issue?
It's hard to tell whether this is a bug in xarray or dask, but since the async update for 2025.8.0, I have been getting many OSError: [Errno 24] Too many open files
errors when opening large zarr datasets. I don't have a minimal reproducible example readily available since I believe this only occurs for large files. My ulimit -n is set to 1024. Example traceback below:
Traceback
Traceback (most recent call last):
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/distributed/scheduler.py", line 4874, in update_graph
expr = deserialize(expr_ser.header, expr_ser.frames)
^^^^^^^
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/distributed/protocol/serialize.py", line 452, in deserialize
return loads(header, frames)
^^^^^^^^^^^^^^^^^
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/distributed/protocol/serialize.py", line 111, in pickle_loads
return pickle.loads(pik, buffers=buffers)
^^^^^^^^^^^^^^^^^
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/distributed/protocol/pickle.py", line 93, in loads
return pickle.loads(x, buffers=buffers)
^^^^^^^^^^^
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/zarr/storage.py", line 1835, in __setstate__
self.__init__(path=path, compression=compression, allowZip64=allowZip64, mode=mode)
^^^^^^^^^^^^^^^
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/zarr/storage.py", line 1823, in __init__
self.zf = zipfile.ZipFile(path, mode=mode, compression=compression, allowZip64=allowZip64)
^^^^^^^^^^^^^^^ File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/zipfile/__init__.py", line 1367, in __init__
self.fp = io.open(file, filemode)
^^^
OSError: [Errno 24] Too many open files: '/baril/scenario/derived/Portraits-Climatiques-v2/simulation/ESPO-G6-AHCCD_v1-patch/CMIP6/ScenarioMIP/Ouranos/MIRO
C6/ssp585/r1i1p1f1/IDM/indicators/QS-DEC/liquidprcptot/liquidprcptot_QS-DEC_ESPO-G6-AHCCD_v1-patch_CMIP6_ScenarioMIP_Ouranos_MIROC6_ssp585_r1i1p1f1_IDM_ind
icators_1950-2100.zarr.zip'
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/home/sgammon/projects/portraits/preparation/2_statistics.py", line 151, in <module>
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/xarray/core/dataset.py", line 2109, in to_netcdf
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/xarray/backends/api.py", line 2096, in to_netcdf
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/xarray/backends/common.py", line 413, in sync
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/xarray/namedarray/daskmanager.py", line 247, in store
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/dask/array/core.py", line 1227, in store
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/dask/base.py", line 681, in compute
File "/exec/sgammon/.conda/envs/portclim/lib/python3.13/site-packages/distributed/client.py", line 2417, in _gather
raise exception.with_traceback(traceback)
RuntimeError: Error during deserialization of the task graph. This frequently
occurs if the Scheduler and Client have different environments.
For more information, see
https://docs.dask.org/en/stable/deployment-considerations.html#consistent-software-environments
pingponghero12
Metadata
Metadata
Assignees
Labels
bugneeds mcvehttps://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reportshttps://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reportstopic-dask