Skip to content

Open and explore HDF5 files in JupyterLab. Can handle very large (TB) sized files, and datasets of any dimensionality

License

Notifications You must be signed in to change notification settings

jupyterlab/jupyterlab-hdf5

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

PyPI version npm_version

interactive api docs

jupyterlab-hdf5

Important Notice

jupyterlab-hdf5 is no longer actively maintained, and will not work with JupyterLab 4 or later. jupyterlab-h5web is the recommended replacement.

Overview

Open and explore HDF5 files in JupyterLab. Can handle very large (TB) sized files. New in release v0.5.0, jlab-hdf5 can now open datasets of any dimensionality, from 0 to 32. Any 0D, 1D, or 2D slab of any dataset can easily be selected and displayed using numpy-style index syntax.

hdf_preview

Double clicking on an .hdf5 file in the file browser will open it in a special HDF browser. You can then browse through the groups and open the datasets in the .hdf5 file. All datasets will open read only.

For the moment, the browser context menu does not work with .hdf5 files/groups/datasets. Only double clicking is currently supported.

Designed from the ground up to be as efficient as possible. Data will only be fetched as needed to create the visible display. This allows the extension to work with very large files (tested working up to the TB range).

Installation

pip install jupyterlab_hdf
jupyter labextension install @jupyterlab/hdf5

This will install both the server extension and the labextension needed by this plugin.

You can also install the labextension via Jupyterlab's extension manager GUI. Keep in mind that if you use the lab extension GUI, you'll still need to install the jupyterlab_hdf server extension via pip.

Compression filters

The extension supports all compression filters supported by h5py: https://docs.h5py.org/en/stable/high/dataset.html#filter-pipeline.

To enable support for additional filters such as blosc or bitshuffle, you need to install hdf5plugin in addition to the extension:

pip install hdf5plugin

Development

For a development install, clone the repository and then run the following in the repo dir:

pip install -e .[dev]
jlpm build:dev

To watch for/rebuild on changes to this extension's source code, run:

jlpm run build:watch

What's in this extension

This extension has two main parts: an hdf5 filebrowser plugin, and an hdf5 dataset file type plugin.

HDF5 Filebrowser

Allows you to navigate an .hdf5 file's groups as though they were directories in a filesystem. Any .hdf5 file on a user's system can be opened by entering its path (relative to the Jupyterlab home directory) in the box at the top of the browser.

Note on link resolution

HDF5 files can contain links that point to entities in the same file (soft links) or to entities in a different files (external links). By default, the extension does not resolve such links.

Link resolution must be enabled explicitly by setting the config field HdfConfig.resolve_links to True. For this, there are two possibilities:

  • Set the config field when launching JupyterLab:
jupyter lab --HdfConfig.resolve_links=True
c.HdfConfig.resolve_links = True

Note that this will only resolve valid links. Broken links (e.g. links to a non-existent entity) will still appear as links.

HDF5 dataset file type

When you open a dataset using the hdf5 filebrowser, a document will open that displays the contents of the dataset via a grid.