The main goal of PyNWB is to enable users and developers to efficiently interact with the NWB data format, format files, and specifications. The following figures provide an overview of the high-level architecture of PyNWB and functionality of the various components.
We choose a modular design for PyNWB to enable flexibility and separate the various aspects of the NWB:N ecosystem (click to enlarge).
Overview of the main concepts/classes in PyNWB and their location in the overall software architecture (click to enlarge).
In memory objects
Interface for (most) applications
Like a table row
PyNWB has many of these -- one for each neurodata_type in the NWB schema. PyNWB organizes the containers into a set of modules based on their primary application (e.g., ophys for optophysiology):
Intermediary objects for I/O
Interface for I/O
Backend readers and writers must return and accept these
There are different kinds of builders for different base types:
- :py:class:`~hdmf.build.builders.GroupBuilder` - represents a collection of objects
- :py:class:`~hdmf.build.builders.DatasetBuilder` - represents data
- :py:class:`~hdmf.build.builders.LinkBuilder` - represents soft-links
- :py:class:`~hdmf.build.builders.ReferenceBuilder` - represents a reference to another group or dataset
Main Module: :py:mod:`hdmf.build.builders`
Interact with format specifications
Data structures to specify data types and what said types consist of
Python representation for YAML specifications
Interface for writing extensions or custom specification
There are several main specification classes:
- :py:class:`~pynwb.spec.NWBAttributeSpec` - specification for metadata
- :py:class:`~pynwb.spec.NWBGroupSpec` - specification for a collection of objects (i.e. subgroups, datasets, link)
- :py:class:`~pynwb.spec.NWBDatasetSpec` - specification for dataset (like and n-dimensional array). Specifies data type, dimensions, etc.
- :py:class:`~pynwb.spec.NWBLinkSpec` - specification for link (like a POSIX soft link)
- :py:class:`~hdmf.spec.spec.RefSpec` - specification for references (References are like links, but stored as data)
- :py:class:`~pynwb.spec.NWBDtypeSpec` - specification for compound data types. Used to build complex data type specification, e.g., to define tables (used only in :py:class:`~hdmf.spec.spec.DatasetSpec` and correspondingly :py:class:`~pynwb.spec.NWBDatasetSpec`)
Main Modules:
- :py:mod:`hdmf.spec` -- General specification classes.
- :py:mod:`pynwb.spec` -- NWB specification classes. (Most of these are specializations of the classes from :py:mod:`hdmf.spec`)
Note
A data_type
(or more specifically a neurodata_type
in the context of
NWB) defines a reusable type in a format specification that can be
referenced and used elsewhere in other specifications. The specification of
the NWB format is basically a collection of neurodata_types
, e.g.:
NWBFile
defines a GroupSpec for the top-level group of an NWB format
file which includes TimeSeries
, ElectrodeGroup
, ImagingPlane
and many other neurodata_types
. When creating a specification, two
main keys are used to include and define new neurodata_types
neurodata_type_inc
is used to include an existing type andneurodata_type_def
is used to define a new type
I.e, if both keys are defined then we create a new type that uses/inherits an existing type as a base.
Maintains the mapping between Container attributes and Spec components
ObjectMappers are constructed using a Spec
Ideally, one ObjectMapper for each data type
Things an ObjectMapper should do:
PyNWB has many of these -- one for each type in NWB schema
Main Module: :py:mod:`hdmf.build.objectmapper`
- NWB-specific ObjectMappers are locate in submodules of :py:class:`pynwb.io`
Relationship between Container, Builder, ObjectMapper, and Spec
Namespace
A namespace for specifications
Necessary for making extensions
Contains basic info about who created extensions
Core NWB:N schema has namespace “core”
Get from :py:class:`pynwb.spec.NWBNamespace`
- extension of generic Namespace class that will include core
:py:class:`~hdmf.spec.namespace.NamespaceCatalog` -- A class for managing namespaces
:py:class:`~hdmf.spec.write.NamespaceBuilder` -- A utility for building extensions
Map between data types, Container classes (i.e. a Python class object) and corresponding ObjectMapper classes
Constructed from a NamespaceCatalog
Things a TypeMap does:
- Given an NWB data type, return the associated Container class
- Given a Container class, return the associated ObjectMapper
PyNWB has two of these classes:
- the base class (i.e. :py:class:`~hdmf.build.manager.TypeMap`) - handles NWB 2.x
- :py:class:`pynwb.legacy.map.TypeMapLegacy` - handles NWB 1.x
PyNWB provides a “global” instance of TypeMap created at runtime
TypeMaps can be merged, which is useful when combining extensions
- Responsible for memoizing Builder and Container
- Constructed from a TypeMap
- PyNWB only has one of these: :py:class:`hdmf.build.manager.BuildManager`
Overview of BuildManager (and TypeMap) (click to enlarge).
Abstract base class for I/O
:py:class:`HDMFIO <hdmf.backends.io.HDMFIO>` has two key abstract methods:
- :py:meth:`~hdmf.backends.io.HDMFIO.write_builder` – given a builder, write data to storage format
- :py:meth:`~hdmf.backends.io.HDMFIO.read_builder` – given a handle to storage format, return builder representation
- Others: :py:meth:`~hdmf.backends.io.HDMFIO.open` and :py:meth:`~hdmf.backends.io.HDMFIO.close`
Constructed with a BuildManager
Extend this for creating a new I/O backend
PyNWB has one extension of this:
- :py:class:`~hdmf.backends.hdf5.h5tools.HDF5IO` - reading and writing HDF5
- :py:class:`~pynwb.NWBHDF5IO` - wrapper that pulls in core NWB specification
Overview of HDMFIO (click to enlarge).