Terraware device manager code

This code provides local on-site monitoring and control software for Terraformation hardware systems (seed banks, power generation, desalination, irrigation, etc.).

It runs as a service on the Terraware Balena stack.

This version of the README is heavily about the state I (Brian Sharp) am leaving the code in as I leave TF, so it's pretty ephemeral but seemed like the right place to leave this documentation.

Most Obvious Todos

The code hasn't actually been hooked up to use terraware-server and the auth stuff yet. I've only managed to test it in local sim mode for now. The API usage should be a drop-in replacement since local sim mode fully spoofs everything and it's a very small surface area. At this point I can probably get that code written, but testing it given I have 2 days left seems unrealistic since that means setting up the local terraware-server instance, populating the database with the config with SQL (which I don't really know very well) and so on.
The Chirpstack driver is currently disabled - to reenable it, search for "chirpstack" in device_manager.py to find two commented-out blocks (one at top and one at bottom) and then also add chirpstack_api back into requirements.txt. This is disabled because the chirpstack_api lib pulls in a ton of other stuff including grpcio, which takes a solid 45+ minutes to build on the Pi. The only use of the chirpstack_api stuff is for the protobuf-based message formats, but since you can configure LoRaWAN gateways to send JSON instead - and I believe the code as it stands is doing that - you can probably eliminate the chirpstack_api dependency entirely and just pull the payload out of the message json manually. I didn't do this because I don't have an active LoRaWAN gateway to test with, so I can't actually see what the incoming message format looks like.
The Chirpstack driver relies on the balena supervisor access to get the host machine IP address on the Pi; in the balena app you'll need to add the io.balena.features.supervisor-api: true label to the devices service - see the terrahass service, it already has it.
The only hardware I actually physically have to test against here is the Tempest weather station. I just ported this driver over from homeassistant, and when I ran it I was having trouble getting UDP packets from my weather station. It's using the exact same library as the homeassistant integration and the weatherflow.py driver is all of 75 lines of code. I assume it's just network port mapping stuff - homeassistant was setup to explicitly forward ports from the host OS whereas the device manager service is running in network_mode:host - but I haven't yet had time to dig into it deeply.
The other driver code has been refactored since the last deployed incarnation of the device manager, and I've tested it all in local_sim mode, but I haven't actually tested it all against real hardware, since I don't have that hardware and the code isn't ready to deploy. Since it works in local_sim mode and the refactoring was largely about handing the dev_info structure all the way down into the device constructors rather than having device_manager pull individual arguments and pass them in manually, I expect that for the most part it will just work, as the hardware interface code didn't really get touched. But it's very likely I made a few typos or logic errors pulling arguments out of the dev_info structure. Those bugs should be very straightforward to find and fix.

Configuration

Various local simulation options:

LOCAL_CONFIG_FILE_OVERRIDE: The filename, relative to the root of the project, to load. If you uncomment the line in the Dockerfile that copies sample-site.json over to the Pi, set this environment variable to sample-site.json - that file will be in the working directory for the devices service on the Pi. This will disable querying the config from the server, but it will not disable all other server interactions - the device manager will still query API tokens from Keycloak and send timeseries and timeseries values to the server. This is helpful for locally testing if you don't have a config populated in a database but you want to test all the other server interactions.
LOCAL_SIM: If set to true, the device manager will not contact the server at all for anything. Also, this setting is applied by default to all devices - so if it's set to True, by default all devices will run in a local simulation mode, too, where they will return fake data and not try to talk to real hardware. This can be overridden on a device-by-device basis by putting "local_sim":false in their "settings" json block in their config.

The following envvars are relevant whether running in local sim mode or production mode:

DIAGNOSTIC_MODE: Set this to true to enable verbose diagnostic printing.

And these variables must be set to run with a connection to terraware-server for querying config data and for uploading timeseries data:

OFFLINE_REFRESH_TOKEN: The Keycloak token used to request expiring API keys.
KEYCLOAK_API_CLIENT_ID: Set this to api for now (see above doc page).
ACCESS_TOKEN_REQUEST_URL: The full URL (including server and rest query string) for requesting the expiring access token (see above link.)
SERVER: The server address, e.g. 'https://localhost:4000/' for the terraware-server instance to load config from & push timeseries data to.
FACILITIES: The list of facility IDs this device manager instance represents. Should be just a comma-delimited list of ints, e.g. 1,3,18

The Terraware admin user interface has a "Generate New Offline Refresh Token" button on the device manager page that generates a new token and updates the corresponding environment variable in Balena.

Device Configuration

Refer to sample-site.json for a full example of configuring every supported sensor (including the currently-disabled-in-code chirpstack sensors.) There aren't enough drivers in there yet to really have a canonical split between "required" and "optional" - it's still a bit case-by-case. But we do have a split between formal parameters and "additional settings". In the sample-site.json the distinction is just "is it in the top-level device config dictionary, or is it in the nested 'settings' dictionary?" But on the terraware-server side, the significance is that the top-level ones can be formalized in the database schema, and then the 'settings' dictionary is a single JSON-valued field in the schema. So, it's harder to validate, but useful for very device-specific settings.

As of this writing, here's an exhaustive list of the parameters and what they're used for:

id (int): The globally-unique identifier for this device. Absolutely required, all drivers use this. It is half of the key for all timeseries data from the device.
name (string): The human-readable name of the device. This is required for all devices, but is used exclusively for diagnostic display.
type (string): One of 'ups', 'server', 'router', 'relay', 'sensor', 'hub'. As more drivers get added this list should probably be culled and formalized more, it's a little ad-hoc.
make (string) and model (string): The make and model of the device. Required for all devices, on principle. In practice they're primarily used by DeviceManager::get_device_class_to_instantiate and also by modbus devices to compose the spec filename they load to get their register layout.
address (string) and port (int): Address is used variously to mean an IP address or, usually for child devices off hubs (omnisense temp & humidity sensors, LoRa soil moisture sensors) it's some hex string unique ID specific to the hardware used to interpret incoming data packets. Port is used by fewer drivers but still common enough to be a first-class parameter.
parentId (int): The id of another device in the list (doesn't matter what order they appear in) that this device is chained off of. This is used for sensors that connect to 'hub' devices like the OmniSense gateway, LoRaWAN hubs, and so on. See below for more on that.
pollingInterval (int): How frequently, in seconds, to poll this device for values. If this value is omitted or set to 0, the device will never be polled. See below section for more on this.

Hubs, Child Devices, Polling Intervals

The parentId parameter in a device's configuration causes it to get bound to that device as a child (the hub device gets add_device called with the child device after the first construction pass.) Hubs then get a notify_all_devices_added when all their children are added to give them a chance to do stuff (ChirpStack and OmniSense wait for this to spawn their listening services so they don't throw away data for child sensors they haven't heard about yet.) Probably this is overkill since it's probably only a nanosecond between the hub's construction and when all its child devices are added, but that might change in the future if devices block on construction for some reason, so this seemed like the right move.

For hub/child relationships, right now the code is a little inconsistent: For OmniSense, the hub's poll returns all the timeseries values for all the child sensors, and so the hub device needs a valid pollingInterval value in its config, and the child sensors shouldn't be polled at all, but for ChirpStack, based on how the driver code was written (originally for homeassistant and I ported it over to the device manager) it was easier to have the child sensors return their data instead. There's no fundamental need to standardize one way or the other, but the inconsistency is bothersome. On the upside, it should be fine to just set all devices to be polled and some just never return any data.

Automations

Automations are used to automate responses to various device data conditions.

Each automation has the following fields:

id (int): The ID of the automation record.
facilityId (int): The ID of the facility associated with the automation.
name (string): A human-readable name of the automation. May be used in alert messages.
configuration (JSON): Various type-specific attributes of an automation.

The most commonly used automation is one that checks for bounds on a sensor value. It has the following configuration items:

type (string): For this automation, the value should be SensorBoundsAlert.
monitorDeviceId (int): The device ID of the device being monitored.
monitorTimeseriesName (string): The time series name (within the device) being monitored. (For example: "temperature")
lowerThreshold (float or null): The automation will send an alert if the sensor value is below this lower bound. If this threshold is null, no lower bound is in effect.
upperThreshold (float or null): The automation will send an alert if the sensor value is above this upper bound. If this threshold is null, no upper bound is in effect.
verbosity (int): Can be set above zero to enable diagnostic logging; ordinarily should be set to zero.

Balena Deployment

General setup:

If needed, create a new fleet and a new device.
Set the device's static IP to 192.168.2.2 as described here: https://github.com/terraware/balena/tree/main/static-ip
If needed, install the Balena console and run balena login

For local development:

Set the device to local mode in the Balena web interface.
Set the environment variables as needed in the Dockerfile; be sure not to check in this file!
Run balena push [ip address] where [ip address] is the Pi's local IP address (which you can obtain from the Balena web interface).

For deployments to a fleet:

Set these variables at the fleet level:
- SERVER
- ACCESS_TOKEN_REQUEST_URL
- KEYCLOAK_API_CLIENT_ID
Set these variables at the device level (unless using the provisioning system described below):
- OFFLINE_REFRESH_TOKEN
- FACILITIES
- DIAGNOSTIC_MODE
Run balena push [fleet name]

We typically use the following steps for deployment:

Run balena push device-manager-alpha
Check logs of device-manager-alpha systems in the Balena web interface
Run balena push device-manager-beta
Check logs of device-manager-beta systems in the Balena web interface
Run balena push device-manager-prod
Check logs of device-manager-prod systems in the Balena web interface

Local Testing without a Server

You can run the device manager without using a server:

Set the following environment variables:
- LOCAL_SITE_FILE_OVERRIDE: sample-site.json
- LOCAL_SIM: 1
- DIAGNOSTIC_MODE: 1
- FACILITIES: 0
Run python main.py

Bulk Provisioning

Prerequisites:

Python 3 installed locally
Balena CLI installed locally
balenaEtcher (or a similar program) installed locally
a local copy of this repo
a Balena account/login
an SD card reader/writer

Additional setup steps:

Download a disk image from the Balena web interface. (Go to a fleet and select "Add Device")
Run pip install balena-sdk pyyaml
In the terraware-devices directory, copy sample-provisioning.yaml to provisioning.yaml and edit it as needed. You will need a Balena API key (auth token), which can be obtained via the Preferences screen in the Balena web interface.

For each SD card, perform the following steps:

If not done already, run balena login.
Use balenaEtcher (or a similar program) to copy the disk image to a fresh SD card.
Run sudo python provisioning.py.
Write down the short code on the SD card container.

Note that the SD card configuration step (carried out by the provisioning script) does not appear to work under Windows. Let us know if you need an approach that works under Windows.

Modbus Testing

Use the Balena admin interface to open a terminal session in the main service.

If you'd like to ping a device run the following:

apt install iputils-ping
ping 192.168.2.50

If you'd like to check a modbus register, run this: python modtest.py -a 192.168.2.50 -r 70 -t uint16

Run python modtest.py -h for more options.

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
automations		automations
devices		devices
specs		specs
test		test
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
README.md		README.md
__init__.py		__init__.py
ble_scan.py		ble_scan.py
device_manager.py		device_manager.py
main.py		main.py
modtest.py		modtest.py
nut.conf		nut.conf
provisioning.py		provisioning.py
requirements.txt		requirements.txt
sample-cbw-weather.xml		sample-cbw-weather.xml
sample-provisioning.yaml		sample-provisioning.yaml
sample-site.json		sample-site.json
ups.conf		ups.conf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Terraware device manager code

Most Obvious Todos

Configuration

Device Configuration

Hubs, Child Devices, Polling Intervals

Automations

Balena Deployment

Local Testing without a Server

Bulk Provisioning

Modbus Testing

About

Releases

Packages

Contributors 4

Languages

License

terraware/terraware-devices

Folders and files

Latest commit

History

Repository files navigation

Terraware device manager code

Most Obvious Todos

Configuration

Device Configuration

Hubs, Child Devices, Polling Intervals

Automations

Balena Deployment

Local Testing without a Server

Bulk Provisioning

Modbus Testing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages