Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Broadcasts failing on ember after migration #22453

Open
julien-billaud opened this issue May 4, 2024 · 126 comments
Open

Broadcasts failing on ember after migration #22453

julien-billaud opened this issue May 4, 2024 · 126 comments
Labels
ember Issues related to ember driver problem Something isn't working

Comments

@julien-billaud
Copy link

What happened?

While I've never been facing any issues for more than a year with the Sonoff Dongle-e + ezsp driver, I've tried to change the driver to ember, but nothing is working (tried multiple time) but sometime losing all the devices, sometime they are still there but impossible to interact with them, and pairing is never working. (for now I returned to the ezsp driver).
I'm not noticing much error in the log (only the broadcast error reported here #22445)

I've tried the exact same configuration on a regular x86 computer running debian (using the same zigbee dongle) and didn't face any issue which seems to be a linked with the Raspberry pi 4

What did you expect to happen?

No response

How to reproduce it (minimal and precise)

switch from eszp to ember driver

Zigbee2MQTT version

1.37.0

Adapter firmware version

7.4.2.0 build 0

Adapter

Sonoff dongle-e

Setup

Raspberry pi 4 using docker image

Debug log

No response

@julien-billaud julien-billaud added the problem Something isn't working label May 4, 2024
@Nerivec Nerivec added the ember Issues related to ember driver label May 4, 2024
@Nerivec
Copy link
Collaborator

Nerivec commented May 4, 2024

Any chance you can downgrade to 7.4.1 and see if you still have those problems on the pi?

@fir3drag0n
Copy link

fir3drag0n commented May 4, 2024

Same problem with SLZB-06M

But I don't have a raspberry pi 4, host is a x86 machine, running unraid and zigbee2mqtt in docker.

@Nerivec
Copy link
Collaborator

Nerivec commented May 4, 2024

Grouping the mentioned broadcasting issue here guys (#22445, #22398)
@supaeasy @alainsch @Ricc68 @VladislavVesely @luqsq

I cannot reproduce this with my Dongle-E. I've tried various firmware, various ways to migrate from ezsp to ember (even bad ones 😅).
Can you guys think of something that may be different in your setup from a "regular setup"?

@raphael1688
Copy link

Same problem with SLZB-06M

But I don't have a raspberry pi 4, host is a x86 machine, running unraid and zigbee2mqtt in docker.

adapter: ember
rtscts: false

May need to add 'rtscts' below adapter setting.

@supaeasy
Copy link

supaeasy commented May 5, 2024

Can you guys think of something that may be different in your setup from a "regular setup"?

Two things: I recently installed https://www.zigbee2mqtt.io/devices/ZFP-1A-CH.html#siglis-zfp-1a-ch

Wich I think is not a very common router. Swiss market only and most likely not very popular. Initially I had problems with it. Also shortly after I installed it, my second Dongle-E that I use as a router had to re-pair and this was one of the first devices in my 2yo network that I never had any problems with.

Second: Shortly before my Router Dongle failed I set reporting interval of every lamp to 1-3 seconds because I didn't see lamps status change quickly enough (or at all) when pressing a HW button like the switches mentioned above. After the Dongle failed I reverted this to 1-30 s and had no problems since. But I did the reverting before I saw the error in logs.

Also I have to say: I don't recognize bigger problems or misbehavior. I just saw the error in the logs. The only real problem I have is that sometimes (not reproducible) some IKEA Bulbs are starting in maximum dimmed mode even though at least one of them is never dimmed manually.

@julien-billaud
Copy link
Author

julien-billaud commented May 5, 2024

Grouping the mentioned broadcasting issue here guys (#22445, #22398) @supaeasy @alainsch @Ricc68 @VladislavVesely @luqsq

I cannot reproduce this with my Dongle-E. I've tried various firmware, various ways to migrate from ezsp to ember (even bad ones 😅). Can you guys think of something that may be different in your setup from a "regular setup"?

As the dongle-e is working using a docker images on an x86 environnement I'm guessing there is no issue with the zigbee Dongle, so if I focus on some specifics configs, here is what's coming to my mind as part of the change that might be different than a regular installation :

  • RPI4 using a Argon One case
  • RPI4 is booting from an SSD which is plugged to the USB3 port just bellow the Zigbee dongle
  • 64 bit is enabled for that OS
  • The persistent data of the container are stored on an encrypted (Luks volume) which is being mounted on boot

everything else is quite standard in my opinion.

@alainsch
Copy link

alainsch commented May 5, 2024

Nothing special over here. Had 1.36 running with SLZD-06M running on zigbee FW 20231030. Everything was running OK with adapter: ezsp

Did the following steps:

  • upgraded addon to 1.37

  • received the "zh:ezsp: Deprecated driver 'ezsp' currently in use, 'ember' will become the..." messages

  • changed adapter: ezsp to adapter: ember and restarted

  • got an error that my coordinator was not on EZSP13

  • upgraded my coordinator firmware to FW 20240408

  • as adviced by SMLight, changed config in zigbee2mqtt to "adapter: ember" + "rtscts: false"

  • restarted zigbee2mqtt and zigbee network is working

  • now at startup I get the message "zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":0,"clusterId":19,"sourceEndpoint":0,"destinationEndpoint":0,"options":0,"groupId":0,"sequence":212} messageTag=255]"

  • pairing new entities does not work due to the same error

  • switching back to "adapter: ezsp" doesn't work either as I then get the error "zh:controller:greenpower: Received undefined command from '0'". another used already created a ticket for this.

So currently I'm in a state that my network is running, but I can't add any new devices.

Is there any more info we can provide?

@supaeasy
Copy link

supaeasy commented May 5, 2024

Oh I should have mentioned that I am running HAOS in a VM on Synology DSM 7.2.

Interference should not be a problem as my dongle is in a USB 2 port with a 2 m extension cable.

@alainsch
Copy link

alainsch commented May 5, 2024

My setup is HAOS running on a ODROID M1 with 8GB RAM and 512 GB SSD.

@fir3drag0n
Copy link

fir3drag0n commented May 5, 2024

Nothing special over here. Had 1.36 running with SLZD-06M running on zigbee FW 20231030. Everything was running OK with adapter: ezsp

Did the following steps:

  • upgraded addon to 1.37
  • received the "zh:ezsp: Deprecated driver 'ezsp' currently in use, 'ember' will become the..." messages
  • changed adapter: ezsp to adapter: ember and restarted
  • got an error that my coordinator was not on EZSP13
  • upgraded my coordinator firmware to FW 20240408
  • as adviced by SMLight, changed config in zigbee2mqtt to "adapter: ember" + "rtscts: false"
  • restarted zigbee2mqtt and zigbee network is working
  • now at startup I get the message "zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":0,"clusterId":19,"sourceEndpoint":0,"destinationEndpoint":0,"options":0,"groupId":0,"sequence":212} messageTag=255]"
  • pairing new entities does not work due to the same error
  • switching back to "adapter: ezsp" doesn't work either as I then get the error "zh:controller:greenpower: Received undefined command from '0'". another used already created a ticket for this.

So currently I'm in a state that my network is running, but I can't add any new devices.

Is there any more info we can provide?

Exactly the same behavior. Plus the problem that no new devices can't be paired with ember. But with ezsp I can add devices. In my case especially all my routers get disconnected.

@fir3drag0n
Copy link

I do have 4 mmwave presence sensors. Maybe these devices have an influence.

@alainsch
Copy link

alainsch commented May 5, 2024

Sorry, posted my follow-up on the wrong ticket...

These are the messages I see when I startup Zigbee2MQTT. Maybe they are related.

[2024-05-05 11:00:43] info: z2m: Logging to console, file (filename: log.log)
[2024-05-05 11:00:49] info: z2m: Starting Zigbee2MQTT version 1.37.0 (commit #unknown)
[2024-05-05 11:00:49] info: z2m: Starting zigbee-herdsman (0.45.0)
[2024-05-05 11:00:49] info: zh:ember: ======== Ember Adapter Starting ========
[2024-05-05 11:00:49] info: zh:ember:ezsp: ======== EZSP starting ========
[2024-05-05 11:00:49] info: zh:ember:uart:ash: ======== ASH NCP reset ========
[2024-05-05 11:00:49] info: zh:ember:uart:ash: Socket ready
[2024-05-05 11:00:49] info: zh:ember:uart:ash: ======== ASH starting ========
[2024-05-05 11:00:51] info: zh:ember:uart:ash: ======== ASH connected ========
[2024-05-05 11:00:51] info: zh:ember:uart:ash: ======== ASH started ========
[2024-05-05 11:00:51] info: zh:ember:ezsp: ======== EZSP started ========
[2024-05-05 11:00:51] warning: zh:ember: [EzspConfigId] Failed to SET "ADDRESS_TABLE_SIZE" TO "16" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 11:00:51] warning: zh:ember: [EzspConfigId] Failed to SET "APS_UNICAST_MESSAGE_COUNT" TO "32" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 11:00:51] warning: zh:ember: [EzspConfigId] Failed to SET "NEIGHBOR_TABLE_SIZE" TO "26" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 11:00:51] warning: zh:ember: [EzspConfigId] Failed to SET "SOURCE_ROUTE_TABLE_SIZE" TO "200" with status=ERROR_INVALID_VALUE. Firmware value will be used instead.
[2024-05-05 11:00:51] warning: zh:ember: [EzspConfigId] Failed to SET "MULTICAST_TABLE_SIZE" TO "16" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 11:00:51] info: zh:ember: [STACK STATUS] Network up.
[2024-05-05 11:00:51] info: zh:ember: [INIT TC] NCP network matches config.
[2024-05-05 11:00:51] info: zh:ember: [CONCENTRATOR] Started source route discovery. 1247ms until next broadcast.
[2024-05-05 11:00:51] info: z2m: zigbee-herdsman started (resumed)
[2024-05-05 11:00:51] info: z2m: Coordinator firmware version: '{"meta":{"build":0,"ezsp":13,"major":7,"minor":4,"patch":1,"revision":"7.4.1 [GA]","special":0,"type":170},"type":"EmberZNet"}'
[2024-05-05 11:00:51] info: z2m: Currently 12 devices are joined:
...

[2024-05-05 11:00:51] info: z2m: Zigbee: disabling joining new devices.
[2024-05-05 11:00:51] info: z2m: Connecting to MQTT server at mqtt://core-mosquitto:1883
[2024-05-05 11:00:52] info: z2m: Connected to MQTT server
[2024-05-05 11:00:52] info: z2m: Started frontend on port 8099
[2024-05-05 11:00:53] info: z2m: Zigbee2MQTT started!
[2024-05-05 11:01:11] error: zh:ember: Delivery of BROADCAST failed for "65532" [apsFrame={"profileId":0,"clusterId":31,"sourceEndpoint":0,"destinationEndpoint":0,"options":0,"groupId":0,"sequence":0} messageTag=255]
[2024-05-05 11:01:23] error: zh:ember: Delivery of BROADCAST failed for "65532" [apsFrame={"profileId":0,"clusterId":31,"sourceEndpoint":0,"destinationEndpoint":0,"options":0,"groupId":0,"sequence":0} messageTag=255]
[2024-05-05 11:01:33] error: zh:ember: Delivery of BROADCAST failed for "65532" [apsFrame={"profileId":0,"clusterId":31,"sourceEndpoint":0,"destinationEndpoint":0,"options":0,"groupId":0,"sequence":0} messageTag=255]

Whenever I try to start the pairing process, I see these messages:

[2024-05-05 11:03:28] info: z2m: Zigbee: allowing new devices to join.
[2024-05-05 11:03:28] info: zh:ember: [STACK STATUS] Network opened.
[2024-05-05 11:03:29] error: zh:ember: Delivery of BROADCAST failed for "65532" [apsFrame={"profileId":0,"clusterId":54,"sourceEndpoint":0,"destinationEndpoint":0,"options":256,"groupId":0,"sequence":240} messageTag=2]
[2024-05-05 11:03:29] error: zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":41440,"clusterId":33,"sourceEndpoint":242,"destinationEndpoint":242,"options":256,"groupId":0,"sequence":241} messageTag=3]

@fir3drag0n
Copy link

@alainsch I also had a discussion with @Nerivec at discord, because I also have the same stillsaying error message.

@alainsch
Copy link

alainsch commented May 5, 2024

Exactly the same behavior. Plus the problem that no new devices can't be paired with ember. But with ezsp I can add devices. In my case especially all my routers get disconnected.

Ah yes, and I wasn't aware it is related...

I have a SLZB-06M as coordinator (groundfloor) and a Sonoff Dongle-E flashed as router (first floor). Yesterday evening my Sonoff router got disconnected. It is while trying to pair it again that I found out I couldn't pair any devices.

I have a very small zigbee network (more a test setup here), so I have no other routers, only end devices.

@alainsch
Copy link

alainsch commented May 5, 2024

@alainsch I also had a discussion with @Nerivec at discord, because I also have the same stillsaying error message.

I'm pretty new to discord, I'll try to find the channel (?) so I can follow the discussion.

@fir3drag0n
Copy link

Exactly the same behavior. Plus the problem that no new devices can't be paired with ember. But with ezsp I can add devices. In my case especially all my routers get disconnected.

Ah yes, and I wasn't aware it is related...

I have a SLZB-06M as coordinator (groundfloor) and a Sonoff Dongle-E flashed as router (first floor). Yesterday evening my Sonoff router got disconnected. It is while trying to pair it again that I found out I couldn't pair any devices.

I have a very small zigbee network (more a test setup here), so I have no other routers, only end devices.

I already have nearly 70 devices...

@alainsch
Copy link

alainsch commented May 5, 2024

I already have nearly 70 devices...

Here at home, HA is a small setup (12 devices) I use mainly for testing. But in our vacation home, everything is controlled by HA and we have 51 zigbee and 33 ESPHome devices.

In this second setup, I also have the same SLZB-06M coordinator, but still on the older 20231030 firmware, where the adapter is still defined as 'adapter: ezsp'.

Since I ugraded to 1.37, I couldn't pair any new devices too, due to another error: "zh:controller:greenpower: Received undefined command from '0'"

And that setup is not a test setup :-(

@fir3drag0n
Copy link

@alainsch I also had a discussion with @Nerivec at discord, because I also have the same stillsaying error message.

I'm pretty new to discord, I'll try to find the channel (?) so I can follow the discussion.

In the development-branch channel. The similarity we both have is the same coordinator (I am at the dev Firmware right now). But maybe you can rather rule out the cause if you only have 12 devices in your setup.

@Ricc68
Copy link

Ricc68 commented May 5, 2024

Very very simple configuration here.

HAOS on qemu VM in low end x86-64 QNAP nas, resources 2 cpu+2 GB ram as suggested by HAOS setup guide.
I have seen a lot of ppl using VMs or arm devices: one common point may be low resources in terms of CPU power and/or RAM.

Back to the setup, I can report two setups:

  1. ZBDongle-E with fw 7.4.2, Z2M 1.37.0, ember driver. Only the ZBDongle-E is in the ZigBee network so it is only the coordinator. The broadcast errors happens. This may rule out the devices and spot the light on the coordinator.
  2. ZBDongle-E as above in above setup but with 2 Sonoff TRVZB valves added to the ZigBee network: same error continues to happen. But since it was happening with the coordinator alone as for setup 1, I would rule out the fact that I have added the 2 devices.

Anyway I see from other posts that the error is happening with a variety of devices and if I look at another common factor, all the variety of networks showing the error have -> a coordinator <- which again spots the light on the coordinator.

I see that @Nerivec is not able to reproduce the issue, and, needless to say, also Nerivec is working with a coordinator which should obviously rule out the coordinator itself (unless there is some elusive coordinator hardware common factor), maybe a good starting point for you would be to constrain the system on a low resource/slow host or a VM with limited resources to see what happens with the coordinator handling of Z2M.

Maybe another hint maybe found in the first post from @julien-billaud: "I've tried the exact same configuration on a regular x86 computer running debian (using the same zigbee dongle) and didn't face any issue which seems to be a linked with the Raspberry pi 4".

@alainsch
Copy link

alainsch commented May 5, 2024

OK, because my setup is a small setup mainly for test, I did the following steps:

  • removed zigbee2mqtt addon
  • removed the zigbee2mqtt folder from my config
  • re-installed zigbee2mqtt with the SLZB-06M and "adapter:ezsp"
  • startup and got the following messages

[12:01:03] INFO: Preparing to start...
[12:01:04] INFO: Socat not enabled
[12:01:10] INFO: Starting Zigbee2MQTT...
[2024-05-05 12:01:14] info: z2m: Logging to console, file (filename: log.log)
[2024-05-05 12:01:20] info: z2m: Starting Zigbee2MQTT version 1.37.0 (commit #unknown)
[2024-05-05 12:01:20] info: z2m: Starting zigbee-herdsman (0.45.0)
[2024-05-05 12:01:20] warning: zh:ezsp: Deprecated driver 'ezsp' currently in use, 'ember' will become the officially supported EmberZNet driver in next release. If using Zigbee2MQTT see #21462
[2024-05-05 12:01:24] info: zh:ezsp:driv: Leaving current network and forming new network
[2024-05-05 12:01:25] info: zh:ezsp:driv: Form network
[2024-05-05 12:01:26] info: zh:controller: Wrote coordinator backup to '/config/zigbee2mqtt/level_0/coordinator_backup.json'
[2024-05-05 12:01:26] info: z2m: zigbee-herdsman started (reset)
[2024-05-05 12:01:26] info: z2m: Coordinator firmware version: '{"meta":{"maintrel":"1 ","majorrel":"7","minorrel":"4","product":13,"revision":"7.4.1.0 build 0"},"type":"EZSP v13"}'
[2024-05-05 12:01:26] info: z2m: Currently 0 devices are joined:
[2024-05-05 12:01:26] info: z2m: Zigbee: disabling joining new devices.
[2024-05-05 12:01:27] info: z2m: Connecting to MQTT server at mqtt://core-mosquitto:1883
[2024-05-05 12:01:27] info: z2m: Connected to MQTT server
[2024-05-05 12:01:28] info: z2m: Started frontend on port 8099
[2024-05-05 12:01:28] info: z2m: Zigbee2MQTT started!

  • when I try to start pairing a device, I see

[2024-05-05 12:01:40] info: z2m: Zigbee: allowing new devices to join.
[2024-05-05 12:01:41] error: zh:controller:greenpower: Received undefined command from '0'
[2024-05-05 12:02:00] info: zh:controller: Interview for '0x00158d0008083d2a' started
[2024-05-05 12:02:00] info: z2m: Device '0x00158d0008083d2a' joined
[2024-05-05 12:02:00] info: z2m: Starting interview of '0x00158d0008083d2a'
[2024-05-05 12:02:11] info: zh:controller: Succesfully interviewed '0x00158d0008083d2a'
[2024-05-05 12:02:11] info: z2m: Successfully interviewed '0x00158d0008083d2a', device has successfully been paired
[2024-05-05 12:02:11] info: z2m: Device '0x00158d0008083d2a' is supported, identified as: Aqara Motion sensor (RTCGQ11LM)
[2024-05-05 12:02:11] info: z2m: Configuring '0x00158d0008083d2a'
[2024-05-05 12:02:11] info: z2m: Successfully configured '0x00158d0008083d2a'

  • so pairing is possible in "adapter:ezsp" mode. Removed the device...

[2024-05-05 12:02:19] info: z2m: Removing device '0x00158d0008083d2a' (block: false, force: true)
[2024-05-05 12:02:19] info: z2m: Successfully removed device '0x00158d0008083d2a' (block: false, force: true)

  • changed the config to "adapter: ember" and "rtscts: false" and restarted zigbee2mqtt

[12:06:41] INFO: Preparing to start...
[12:06:42] INFO: Socat not enabled
[12:06:48] INFO: Starting Zigbee2MQTT...
[2024-05-05 12:06:53] info: z2m: Logging to console, file (filename: log.log)
[2024-05-05 12:06:58] info: z2m: Starting Zigbee2MQTT version 1.37.0 (commit #unknown)
[2024-05-05 12:06:58] info: z2m: Starting zigbee-herdsman (0.45.0)
[2024-05-05 12:06:59] info: zh:ember: ======== Ember Adapter Starting ========
[2024-05-05 12:06:59] info: zh:ember:ezsp: ======== EZSP starting ========
[2024-05-05 12:06:59] info: zh:ember:uart:ash: ======== ASH NCP reset ========
[2024-05-05 12:06:59] info: zh:ember:uart:ash: Socket ready
[2024-05-05 12:06:59] info: zh:ember:uart:ash: ======== ASH starting ========
[2024-05-05 12:07:00] info: zh:ember:uart:ash: ======== ASH connected ========
[2024-05-05 12:07:00] info: zh:ember:uart:ash: ======== ASH started ========
[2024-05-05 12:07:00] info: zh:ember:ezsp: ======== EZSP started ========
[2024-05-05 12:07:00] warning: zh:ember: [EzspConfigId] Failed to SET "ADDRESS_TABLE_SIZE" TO "16" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 12:07:00] warning: zh:ember: [EzspConfigId] Failed to SET "APS_UNICAST_MESSAGE_COUNT" TO "32" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 12:07:00] warning: zh:ember: [EzspConfigId] Failed to SET "NEIGHBOR_TABLE_SIZE" TO "26" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 12:07:00] warning: zh:ember: [EzspConfigId] Failed to SET "SOURCE_ROUTE_TABLE_SIZE" TO "200" with status=ERROR_INVALID_VALUE. Firmware value will be used instead.
[2024-05-05 12:07:00] warning: zh:ember: [EzspConfigId] Failed to SET "MULTICAST_TABLE_SIZE" TO "16" with status=ERROR_OUT_OF_MEMORY. Firmware value will be used instead.
[2024-05-05 12:07:00] info: zh:ember: [STACK STATUS] Network up.
[2024-05-05 12:07:00] info: zh:ember: [INIT TC] NCP network matches config.
[2024-05-05 12:07:00] info: zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
[2024-05-05 12:07:01] info: z2m: zigbee-herdsman started (resumed)
[2024-05-05 12:07:01] info: z2m: Coordinator firmware version: '{"meta":{"build":0,"ezsp":13,"major":7,"minor":4,"patch":1,"revision":"7.4.1 [GA]","special":0,"type":170},"type":"EmberZNet"}'
[2024-05-05 12:07:01] info: z2m: Currently 0 devices are joined:
[2024-05-05 12:07:01] info: z2m: Zigbee: disabling joining new devices.
[2024-05-05 12:07:01] info: z2m: Connecting to MQTT server at mqtt://core-mosquitto:1883
[2024-05-05 12:07:01] info: z2m: Connected to MQTT server
[2024-05-05 12:07:02] info: z2m: Started frontend on port 8099
[2024-05-05 12:07:02] info: z2m: Zigbee2MQTT started!

  • when I try to pair the same aqara motion sensor...

[2024-05-05 12:07:40] info: z2m: Zigbee: allowing new devices to join.
[2024-05-05 12:07:40] info: zh:ember: [STACK STATUS] Network opened.
[2024-05-05 12:08:08] info: zh:controller: Interview for '0x00158d0008083d2a' started
[2024-05-05 12:08:08] info: z2m: Device '0x00158d0008083d2a' joined
[2024-05-05 12:08:09] info: z2m: Starting interview of '0x00158d0008083d2a'
[2024-05-05 12:08:11] warning: zh:ember: [ZDO] Node descriptor for "7769" reports device is only compliant to revision "pre-21" of the ZigBee specification (current revision: 23).
[2024-05-05 12:08:47] info: zh:controller: Succesfully interviewed '0x00158d0008083d2a'
[2024-05-05 12:08:47] info: z2m: Successfully interviewed '0x00158d0008083d2a', device has successfully been paired
[2024-05-05 12:08:47] info: z2m: Device '0x00158d0008083d2a' is supported, identified as: Aqara Motion sensor (RTCGQ11LM)
[2024-05-05 12:08:47] info: z2m: Configuring '0x00158d0008083d2a'
[2024-05-05 12:08:47] info: z2m: Successfully configured '0x00158d0008083d2a'

so pairing is working and I didn't get the broadcast error now, not while starting up and not while pairing.

So starting over with zigbee2mqtt solved it for me, but that is not possible for everyone I think :-)

@alainsch
Copy link

alainsch commented May 5, 2024

so pairing is working and I didn't get the broadcast error now, not while starting up and not while pairing.

So starting over with zigbee2mqtt solved it for me, but that is not possible for everyone I think :-)

No, not completly... after approx 5 minutes, pairing was again not possible. No errors, but the connection / interview didn't start. Tried to restart z2m and reboot the coordinator, nothing helps.

Downgraded the coordinator to the 20231030 FW (ESZP12) and switched back to "adapter: ezsp" and I still got the "error: zh:controller:greenpower: Received undefined command from '0' " messages, but pairing is possible again.

Will see in about 10 minutes...

@fir3drag0n
Copy link

fir3drag0n commented May 5, 2024

Very very simple configuration here.

HAOS on qemu VM in low end x86-64 QNAP nas, resources 2 cpu+2 GB ram as suggested by HAOS setup guide. I have seen a lot of ppl using VMs or arm devices: one common point may be low resources in terms of CPU power and/or RAM.

Back to the setup, I can report two setups:

  1. ZBDongle-E with fw 7.4.2, Z2M 1.37.0, ember driver. Only the ZBDongle-E is in the ZigBee network so it is only the coordinator. The broadcast errors happens. This may rule out the devices and spot the light on the coordinator.
  2. ZBDongle-E as above in above setup but with 2 Sonoff TRVZB valves added to the ZigBee network: same error continues to happen. But since it was happening with the coordinator alone as for setup 1, I would rule out the fact that I have added the 2 devices.

Anyway I see from other posts that the error is happening with a variety of devices and if I look at another common factor, all the variety of networks showing the error have -> a coordinator <- which again spots the light on the coordinator.

I see that @Nerivec is not able to reproduce the issue, and, needless to say, also Nerivec is working with a coordinator which should obviously rule out the coordinator itself (unless there is some elusive coordinator hardware common factor), maybe a good starting point for you would be to constrain the system on a low resource/slow host or a VM with limited resources to see what happens with the coordinator handling of Z2M.

Maybe another hint maybe found in the first post from @julien-billaud: "I've tried the exact same configuration on a regular x86 computer running debian (using the same zigbee dongle) and didn't face any issue which seems to be a linked with the Raspberry pi 4".

I do also have one Sonoff TRVZB.

And I also started fresh with one new zigbee2mqtt config and just the coordinator, and even at start the pairing/broadcast issue appeared immediately. I don't think that it is an issue with raspberry pi as I am using an x86 machine running a zigbee2mqtt container (docker).

I also observed that a coordinator reset sometimes helped. @Nerivec recommended to do a hard reset with my device (that includes pushing the physical reset button). This also helped me once starting without any issues, but after restarting again, I again suffered by those errors.

@Ricc68
Copy link

Ricc68 commented May 5, 2024

HAOS on qemu VM in low end x86-64 QNAP nas, resources 2 cpu+2 GB ram as suggested by HAOS setup guide. I have seen a lot of ppl using VMs or arm devices: one common point may be low resources in terms of CPU power and/or RAM.

I don't think that it is an issue with raspberry pi as I am using an x86 machine running a zigbee2mqtt container (docker).

Just to have a better understanding: what CPU/RAM is your x86 machine? Is it running what OS? Is it on bare metal or on a virtualization environment like Proxmox or other VM of any sort? I agree dockers are less demanding, but performance then is limited by the host so it would be useful to know what kind of host is running your docker and how loaded is your x86 system.

@fir3drag0n
Copy link

fir3drag0n commented May 5, 2024

It is a Intel® Core™ i3-9100 system with 64 GB RAM ECC.
It is running Unraid / NAS system with virtualization options (docker or vms).

@Nerivec
Copy link
Collaborator

Nerivec commented May 5, 2024

I have a low-resource VM that mimics the specs of an average PI 4 to run tests on stuff that I know affect performance. No issue there either. No failed broadcast without any device, nor with devices, and successfully paired & re-paired a dozen devices since it's been running for a couple of hours.

But just in case, you can try giving it some breathing room with the adapter_delay setting:

advanced:
  adapter_delay: 20

Default/min is 5, max is 60 (milliseconds). Note that at 60, you are likely to experience some delays when triggering devices rapidly.


PS: I created an issue in the firmware repo for the SLZB-06M and the failing config IDs. May or may not be related to the ensuing troubles, but we need to get to the bottom of it nonetheless. darkxst/silabs-firmware-builder#90

@Ricc68
Copy link

Ricc68 commented May 5, 2024

adapter_delay: 20

Added the adapter_delay option, no joy:

[2024-05-05 14:42:54] error: zh:ember: Delivery of BROADCAST failed for "65532" [apsFrame={"profileId":0,"clusterId":54,"sourceEndpoint":0,"destinationEndpoint":0,"options":256,"groupId":0,"sequence":170} messageTag=255]
[2024-05-05 14:42:55] error: zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":41440,"clusterId":33,"sourceEndpoint":242,"destinationEndpoint":242,"options":256,"groupId":0,"sequence":171} messageTag=1]
[2024-05-05 14:42:57] error: zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":0,"clusterId":19,"sourceEndpoint":0,"destinationEndpoint":0,"options":1024,"groupId":0,"sequence":53} messageTag=255]
[2024-05-05 14:44:07] error: zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":0,"clusterId":19,"sourceEndpoint":0,"destinationEndpoint":0,"options":1024,"groupId":0,"sequence":59} messageTag=255]

at startup of z2m.

@julien-billaud
Copy link
Author

I've been doing little more testing and figured out "what was wrong".
I've done the following tests :
Start on a fresh install for the pi4 and install the latest version of docker, all from an SD card (removing de SSD plugged on the USB3 port) only remaining plugged, the dongle-e on the second USB3 port.
Averything has been running perfectly fine with the ember driver.
From that fresh install, I then plugged the SSD on the USB3 port then it started to be way less responsive so I've rebooted the system and got the exact same "BROADCAST" errors and nothing was working.
Then, I've switched the dongle-e to one of the USB2.0 port and kept the SSD to one of the USB3 port then no more error.
last test, starting the PI4 from the SSD plugged to USB3.0 then the Dongle-e to USB2.0 and now everything is working fine with ember driver.

To conclude, it seems like the ember driver is for some reason little bit more sensitive (I know that using the Dongle without extension cord isn't ideal).
Hope it will help for those who are observing the same "BROADCAST" error after switching from ezsp to ember driver and/or what in that driver is leading to that strange behavior.

@supaeasy
Copy link

supaeasy commented May 5, 2024

Can't be my problem. USB2 Port with 2m extension cable.

@Dominator86
Copy link

Dominator86 commented Jun 19, 2024

After rebooting HA today my zigbee devices became unresponsive and "Delivery of BROADCAST failed" errors kept popping up.
The only way to "fix" it was reverting back to ezsp.
It's weird because I've been running ember for several weeks now and it has already survived a reboot without issues, so no idea why this time was different.
Hardware is a Sonoff Dongle-E with FW 7.4.2.0 and I'm using the Z2M HA addon if that matters.

@sblantipodi
Copy link

at this point is pretty evident that Ember is not ready for the release,
I don't get why they consider EZSP deprecated :)

@alainsch
Copy link

for my "production" environment, I'm switching back to EZSP 'till ember problems are fixed.

    public async start(): Promise<StartResult> {
        const logEzspDeprecated = (): void => {
            const message = `Deprecated driver 'ezsp' currently in use, 'ember' will become the officially supported EmberZNet ` +
                `driver in next release. If using Zigbee2MQTT see https://github.com/Koenkk/zigbee2mqtt/discussions/21462`;
            logger.warning(message, NS);
        };
        logEzspDeprecated();
        this.deprecatedTimer = setInterval(logEzspDeprecated, 60 * 60 * 1000); // Every 60 mins
        return this.driver.startup();
    }

Maybe in a next release you could set a higher value for the timer to call the logEzspDeprecated() method? A warning every hour is a bit overkill I think. Once a day or only on startup?

@voc0der
Copy link

voc0der commented Jun 25, 2024

SkyConnect blue here, just upgraded to 7.4.3.0 which went smoothly, and made rtscts: true work. (only worked with false prior). Thank you for the advice and help there, and thank you for the work so far on ember.

In short in my env, ember mostly works, and never crashes once it's up, but when you restart it (or parts of the stack), it almost always fails to start, citing this error, and can cause havoc with the depends_on in a workflow (HomeAssistant). A simple docker restart zigbee2mqtt fixes it. But it must be done each time.

Unfortunately, nothing has changed in the new firmware, it still will fail to start sometimes, and then restarting the docker container after it's failed will make it work.

This container probably has less issues booting with restart: always but that setting is not compatible with a sane docker deployment.

Maybe there can be an environment variable to let it try 3 times before giving up? At least then we can ignore this until you win at hide and seek?

@supaeasy
Copy link

My setup with Dongle E that has been really solid for quite some time has become close to unusable. I am having errors over errors, switching back and forth from ezsp to ember has produced errors in the likes of "duplicate device detected for 12345, kicking from network" - unfortunately these logs have been overwritten already so I don't know the exact wording. It is incomprehensible to me that it is apparently not possible to determine wich devices exactly are meant. Why not use the friendly name?
This made my network completely unstable, lights turn on on the lowest dim setting RANDOMLY and connections are subsequently lost completely as these kicks apparently occurred in routers. This ist really, really bad. I appreciate every effort from all the devs but the last weeks have been an utterly underwhelming experience for me. I am thinking about completely resetting and pairing everything again but I just don't want to lose all the long term statistics. Any Ideas?

@Nerivec
Copy link
Collaborator

Nerivec commented Jun 25, 2024

@voc0der What error do you mean is preventing start? This broadcast delivery failure shouldn't prevent "start".

@supaeasy You can reset the zigbee2mqtt installation completely (config + dabatase + adapter network), as long as you re-name devices the same after your re-pair them (so they end up with the same HA entity ID), HA should link back to the previous history. I did that a few months back without problem (assuming nothing's changed in HA logic since).
For this warning: An ID conflict was detected for network address "12345". Corresponding devices kicked from the network., 12345 is the device's address in decimal form (friendly names are only identified way higher up the ladder, not at the driver level, where this occurs). The same applies for most of Z2M's logs.
If you do re-pair all devices, based on your various errors, I'd go slow, making sure you can identify which is guilty if problems start appearing after a new one is added. I also suggest, before you start Z2M again, that you clear the NVM on your adapter, and re-flash the firmware to ensure a clean slate, then figure out the best channel, and set network configs to GENERATE to create a proper network.
You can DM me on Discord if you need some help.


I need someone that can consistently reproduce this Delivery of BROADCAST failed (the constant one that breaks permit join, among other things) and is able to run custom code (replace a few files in the zigbee2mqtt installation folder). If you can and have some time to spare (and a network that allows you to do this without too much troubles), please find me on Discord (zigbee2mqtt server, can DM me from there). Since we can't reproduce this, I need someone to report findings after running a few tests.

@voc0der
Copy link

voc0der commented Jun 26, 2024

Moved to other thread.

@supaeasy
Copy link

@Nerivec unfortunately my current schedule doesn't allow for intense testing. Just for reference: all the Problems started after I made the following changes:

  • Installed a Zigfred Plus and I think this could be the issue to look at first bc this seems related.
  • set the reporting intervals for on/off state of it and some bulbs to very short intervals (0-3s)
  • switched to ember
  • upgraded my dongle-e Firmware to 7.4.4 (but have reverted now to 7.4.2)

Unfortunately I made all of these changes in a very short time frame so it is impossible to tell which of these broke my network. Also I have a second Dongle-e with router Firmware and even though it seems to be working throughout these errors journey I have the feeling that only a power cycle of this device as well as resetting and re pairing the Zigfred solve my problems - at least temporarily. Had to do this twice since April now. Could the Dongle Router be too stressed? Are there known issues with Zigfred devices?

Symptoms:

  • IKEA Tradfri bulbs randomly turning on in lowest dimming setting, sometimes while becoming unresponsive too and very rarely even have to be re paired.
  • Devices dropping from network that will have to be re paired.
  • Network partly slow, partly very responsive
  • OTA not working properly ('error: z2m: Failed to check if update available for '[any device]' (OTA: Device didn't respond to OTA request')

Problem is that these issues seem to come creeping in, it is not all of a sudden everything broken.

Setup:
HA running on a VM on a Synology NAS

Zigbee devices:
lumi.sensor_magnet.aq2: 9

ZBT-CCTLight-M3500107: 6

lumi.weather: 5

TRADFRI bulb GU10 WS 345lm: 5

TRADFRI control outlet: 4

TRADFRIbulbE27WSglobeopal1055lm: 4

WB01: 3

lumi.vibration.aq1: 2

TRADFRIbulbE14WScandleopal470lm: 2

TRADFRI motion sensor: 2

lumi.sensor_wleak.aq1: 2

TS0044: 1

S26R2ZB: 1

DONGLE-E_R: 1

Remote Control N2: 1

TS0049: 1

TRADFRI on/off switch: 1

lumi.airmonitor.acn01: 1

zigfred plus: 1

@Nerivec
Copy link
Collaborator

Nerivec commented Jun 27, 2024

Very short intervals (reporting/availability/etc) and unstable devices/network definitely don't go well together. You may want to try reverting these to defaults one at a time, see if things start improving. Also, if you think something might be wrong with that Zigfred device, try removing it from Z2M, see if things improve after a few hours (give time for the network to adjust). A single bad device can make a whole mess on the network because of how Zigbee works... I don't know Zigfred at all however, so, can't say much about them.

@supaeasy
Copy link

supaeasy commented Jun 28, 2024

Okay, thank you. I have made the following changes now and hope this works out:

  • reset Zigfred to factory defaults and re-paired it.
  • ensured steady power supply to one router that would be otherwise shutdown from time to time (a plug that was connected to an auto power-off plug). I didnt consider this a problem because I had no use for it when it was offline but figured a router outage might stress the network and I could avoid this one easily. However I still have some bulbs that will go offline from time to time due to wife factor...
  • manually set ota to the default from github page because no updates were ever found (I didnt change anything before here) to
    ota: zigbee_ota_override_index_location: https://github.com/Koenkk/zigbee-OTA/raw/master/index.json

Everything seems to work fine for now. Will post updates if it goes down the drain again. The OTA thing however only changed that one update for an IKEA remote was found - nonetheless it cannot be applied because Device didn't respond to OTA request. Removing the Zigfred device from my network would be ultima ratio and a big, big bummer because that one is by far the most costly piece of my network and controls the heart of my home - with no real alternatives that I came across. Would also mean to hire an electrician to uninstall it and that would leave me with a useless 200 CHF device and the same in electrician bills so I really want this one to work... Can you think of anything that might corrupt the network that I can test? For example channel, WiFi interference, etc. I really dont want to play the try and error game but rather measure it somehow to point me to that error source.

One more question though: Is there another way to ensure on/off states are sent to the network immediately? I mean I absoultely don't need reports every x seconds, just state changes that are reliable. I set the reporting intervall to these short settings because when I trigger on/off state via the zigfred (or any other switch) the correct power state of a lamp does not show up directly in my HA dashboard. So I can never be sure lights are on/off when I am away. If i trigger the state via HA the status is displayed correctly, immediately and reliably.

@Nerivec
Copy link
Collaborator

Nerivec commented Jun 28, 2024

Lots of IKEA stuff is picky with OTA, need to keep them awake until they start updating, otherwise it fails (docs device picked at random, but procedure similar).
Argh, hard to remove the Zigfred from the network even temporarily then. Maybe the breaker it is on doesn't kill too much of the house and you could try that for a couple hours...? Not much else you can do to see if a particular device is messing with the network (unless you get very technical).
You can try scanning your environment with your adapter to see if anything might be interfering. I implemented the features in https://github.com/Nerivec/ember-zli (uses ember driver under the hood):

=> Scan network
  => Channels usage / RSSI (11-26)
  => Existing networks

Not sure why on/off would not be reported instantly. Do you have a screenshot of the reporting tab for your Zigfred (or a device that does this)?

@bjornsivertsen
Copy link

log.txt

Here is my log detailing broadcast failure, with the following details below:

image

@BMWK1
Copy link

BMWK1 commented Jun 30, 2024

Got the same broadcast 65533/65532 error every 3'30", after ember migration with Z2M 1.38.0 and ncp-uart-hw-v7.4.1.0-zbdonglee-115200.gbl or ncp-uart-hw-v7.4.3.0-zbdonglee-115200.gbl
came back to ezsp as long as the issues are not resolved....
Z2M dockerized on Synology DSM 7.2, Sonoff dongleE with a 3 meters USB cable

@supaeasy
Copy link

supaeasy commented Jul 1, 2024

Not sure why on/off would not be reported instantly. Do you have a screenshot of the reporting tab for your Zigfred (or a device that does this)?

@Nerivec well, I found out: "Minimum reporting change (reportable_change)" is poorly translated in German, I use the localized version of Z2M. It is translated to say "Minimale Report-Änderungen" which is not really comprehensible (it translates back to "Minimal change of reports", referencing to reports instead of what leads to cause reports). It should be "Minimale Veränderung um Berichte auszulösen". I am quite sure I didn't mess with this setting before but it was set to 0. I have reverted reporting-Intervals zu 60-3600 now and set Min. Reporting change to 1. Now everything is working as it should (except for one group of bulbs that has somewhat lazy reports (some 5-30s delay) but I can live with that and my debug protocol does not seem as flooded anymore. Will see if this is sustainable now. It does "feel" better.

@mirkochip88
Copy link

I have two instances of z2m, both on SLZB-06M. The main one with 40 devices is on ezsp, no problems. The secondary one is on Ember with 10 devices, and I can run all the tests you want if you tell me what to do.

On the ember istance, after migrating at version 1.37, I continue to receive this error:

[2024-07-02 21:49:53] error: zh:ember:uart:ash: Received ERROR from NCP, with code=ERROR_EXCEEDED_MAXIMUM_ACK_TIMEOUT_COUNT.
[2024-07-02 21:49:53] error: zh:ember:uart:ash: ASH disconnected | NCP status: ASH_NCP_FATAL_ERROR
[2024-07-02 21:49:53] error: zh:ember:uart:ash: Error while parsing received frame, status=ASH_NCP_FATAL_ERROR.
[2024-07-02 21:49:53] error: zh:ember: !!! NCP FATAL ERROR reason=HOST_FATAL_ERROR. ATTEMPTING RESET... !!!

In both version 1.37 and 1.38, the error appeared only upon restarting the Docker container and even prevented the instance from starting. However, I resolved it by shutting down Docker, replacing the state.json file (sometimes need to replace database and configuration) with one from a few hours earlier (I have a continuous incremental backup always active), at which point it would restart and work perfectly. With version 1.39, which I have been testing for a few hours, the problem occurs every 10-15 minutes, often resolving itself. But after a few times, I am forced to shut down the container and replace with an oldest state.json file.

@hairglass
Copy link

hairglass commented Jul 3, 2024

Всем жертвам ember)))
У меня sonoff dongle e и была та же проблема. Ничего из советов выше не помогло.
А помогло перепрошить стик обратно на 7.3.1 и переходите на ezsp.
Сейчас zigbee2mqtt 1.39.0-1 и всё прекрасно работает.
Вывод - проблема в прошивка стика

@supaeasy
Copy link

supaeasy commented Jul 3, 2024

Are you sure? 7.3.1 seems like a major step back to me (although I have no Idea whatsoever what to expect from the different versions because I can't seem to find any description or change log).

@hairglass
Copy link

Вам нужна стабильность или красивые цифры в прошивка?)))

@thecode
Copy link

thecode commented Jul 3, 2024

To all victims of ember))) I have sonoff dongle e and had the same problem. None of the tips above helped. And what helped was to reflash the stick back to 7.3.1 and switch to ezsp. Now zigbee2mqtt 1.39.0-1 and everything works fine. Conclusion - the problem is in the firmware of the stick

I don't have any problems with 7.4.1 and ezsp, the problem start when I switch to ember. I am not yet sure if the problem is in firmware or the ember driver used in Zigbee2MQTT

@hairglass
Copy link

To all victims of ember))) I have sonoff dongle e and had the same problem. None of the tips above helped. And what helped was to reflash the stick back to 7.3.1 and switch to ezsp. Now zigbee2mqtt 1.39.0-1 and everything works fine. Conclusion - the problem is in the firmware of the stick

I don't have any problems with 7.4.1 and ezsp, the problem start when I switch to ember. I am not yet sure if the problem is in firmware or the ember driver used in Zigbee2MQTT

Возможно зависит от оборудования.
Я использую mini pc szbox w6 intel n100.
Windows 11 pro.
И да, 7.4.1 прошивку не устанавливал. И устанавливать в ближайшее время не буду)))

@7floor
Copy link

7floor commented Jul 5, 2024

HASS OS on RPi 4
Z2M v 1.39.0 as an addon
Sonoff Dongle-E 7.4.3.0 no hw flow control
ember: tons of troubles, sometimes can't pair, also couldn't re-pair deleted device
ezsp: everything works stable like a charm!

So please, don't obsolete ezsp yet

@carbogit
Copy link

carbogit commented Jul 9, 2024

Standalone installation 1.39.0 commit: 0326926 with Sonoff E dongle fw 7.4.3.0 build 0 has been working fine on a Raspberry 4. After upgrading to Raspi 5, it started throwing errors 'zh:ember: Delivery of BROADCAST failed for "65533"...'
No errors after switching to EZSP.

The only change I made to the installation is apt upgrade for the following packages:
(..)

Edit: switched back to Pi 4 and configuration to ember; problems are gone. So the errors were either random or due to some hardware issue on the Pi 5.

@jusicgn
Copy link

jusicgn commented Jul 10, 2024

Jumping into this, adding my experience regarding SLZB-06M and ember:
SLZB-06M ist running on core firmware v2.3.6.
Tried different zigbee firmwares with no difference regading the bootup-issue:

z2m is running in a dedicated LXC on my proxmox server.
SLZB-06M is accesed via ethernet connection (wired).

Everytime the LXC with z2m restarts, z2m is not coming up, throwing this error:

Jul 09 18:58:02 z2m systemd[1]: Started zigbee2mqtt.service - zigbee2mqtt.
Jul 09 18:58:04 z2m npm[383]: > [email protected] start
Jul 09 18:58:04 z2m npm[383]: > node index.js
Jul 09 18:58:04 z2m npm[402]: Starting Zigbee2MQTT with watchdog (60000,300000,900000,1800000,3600000).
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember: Using default stack config.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: Socket ready
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: Received ERROR from NCP while connecting, with code=ERROR_EXCEEDED_MAXIMUM_ACK_TIMEOUT_COUNT.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: ASH disconnected | NCP status: ASH_NCP_FATAL_ERROR
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: Error while parsing received frame, status=ASH_NCP_FATAL_ERROR.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] warning:         zh:ember:uart:ash: Frame(s) in progress cancelled in [1ac1020b0a527e]
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: Received unexpected reset from NCP, with reason=RESET_SOFTWARE.
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: ASH disconnected: ASH_ERROR_NCP_RESET | NCP status: ASH_NCP_FATAL_ERROR
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: Error while parsing received frame, status=HOST_FATAL_ERROR.
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember: !!! NCP FATAL ERROR reason=HOST_FATAL_ERROR. ATTEMPTING RESET... !!!
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:queue: Request dispatching stopped; queue=0 priorityQueue=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash: ASH COUNTERS since last clear:
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Total frames: RX=2, TX=3
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Cancelled   : RX=1, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   DATA frames : RX=0, TX=1
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   DATA bytes  : RX=0, TX=4
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Retry frames: RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   ACK frames  : RX=0, TX=1
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   NAK frames  : RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   nRdy frames : RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   CRC errors      : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Comm errors     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Length < minimum: RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Length > maximum: RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad controls    : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad lengths     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad ACK numbers : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Out of buffers  : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Retry dupes     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Out of sequence : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   ACK timeouts    : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash: ======== ASH stopped ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:ezsp: ======== EZSP stopped ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember: ======== Ember Adapter Stopped ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: Socket ready
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [STACK STATUS] Network up.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [INIT TC] NCP network matches config.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:queue: Request dispatching started.

z2m is never coming up regardless how long I wait.

A simple systemctl restart zigbee2mqtt.service fixes the problem everytime.
With z2m coming up:

Jul 09 20:23:03 z2m npm[1493]: Starting Zigbee2MQTT with watchdog (60000,300000,900000,1800000,3600000).
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember: Using default stack config.
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: Socket ready
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [STACK STATUS] Network up.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [INIT TC] NCP network matches config.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember:queue: Request dispatching started.

Let me know if I can help with additional information :)

@jfamiens
Copy link

Jumping into this, adding my experience regarding SLZB-06M and ember: SLZB-06M ist running on core firmware v2.3.6. Tried different zigbee firmwares with no difference regading the bootup-issue:

z2m is running in a dedicated LXC on my proxmox server. SLZB-06M is accesed via ethernet connection (wired).

Everytime the LXC with z2m restarts, z2m is not coming up, throwing this error:

Jul 09 18:58:02 z2m systemd[1]: Started zigbee2mqtt.service - zigbee2mqtt.
Jul 09 18:58:04 z2m npm[383]: > [email protected] start
Jul 09 18:58:04 z2m npm[383]: > node index.js
Jul 09 18:58:04 z2m npm[402]: Starting Zigbee2MQTT with watchdog (60000,300000,900000,1800000,3600000).
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember: Using default stack config.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: Socket ready
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: Received ERROR from NCP while connecting, with code=ERROR_EXCEEDED_MAXIMUM_ACK_TIMEOUT_COUNT.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: ASH disconnected | NCP status: ASH_NCP_FATAL_ERROR
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: Error while parsing received frame, status=ASH_NCP_FATAL_ERROR.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] warning:         zh:ember:uart:ash: Frame(s) in progress cancelled in [1ac1020b0a527e]
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: Received unexpected reset from NCP, with reason=RESET_SOFTWARE.
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: ASH disconnected: ASH_ERROR_NCP_RESET | NCP status: ASH_NCP_FATAL_ERROR
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: Error while parsing received frame, status=HOST_FATAL_ERROR.
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember: !!! NCP FATAL ERROR reason=HOST_FATAL_ERROR. ATTEMPTING RESET... !!!
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:queue: Request dispatching stopped; queue=0 priorityQueue=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash: ASH COUNTERS since last clear:
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Total frames: RX=2, TX=3
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Cancelled   : RX=1, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   DATA frames : RX=0, TX=1
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   DATA bytes  : RX=0, TX=4
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Retry frames: RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   ACK frames  : RX=0, TX=1
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   NAK frames  : RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   nRdy frames : RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   CRC errors      : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Comm errors     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Length < minimum: RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Length > maximum: RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad controls    : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad lengths     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad ACK numbers : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Out of buffers  : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Retry dupes     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Out of sequence : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   ACK timeouts    : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash: ======== ASH stopped ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:ezsp: ======== EZSP stopped ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember: ======== Ember Adapter Stopped ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: Socket ready
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [STACK STATUS] Network up.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [INIT TC] NCP network matches config.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:queue: Request dispatching started.

z2m is never coming up regardless how long I wait.

A simple systemctl restart zigbee2mqtt.service fixes the problem everytime. With z2m coming up:

Jul 09 20:23:03 z2m npm[1493]: Starting Zigbee2MQTT with watchdog (60000,300000,900000,1800000,3600000).
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember: Using default stack config.
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: Socket ready
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [STACK STATUS] Network up.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [INIT TC] NCP network matches config.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember:queue: Request dispatching started.

Let me know if I can help with additional information :)

Same configuration, same issue

@djmwj
Copy link

djmwj commented Jul 11, 2024

Just wanted to chime in for awareness. I run two different coordinators via docker containers. One coordinator is SLZB-06M via LAN and one is SLZB-07 via USB. The issue is only manifesting on the SLZB-07 via USB. The SLZB-06M has not had any issues. Switching to the ezsp adapter solves the issue for SLZB-07. I'm happy to provide logs or debug info as well.
I noticed that the ezsp adapter is deprecated; is there efforts underway to provide performance parity between ember and ezsp before ezsp is removed? Thanks!

@ThiSaadeh
Copy link

Jumping into this, adding my experience regarding SLZB-06M and ember: SLZB-06M ist running on core firmware v2.3.6. Tried different zigbee firmwares with no difference regading the bootup-issue:

z2m is running in a dedicated LXC on my proxmox server. SLZB-06M is accesed via ethernet connection (wired).

Everytime the LXC with z2m restarts, z2m is not coming up, throwing this error:

Jul 09 18:58:02 z2m systemd[1]: Started zigbee2mqtt.service - zigbee2mqtt.
Jul 09 18:58:04 z2m npm[383]: > [email protected] start
Jul 09 18:58:04 z2m npm[383]: > node index.js
Jul 09 18:58:04 z2m npm[402]: Starting Zigbee2MQTT with watchdog (60000,300000,900000,1800000,3600000).
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember: Using default stack config.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: Socket ready
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: Received ERROR from NCP while connecting, with code=ERROR_EXCEEDED_MAXIMUM_ACK_TIMEOUT_COUNT.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: ASH disconnected | NCP status: ASH_NCP_FATAL_ERROR
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] error:         zh:ember:uart:ash: Error while parsing received frame, status=ASH_NCP_FATAL_ERROR.
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:10 z2m npm[402]: [2024-07-09 18:58:10] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 18:58:11 z2m npm[402]: [2024-07-09 18:58:11] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] warning:         zh:ember:uart:ash: Frame(s) in progress cancelled in [1ac1020b0a527e]
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: Received unexpected reset from NCP, with reason=RESET_SOFTWARE.
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: ASH disconnected: ASH_ERROR_NCP_RESET | NCP status: ASH_NCP_FATAL_ERROR
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember:uart:ash: Error while parsing received frame, status=HOST_FATAL_ERROR.
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] error:         zh:ember: !!! NCP FATAL ERROR reason=HOST_FATAL_ERROR. ATTEMPTING RESET... !!!
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:queue: Request dispatching stopped; queue=0 priorityQueue=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash: ASH COUNTERS since last clear:
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Total frames: RX=2, TX=3
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Cancelled   : RX=1, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   DATA frames : RX=0, TX=1
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   DATA bytes  : RX=0, TX=4
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Retry frames: RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   ACK frames  : RX=0, TX=1
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   NAK frames  : RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   nRdy frames : RX=0, TX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   CRC errors      : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Comm errors     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Length < minimum: RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Length > maximum: RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad controls    : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad lengths     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Bad ACK numbers : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Out of buffers  : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Retry dupes     : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   Out of sequence : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash:   ACK timeouts    : RX=0
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:uart:ash: ======== ASH stopped ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember:ezsp: ======== EZSP stopped ========
Jul 09 18:58:12 z2m npm[402]: [2024-07-09 18:58:12] info:         zh:ember: ======== Ember Adapter Stopped ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: Socket ready
Jul 09 18:58:13 z2m npm[402]: [2024-07-09 18:58:13] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [STACK STATUS] Network up.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [INIT TC] NCP network matches config.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
Jul 09 18:58:14 z2m npm[402]: [2024-07-09 18:58:14] info:         zh:ember:queue: Request dispatching started.

z2m is never coming up regardless how long I wait.

A simple systemctl restart zigbee2mqtt.service fixes the problem everytime. With z2m coming up:

Jul 09 20:23:03 z2m npm[1493]: Starting Zigbee2MQTT with watchdog (60000,300000,900000,1800000,3600000).
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember: Using default stack config.
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember: ======== Ember Adapter Starting ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:ezsp: ======== EZSP starting ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: ======== ASH NCP reset ========
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: Socket ready
Jul 09 20:23:06 z2m npm[1493]: [2024-07-09 20:23:06] info:         zh:ember:uart:ash: ======== ASH starting ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:uart:ash: ======== ASH connected ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:uart:ash: ======== ASH started ========
Jul 09 20:23:07 z2m npm[1493]: [2024-07-09 20:23:07] info:         zh:ember:ezsp: ======== EZSP started ========
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [STACK STATUS] Network up.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [INIT TC] NCP network matches config.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember: [CONCENTRATOR] Started source route discovery. 1248ms until next broadcast.
Jul 09 20:23:08 z2m npm[1493]: [2024-07-09 20:23:08] info:         zh:ember:queue: Request dispatching started.

Let me know if I can help with additional information :)

I got exactly the same setup / issue, it's rather annoying since I have weekly backups running and the LXC never comes back up after it cause of the same errors you provided, manual intervention via systemctl restart is required for it to resume working.

@Tarjaizaid
Copy link

Hello,

My current feedback if can be usefull

I had the same issue.
I'm using a laptop with Home assistant VM with qemu and SONOFF Zigbee updated to 7.4.3.0.
As other user, i've Delivery of BROADCAST failed error.

I've seen messages about USB power and moved the dongle to another port. nothing change

I stopped the zigbee2mqtt container and moved my configuration to "have a fresh install"
After container start, i can see directly these error messages :
error: zh:ember: Delivery of BROADCAST failed for "65532" [apsFrame={"profileId":0,"clusterId":54,"sourceEndpoint":0,"destinationEndpoint":0,"options":256,"groupId":0,"sequence":209} messageTag=1]
error: zh:ember: Delivery of BROADCAST failed for "65533" [apsFrame={"profileId":41440,"clusterId":33,"sourceEndpoint":242,"destinationEndpoint":242,"options":256,"groupId":0,"sequence":208} messageTag=255]

I don't know what are the profileID and if they push back by home assistant :(
I will try to have a fresh home assistant install to continue investigatin

@jjnj023
Copy link

jjnj023 commented Jul 16, 2024

Unfortunately I have the exact same issue. Also running Home Assistant as a VM.
I was hoping for a fix to be available. My entire system is unstable (120+ devices) so I really want to revert to a previous version without having to connect all devices again.
Can someone link me to where I can download the correct firmware to downgrade?

@thecode
Copy link

thecode commented Jul 16, 2024

I don't know what are the profileID and if they push back by home assistant :( I will try to have a fresh home assistant install to continue investigatin

I want to save you time from going in the wrong direction, this is not related to Home Assistant, it is internal by the ember driver, you can run Zigbee2MQTT without Home Assistant if you want to check, just point it to connect to another MQTT server, but anyhow not related.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ember Issues related to ember driver problem Something isn't working
Projects
None yet
Development

No branches or pull requests