-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cyberpower stale commands #91
Comments
The addons aren't designed to manually edit files, I would recommend looking at the |
Paul, I just noticed that in the docs. Is this a proper config? I does load.
|
upsd_maxage: 25 should be outside the device (i.e. at the same level as devices/mode etc). Also shutdown_hassio is not valid anymore and can be removed. |
Thanks for the info. New config:
I will update, if it stays online for 3 to 4 hours. Since the update I have only made about 2 hours. I am surprised that directly connected hardware is less tolerant than the other units I have remotely connected to RPi's. They just never fail. |
Does the log_level: debug place undo load on anything? I really does provide a lot of info. |
You can remove the log_level debug, it is rarely that helpful:) Please let us know how you get on. |
Do I need to worry about the cruff in the /etc/nut config files that was brought forward and now can't be removed? I verified that /etc/nut/ups.conf still has the basic perimeters that I have been running forever. Or is that being added to the file by the config in the addon? |
The addon creates the config on startup, I'm not sure what you are referring to. |
Actually looking further, you would likely need:
Assuming those are valid driver options |
This config did not work...
Log snippet around the nut restart"
My config from above does not seem to process the pollinterval properly, the ups becomes stale after about 1 hr and 45 minutes. Restarting nut restores service. |
I personally have a Cyberpower UPS, it has been up and running for the past 5 hours or so of testing with the following:
Looking at the NUT docs, the default poll interval is 2 seconds. |
I just stripped mine down to this:
Maybe just made it to complex or smart. I am more accustomed to running on a Pi and having to configure it by hand. Will report back. |
Well crap, no data already. Reporting data stale in log. Will try a system restart, don't expect a change |
Same problem here: CyberPower UPS was working properly with the Addon, but now is throwing up "stale data" log messages and the sensors become unavailable. It's unclear what the trigger is. A manual restart of the Addon seems to get things going, albeit temporarilly. I first noticed this more than two days ago, so it was either brought about by HA 2012.2 or by the NUT Addon's v0.5.0. Let me know what I can do to provide any more useful information. |
@ricarva This morning I fell back to my last stable config, it was version 0.5.0. Did a partial restore a few moments ago. I have noticed that just unplugging and replugging the usb restored service. I am waiting to see if this test allows the unit to remain up. |
I have exceeded 2 hours of good connection after reverting to my 0.5.0 snapshot. Up longer now than I have been on 0.6.0 or 0.6.1, but long term stability test will just take time. |
@mattlward Thanks for the head's up. I'm still on 0.6.1 and HA 2012.2, but confirm your finding that unplugging/replugging the USB restores the service. Let's see for how long. @sinclairpaul Any idea of where the issue may lie? Thanks for the help and insights. |
Under .0.6.1 the service was restored for less than 2 hours, no different than restarting Nut. I am on HA 2021.2.3 I am running on a Lenovo M73, so I have multiple usb controllers and changing to a different controller had the same short duration fix. |
@ricarva, did you edit the /etc/nut *.conf files under older versions? I did and I know the container gets rebuilt on upgrade, but my ups.conf and upsd.conf files still seemed to contain my old data when changing from 0.5.0 to 0.6.0. I could be that it only appears that way, if when the system builds those files it reads data from the addon config file. |
I'm running on a RPi3b, and my last few attempts saw NUT last less than an hour. I did not edit the *.conf files. |
Not sure if this means anything... This is a graph of my free memory, the spike was the last time 0.6.1 died and I fell back to 0.5.0. My system has always appeared to have a memory leak, but never really goes below around 5600. And never becomes unstable. If not for version changes, I would expect wonderful uptimes. I have see 16 weeks between reboots. |
Same issue at my side. It began yesterday. |
I just pushed an update to the edge repo which you are welcome to test, allowing the config of the deadtime parameter. I have been running for at least 5 hours fine with it (although I got that yesterday), with the following config:
Can I also suggest that when you save the config, take a quick look in the Supervisor log, as it will report any issues with it. I will continue to test over the next day or so. |
I will try to convert to the edge repo this evening or in the morning. Thanks for your work and help. |
@sinclairpaul, thanks for the quick turnaround on a possible fix. I do wonder: do you have an idea of what made the issue manifest when it wasn't a problem in the past? |
@sinclairpaul , now on the latest edge version with the following config:
|
This comment has been minimized.
This comment has been minimized.
You can docker exec all you want, however I think this is getting a little off topic for this issue, likely better asked on the forums or Discord. |
Understood. Just throwing it out there. |
So currently I am 8 hours without an issue, any other updates? |
I am at 3:45 on the edge build, still looking good. |
@sinclairpaul still the question holds: why would deadtime need to be set now, when it wasn't a problem in the past? |
As mine has been running all night without an issue, I will release and close this out.
Deadtime was also set, it now can be adjusted, the repo has a changelog, and after spending ~20 hours of my own time on the addon this week, I'm not really going to look any further into it 😉. Might of been the debian change, or the HA hw layer change, but neither I can do anything about. |
Hopefully fixed with v0.6.2, closing out for now. |
@sinclairpaul 0.6.2 doesn't solved it here. Can you reopen the issue please? |
I'm also seeing stale data errors with v6.2. |
I'm sorry folks, it is likely your configuration. I have been running for over a week with no issues, and based on the other comments I would suggest it works. To clarify my config is:
|
Alright, I'll duplicate your config as it will work for me and let you know how it goes. |
same here, testing Paul's config... |
My input on the configuration piece: what made the setup stable for me was setting the poll interval. The Maxage and Deadtime params, by themselves, were not enough. Cheers, |
I can confirm here that since the 0.6.2 update AND the |
I'm having this issue. Here is my config :
Let's see if this fixes it. I had already setted the maxage and deadtime without success. I'm on 0.6.2 as well. |
i am not sure it matters, but have the serial in there because I have multiple ups attached.
|
something broke in the recent HA core since the problem started since the last HA Core update. |
In my case the it's still working at the moment the solution was to add -pollininterval = 15 in the device -> config option So my config is working. |
A couple days now without error. Thanks @sinclairpaul. |
Thanks for all the feedback, please feel free to open new issues. |
Problem/Motivation
Container file system not complete. After the upgrade to 0.6.1 my ups started going stale. I opened portainer and consoled into the nut container to edit /etc/nut/ups.conf: and add a pollinterval = 15. But, vi and/or nano are not in the container.
I did adjust my config this way and am waiting to see if it works.
I can restart nut without complaints, at this point.
I suspect that the editors went away with the linux change in 0.6.0
Expected behavior
I expect my ups to stay online.
Actual behavior
Data goes stale after a few hours, Cyber Power ups and they are known for needing a short pollinterval
I may need to change DEADTIME 25, normally in the file /etc/nut/upsmon.conf:
Steps to reproduce
Proposed changes
The text was updated successfully, but these errors were encountered: