-
-
Notifications
You must be signed in to change notification settings - Fork 368
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ups.status: WAIT roughly every 24 hours with CyberPower rack UPS #1689
Comments
Output voltage looks odd too at 260V vs. 230V input, though may well be a wrong mapping or reality of your location. CPS devices were mentioned in NUT issues quite a few times with various quirks, as well as Raspberries. Not sure if that just reflects their popularity or HW/FW problems, though. Still, makes sense to peruse tagged issues to see if anything rings a bell. I think there was a discussion about eventual USB port resets (possibly firmware reboots) that might be seen in dmesg, causing the OS to find the USB HID device again and default udev to grab it, so the NUT drivers could not reconnect when they tried softly. There were changes in current master (maybe after 2.8.0 release) to let it try harder. FWIW, beside Pi reboot - does restarting nut-driver service help unwedge it - did you try? If not, might be the Pi USB stack got stuck... |
@jimklimov Thanks for your insights. I did try to restart There is nothing in dmesg for that time. There are, however, error messages in /var/syslog starting at the time when the NUT driver failed. Although, they are not really useful:
I'm trying this workaround right now, although it's really just shooting into the dark: |
I suppose "resource unavailable" and a few seconds later "connected to" mean that the driver tried to restart. Not sure however why it took roughly half an hour after "stale" status... |
Thanks for the link, I'll add it to wiki => https://github.com/networkupstools/nut/wiki/Troubleshooting-eventual-disconnections-(Data-stale) |
Probably off-topic here but out of curiosity I checked the output voltage 1) on the UPS device and 2) with a multimeter and both show a correct 234V - 235V reading. Therefore 263.0V reported by nut is currently incorrect. I think that I found another GitHub topic on that: #439 |
With a 2.7.4 build running, yes that issue may be it. Can you try building the current NUT master branch to run the driver? Might suffice to:
...to get a single data collection walk and dump afterwards similar to the You may have to run the test-mode driver via |
It seems that suggestion from https://raspberrypi.stackexchange.com/questions/66611/nut-cyberpower-data-stale might have helped. I'm pretty sure that I haven't changed anything else and the system has been up for the past 3 days straight. To clarify, this is what I did:
I'll report back if it goes wrong. Do you think @jimklimov that it's worth for me to revert back and try building from the master branch? |
I suppose current NUT master does not address this, but a PR -- to detect CPS (maybe others too, maybe just by ids) and default to lower pollrate if unspecified by user -- might help for better out of the box experience :D |
@jimklimov Ok, I agree. I will give it a few more days just to make sure it's stable. Then I revert my changes to reproduce that the WAIT/STALL issue happens again. This way we'll be sure that these changes exactly address the issue. Will report back here. Would you agree? |
Yes, sounds right - thanks! |
@jimklimov Everything has been holding up well until now, reverting my changes, trying to reintroduce the issue again. |
@jimklimov Strange as it seems, the problem doesn't seem to come back after reverting my changes. Maybe what I should do is: reinstall Raspberry PI and follow the exact steps again. That sounds more like a weekend project, I will keep it running though. I don't like the "it works, but I don't know why" effect 😂 |
@jimklimov All right, reproduced, got the "WAIT" state today. What would you suggest next? Will you create a branch/PR that I download and install instead of the out-of-box version? |
Revising old notes, remembered to suggest |
Hopefully only suggest I've been running a CyberPower 1500EPFCLCD-UK for the last 2 years with 2.74 on RPi with the following settings:
and the only issue is that the UPS will randomly stop responding, it could be weeks or months, but a restart of the driver after a While investigating this issue with 2.74 I tried a self build of 2.80 and that didn't appear to have the same issue, so it may already be fixed. As it happens a week ago I switched to the latest master build (2.82+) so if there's any issues I'll report back. So far, so good. |
Thanks for the update! FWIW, in a brewing commit, it would loudly "only suggest" |
…NG.adoc: introduce DEFAULT_POLLFREQ_CPS and suggest pollonly on CPS devices [networkupstools#1689] Signed-off-by: Jim Klimov <jimklimov+nut@gmail.com>
All good so far with a CP1500EPFCLCD (0501/0764) using the new default polling frequency. 👍 With this UPS and 2.74 it could take anywhere from a few days to couple of months for the UPS to stop communicating (but a driver restart - Settings:
System log:
|
I'm running
nut
onRaspbian
with a CyberPower OR600ERM1U rack UPS connected via USB.After each reboot,
nut
is working properly for roughly 24 hours, reporting of battery level, runtime, etc. is working fine. I tested it while charging, discharging, floating. However, after a couple of hours (usually about a day)nut
reports UPS status changed fromOL
toWAIT
and since that time I am unable to read any information fromnut
until I completely restart my Raspberry Pi.Environment:
$ uname -a Linux raspberrypi 5.15.32+ #1538 Thu Mar 31 19:37:58 BST 2022 armv6l GNU/Linux $ upsd -V Network UPS Tools upsd 2.7.4 $ upsc -V Network UPS Tools upscmd 2.7.4
This is how it looks like after I receive a notification from my NodeRED that
ups.status
has changed toWAIT
:After this I have to reboot my Raspberry Pi and after a full reboot everything starts working:
This happens every day, not just a single occasion. Any ideas what can be wrong?
The text was updated successfully, but these errors were encountered: