-
-
Notifications
You must be signed in to change notification settings - Fork 756
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
borg check --repair --verify-data: inconsistent/sporadic chunk id verification failure #5822
Comments
this is obviously a bug (you should not see %s but a chunkid), there was an argument missing there. Fixed by #5823. |
About your "bug" report: AFAICS now, this is not a bug, but expected borg behaviour:
Before actually deleting stuff with integrity problems, borg is careful: It just tries a second time to get the chunk from the repo and verifies the mac, decrypts, decompresses the data and then checks if the wanted-chunkid corresponds to the computed-chunkid:
The main idea with that code is not to delete chunks just because of sporadic failures. If it doesn't cause issues again, you get that "did not consistently fail" message and it does not delete the chunk. The bad news is of course:
So I suggest you run some passes of memtest86+ on that system. |
After you have verified that CPU/RAM is working ok, you could also run the borg repair command again. It would be interesting to see if it still finds problematic chunks and if so, whether they have same of different ids. |
Theoretically, there could be also a bug in borg's code (incorrect detection of inconsistent behaviour), but let's first assume that there actually is inconsistent behaviour. |
Sorry for the delay, I'm still trying to get to the GRUB menu during startup to run the memtest from there. Still trying, getting back. |
fix missing parameter in "did not consistently fail" msg, see #5822
@jurgenhaas for memtest86+ you need to boot with legacy BIOS and you usually can get to the grub menu with hammering shift or esc(?) after the bios screen (or reconfigure grub to always show the menu for some seconds). you could also make a USB stick with memtest86+ and boot from that via BIOS boot order (F12 or so). |
OK, I get the Grub menu displayed during boot but it doesn't contain memtest86+. I've looked into |
Some devices might be only bootable via UEFI, e.g. NVME PCIe SSDs. But if your system still supports legacy boot, you should be able to boot from a memtest86+ usb stick. If you still have a DVD drive (internal SATA or external via USB), you could also use a ubuntu 18.04 DVD, it has a working version of memtest86+. |
fix missing parameter in "did not consistently fail" msg, see #5822
Think I've tried everything possible. With legacy mode enforced, the NUC won't boot under any circumstances. With a USB device it does recognize it but then won't boot from it either.
But I ran it several times and it reports different problems for most of the runs:
It's different every time I run it. Not sure what that means, you're probably going to tell me I should replace the RAM in that box? |
Some notable things in that output:
guess the very first thing i would try is to re-seat memory modules (and if that does not help: the cpu). maybe just some bad contact. if that doesn't help, try with a new memory module. |
ok, so the minor borg bug was already fixed and the main issue is a hardware issue, thus i am closing this. |
I know I'm late with this feedback, but I wanted everyone let know that re-seating memory modules resolved the issue. TBH, I was sceptical with this advice, but it really works. Thanks again @ThomasWaldmann for helping me out with this. |
Have you checked borgbackup docs, FAQ, and open Github issues?
Yes
Is this a BUG / ISSUE report or a QUESTION?
ISSUE
System information. For client/server mode post info for both machines.
Your borg version (borg -V).
1.1.16
Operating system (distribution) and version.
Ubuntu 20.04
Hardware / network configuration, and filesystems used.
Intel NUC with ect4
How much data is handled by borg?
400GB on that host, 274GB in the problematic repository
Full borg commandline that lead to the problem (leave away excludes and passwords)
Describe the problem you're observing.
My monthly check reported a problem with the integrity of some chunks, so I started to analyse the repository. The command above logs the following statement:
Can you reproduce the problem? If so, describe how. If not, describe troubleshooting steps you took before opening the issue.
Yes, no matter how often I run this. It continues reporting that integrity error but still doesn't delete the reported chunk.
The text was updated successfully, but these errors were encountered: