Moderator: Captain Ned

Flying Fox wrote:Wouldn't this be the exact same scenario where you want some more redundancy when you are backing up regular files as well?
Dirge wrote:I have never investigated the process of storing checksums/hashes in checksum files
#!/bin/bash
find . -xdev -type f -or -type l | sort | xargs --delimiter=\\n md5sum -bUsacomp2k3 wrote:Why not just put it on a good raid1/5 setup and then not worry about it?
just brew it! wrote:I've been using my own poor-man's version of md5deep for a while:
- Code: Select all
#!/bin/bash
find . -xdev -type f -or -type l | sort | xargs --delimiter=\\n md5sum -b
This script generates a list of MD5 checksums for the entire directory tree rooted at the current directory. The checksums can be redirected into a file which can then be used later (via md5sum --check) to verify the checksums.

#!/bin/bash
find . -xdev -type f -or -type l | sort | xargs --delimiter=\\n md5sum -bfind . -xdev -type f -or -type l -print0 | xargs -0 md5sum -bzamb wrote:
- Code: Select all
#!/bin/bash
find . -xdev -type f -or -type l | sort | xargs --delimiter=\\n md5sum -b
Why not:
- Code: Select all
find . -xdev -type f -or -type l -print0 | xargs -0 md5sum -b
That’s much more robust and elegant. (I ignored the “sort” command as it’s not really needed.)
By the way: The “-b” parameter to “md5sum” is not needed under Unix/BSD/Linux systems.
Sorry for the off-topic (and the smugness involved).
just brew it! wrote:The reason I sorted the list is so that you can subsequently use the "diff" command to compare the hash lists for two directory hierarchies, to instantly identify all files that differ or have been added/removed.
just brew it! wrote:Being "diff"-friendly is also the reason for the "-b" option -- even though it does not affect the checksum calculation when running on a *NIX system, it does prepend a '*' to each checksum in the output. So if you want to be able to use "diff" to compare checksum files which were generated on *NIX with checksum files which were generated on Windows, you need to use the "-b" option (on both sides).
just brew it! wrote:Sorry for pointing out that some of the smugness was premature.
You can say that again.just brew it! wrote:Usacomp2k3 wrote:Why not just put it on a good raid1/5 setup and then not worry about it?
Because RAID is about maintaining availability in the event of a hardware failure, not backup.
Aphasia wrote:You can say that again.
For some reason I have a problem with data corruption when working in lightroom against a folder on my raid array. It seems like the transfer and working directly on the files via network can corrupt them when generation the raid. Only thing to do is reload the file from the backup and reimport it. And I cant really import the full information since my SSD cant hold a library with 30GB worth of photos.
Without backups, I've would've been toast.
Oh, never noticed it was a necro. Not that bad though, more of a zombie thing than an undead skeleton thing. It just barely had time to start to decompose.just brew it! wrote:It seems this thread necro has taken an interesting twist...
just brew it! wrote:Sounds like something's flaky in your network or file server. That's a nightmare waiting to happen; silent data corruption is far worse than something that fails outright, since you may not realize you have a problem until some time in the future. Depending on your backup regime, you may not even have a backup with a good copy of the file any more (yes, this is an argument for archiving some of your backups permanently instead of recycling the media).
Flaky NIC, flaky network switch, or bad RAM in the file server (does the server have error-correcting RAM?) are a few potential culprits that come to mind.
Yeah, also sometime on my to-do list, right after I have automized my weekly full backups of the raid. Right now I have automatic backups of all system data to the raid. The raid gets backuped, or really synced to my NAS with free file sync on a biweekly/triweekly basis manually, but I started doing the automated batch profiles that are going into the scheduler.just brew it! wrote:Yeah, that's another big caveat with RAID. You need to have a mechanism in place to notify you when drive failures occur; otherwise, RAID can cause more problems than it solves by laying time bombs for you!
Users browsing this forum: No registered users and 1 guest