Cleaning up hoarded data

So. Probably, like many others here, I have been hoarding quite the amount of data over the course of the years. However, it’s only in the later years I’ve been doing it in an organized way (I used to just throw stuff on external HDDs earlier) since, when I was a youngster, I just thought I’d keep a copy of it and that’s it.

Well. The dreaded day has come: I would now like to sort everything neatly, get rid of dupes and the whole shabang. BUT, as you probably know, it is a nightmare to sift through TBs of data that’s been hoarded over the years.

I was thus wondering if anyone has any clever ideas on how to perform this horrible task? I started looking through some stuff using tools like, e.g. Czkawka, but there is just too much.

Ideas are most welcome; or just a general chit-chat around it is good too :blush:

First post!

I don’t really have a answer to your question since I was going to suggest Czkawka :slight_smile:

I have a similar situation with terabytes of raw data, snapshots, old qcow2-vm’s, backups and whatnot that I’ve started to organize multiple times but paused on at least two occasions due to fatigue. Just added 300gb from a failed laptop so, rinse repeat…

My thought is when my next NAS is online I’ll get to it. heh

Personally I will probably just sift through the data with the help of Czkawka to get a proper structure and try to keep it that way, but I know I’ll probably end up in the same situation once again due to lack of discipline so I am also looking for inspiration how to organize my data.

Maybe if I get my IBM Storwize V7000 online, that’ll fix everything! haha

Edit:
Just wanted to add that I just found this forum and looking forward to reading and asking questions! Just the other day I found my self wanting a forum off reddit/Discord/github that handles zfs and storage discussions.

Cheers

Welcome! Happy to have you onboard.

1 Like