My pool's current status, for context:
  pool: tank
 state: ONLINE
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-8A
  scan: resilvered 185M in 00:00:11 with 0 errors on Thu Jul 10 21:13:21 2025
config:

        NAME                        STATE     READ WRITE CKSUM
        tank                        ONLINE       0     0     0
          raidz1-0                  ONLINE       1     0     0
            wwn-0x5000c50066a3d6d3  ONLINE       1     0     0
            wwn-0x50014ee1ac3fa6d5  ONLINE       1     0     0
            wwn-0x50014ee257b8a520  ONLINE       0     0     0
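To make the non-zero counters easier to spot, here is a small sketch that filters the config table of the status above. The function name nonzero_errors is made up for illustration, and it assumes the NAME STATE READ WRITE CKSUM column order shown; the sample text is copied from my output.

```shell
#!/bin/sh
# Sketch: list vdevs whose READ, WRITE or CKSUM counter is non-zero.
# Reads `zpool status` text on stdin. Assumes the column order
# NAME STATE READ WRITE CKSUM; nonzero_errors is a hypothetical helper.
nonzero_errors() {
    awk '
        # The header row marks the start of the config table.
        $1 == "NAME" && $2 == "STATE" { in_config = 1; next }
        # A blank line after the table ends it.
        in_config && NF == 0 { in_config = 0 }
        # Print any row with a counter other than the literal "0".
        in_config && NF >= 5 && ($3 != "0" || $4 != "0" || $5 != "0") {
            printf "%s read=%s write=%s cksum=%s\n", $1, $3, $4, $5
        }
    '
}

# Sample input copied from the status above.
status_sample='config:
NAME STATE READ WRITE CKSUM
tank ONLINE 0 0 0
raidz1-0 ONLINE 1 0 0
wwn-0x5000c50066a3d6d3 ONLINE 1 0 0
wwn-0x50014ee1ac3fa6d5 ONLINE 1 0 0
wwn-0x50014ee257b8a520 ONLINE 0 0 0'

printf '%s\n' "$status_sample" | nonzero_errors
```

On a live system the same function could be fed with `zpool status tank | nonzero_errors`, which only reads pool state.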
I have a scrub scheduled for the first of each month. I don't have notifications set up, so I check manually, when I have time, that everything went well. When I took a peek last night, everything looked fine, and I saw that I had some pending updates as well, so I applied them:
================================================================================
 Package             Arch    Version                  Repository  Size
================================================================================
Installing:
 kernel              x86_64  4.18.0-553.58.1.el8_10   baseos      10 M
 kernel-core         x86_64  4.18.0-553.58.1.el8_10   baseos      44 M
 kernel-devel        x86_64  4.18.0-553.58.1.el8_10   baseos      24 M
 kernel-modules      x86_64  4.18.0-553.58.1.el8_10   baseos      36 M
Upgrading:
 bpftool             x86_64  4.18.0-553.58.1.el8_10   baseos      11 M
 kernel-headers      x86_64  4.18.0-553.58.1.el8_10   baseos      12 M
 kernel-tools        x86_64  4.18.0-553.58.1.el8_10   baseos      11 M
 kernel-tools-libs   x86_64  4.18.0-553.58.1.el8_10   baseos      10 M
 libblockdev         x86_64  2.28-7.el8_10            appstream  132 k
 libblockdev-crypto  x86_64  2.28-7.el8_10            appstream   81 k
 libblockdev-fs      x86_64  2.28-7.el8_10            appstream   87 k
 libblockdev-loop    x86_64  2.28-7.el8_10            appstream   70 k
 libblockdev-lvm     x86_64  2.28-7.el8_10            appstream   87 k
 libblockdev-mdraid  x86_64  2.28-7.el8_10            appstream   77 k
 libblockdev-part    x86_64  2.28-7.el8_10            appstream   80 k
 libblockdev-swap    x86_64  2.28-7.el8_10            appstream   72 k
 libblockdev-utils   x86_64  2.28-7.el8_10            appstream   80 k
 pam                 x86_64  1.3.1-37.el8_10          baseos     747 k
 platform-python     x86_64  3.6.8-70.el8_10.rocky.0  baseos      88 k
 python3-libs        x86_64  3.6.8-70.el8_10.rocky.0  baseos     7.8 M
 python3-perf        x86_64  4.18.0-553.58.1.el8_10   baseos      11 M
 sos                 noarch  4.9.2-1.el8_10           baseos     986 k
Removing:
 kernel              x86_64  4.18.0-553.53.1.el8_10   @baseos        0
 kernel-core         x86_64  4.18.0-553.53.1.el8_10   @baseos     71 M
 kernel-devel        x86_64  4.18.0-553.53.1.el8_10   @baseos     53 M
 kernel-modules      x86_64  4.18.0-553.53.1.el8_10   @baseos     25 M
After applying those, I saw that it triggered a DKMS rebuild and updated ZFS from 2.1.x to 2.2.8-1. I got excited about that, since 2.2 added ntfy support to ZED. zpool status said that an upgrade was available, so I applied it. I added my ntfy credentials to zed.rc and tried triggering an alert using
sudo zinject -d wwn-0x5000c50066a3d6d3 -e io -T all -f 100 tank
which worked great: I got my ntfy notification. I tried clearing the error by running
sudo zinject -c all
sudo zpool clear tank wwn-0x5000c50066a3d6d3
but immediately afterwards I started getting more error notifications. They said files were corrupt, in every snapshot… I checked those files and, sure enough, I couldn't open or copy them. I thought, that stinks, but it's not the end of the world. Then I checked another file that was not listed as corrupted, and it wouldn't open either, and then another. Now I am scared to touch anything. SMART says the drives are fine. Everything was working until I did the above.
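For reference, the ntfy part of my zed.rc mentioned above has roughly the following shape. This is only a sketch with placeholder values, not my real topic or token. The variable names are the ones I understand the stock 2.2 zed.rc to ship with, so treat them as an assumption if your copy differs.

```shell
# /etc/zfs/zed.d/zed.rc (excerpt; every value below is a placeholder)

# Topic that receives the notifications (hypothetical topic name).
ZED_NTFY_TOPIC="example-topic"

# Access token, only needed for protected topics (placeholder, not a real token).
ZED_NTFY_ACCESS_TOKEN="tk_placeholder"

# ntfy server to publish to; https://ntfy.sh is the public default.
ZED_NTFY_URL="https://ntfy.sh"
```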
My questions: what did I do wrong, and how can I fix it?
Thank you for any help.