I have a ZFS array attached to my Proxmox host for media and general storage. One of the drives had a large number of relocated sectors and came up faulted so I got a replacement. Ran the replace command but the completion percentage hasn't moved since I started and the ETA goes up and up.
I know it is a slightly strange setup, but I have 7 3TB drives in a Raid z2 and a pair of 4TB drives in a mirror. one of the 3TB drives failed.
I tried to offline the faulted drive but that didn't seem to have any effect. So I just ran
When I add -v to see errors I get
I expect the resilvering to take a few days for a 3TB drive, but it has been sitting at 5.17% for a few hours now. I am also a little confused as to why the pool is unavailable.
Did I break something?
I know it is a slightly strange setup, but I have 7 3TB drives in a Raid z2 and a pair of 4TB drives in a mirror. one of the 3TB drives failed.
I tried to offline the faulted drive but that didn't seem to have any effect. So I just ran
zpool replace PoolofThrees /dev/disk/by-id/wwn-0x5000039ff4f5a07c /dev/disk/by-id/wwn-0x5000039ff4cb4a1c
Code:
pool: PoolofThrees
state: UNAVAIL
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: resilver in progress since Tue Dec 22 18:04:33 2020
2.56T scanned at 228M/s, 917G issued at 79.8M/s, 17.3T total
169M resilvered, 5.17% done, 2 days 11:58:32 to go
config:
NAME STATE READ WRITE CKSUM
PoolofThrees UNAVAIL 0 0 0 insufficient replicas
raidz2-0 UNAVAIL 0 0 0 cannot open
wwn-0x5000039fe3cac490 ONLINE 0 0 0
wwn-0x5000039fe3cb445e ONLINE 0 0 0
wwn-0x5000039fe3c400e1 ONLINE 0 0 0
wwn-0x5000039ff4f2df57 ONLINE 0 0 0
wwn-0x5000039ff4f2df60 ONLINE 0 0 0
wwn-0x5000039ff4f595db ONLINE 0 0 0
replacing-6 DEGRADED 0 0 0
wwn-0x5000039ff4f5a07c FAULTED 76 0 0 too many errors
wwn-0x5000039ff4cb4a1c ONLINE 0 0 0 (resilvering)
mirror-1 ONLINE 0 0 0
wwn-0x5000cca250ebf410 ONLINE 0 0 0
wwn-0x5000cca23dd2ea40 ONLINE 0 0 0
errors: 1616047 data errors, use '-v' for a list
errors: List of errors unavailable: pool I/O is currently suspended
and when I check the event log all the errors are marked as "pool_failmode=wait".I expect the resilvering to take a few days for a 3TB drive, but it has been sitting at 5.17% for a few hours now. I am also a little confused as to why the pool is unavailable.
Did I break something?