We've released interim 0.3.5.0 version some time ago, that is available for download. It hasn't been publicly announced because included various small improvements and bug fixes, and has been addressed to users, who faced these issues (support of vdev_id.conf file, display DRAID pools in UI, better support for NVMe S.M.A.R.T., support for pools with listsnapshots=on property, etc). We are still working on new Preview 4 (0.4.0.0) release with the new set of features and currently expect it on March 6.just checking in as I haven't seen any updates on the site, any news on a new release?
We use smartmontools to get the SMART data (the same package as in the article you mentioned). Poolsman mostly displays the data that it gets from this tool (the overall health state is also provided by this). Additionally we perform some minor transformations to make the data more friendly (e.g. calculate host reads/writes in bytes). You also can manually check any of the SMART attributes (including attributes from the article), provided by your drive and supported by smartmontools package.Regarding SMART and disks, how do you currently determine that a disk is healthy? Are you perhaps monitoring these? What SMART Hard Disk Errors Actually Tell Us
Thanks. It definitely makes sense to monitor these particular SMART attributes in parallel (and/or highlight them when they RAW values are greater than zero). We've added a research task for that into our backlog. Probably we will add some kind of such monitoring in one of our next releases (at least simple highlighting of these attributes should be easy to implement).Thank you. The reason I'm asking is because those specific SMART params seem to signal an impending drive failure without necessarily triggering SMART overall health alarms - backblaze have a massive statistic body of evidence on it. I think there's an opportunity here for SMART(er) drive health monitoring beyond simply echoing SMART values back to the user. Quoting the article:
View attachment 27864
Hi, thank you for letting us know. There was an issue with SSL cert, now it's fixed.Hi, poolsman.com is down. Does this project have a new website?
root@box:~# zpool status
pool: aggr0
state: DEGRADED
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: scrub repaired 0B in 2 days 10:05:20 with 0 errors on Tue Jul 11 10:29:22 2023
scan: resilver (draid3:19d:24c:2s-0) in progress since Mon Jul 24 19:40:55 2023
11.3T scanned at 2.12G/s, 10.4T issued 1.95G/s, 359T total
482G resilvered, 3.15% done, 1 days 22:36:05 to go
config:
NAME STATE READ WRITE CKSUM
aggr0 DEGRADED 0 0 0
draid3:19d:24c:2s-0 DEGRADED 0 0 0
0a-0 ONLINE 0 0 0 (resilvering)
0a-1 ONLINE 0 0 0 (resilvering)
0a-2 ONLINE 0 0 0 (resilvering)
0a-3 ONLINE 0 0 0 (resilvering)
0a-4 ONLINE 0 0 0 (resilvering)
0a-5 ONLINE 0 0 0 (resilvering)
0a-6 ONLINE 0 0 0 (resilvering)
0a-7 ONLINE 0 0 0 (resilvering)
0a-8 ONLINE 0 0 0 (resilvering)
0a-9 ONLINE 0 0 0 (resilvering)
0a-10 ONLINE 0 0 0 (resilvering)
0a-11 ONLINE 0 0 0 (resilvering)
spare-12 DEGRADED 0 0 0
0a-12 UNAVAIL 3 4 0
draid3-0-0 ONLINE 0 0 0 (resilvering)
0a-13 ONLINE 0 0 0 (resilvering)
0a-14 ONLINE 0 0 0 (resilvering)
0a-15 ONLINE 0 0 0 (resilvering)
0a-16 ONLINE 0 0 0 (resilvering)
0a-17 ONLINE 0 0 0 (resilvering)
0a-18 ONLINE 0 0 0 (resilvering)
0a-19 ONLINE 0 0 0 (resilvering)
0a-20 ONLINE 0 0 0 (resilvering)
0a-21 ONLINE 0 0 0 (resilvering)
0a-22 ONLINE 0 0 0 (resilvering)
0a-23 ONLINE 0 0 0 (resilvering)
special
mirror-1 ONLINE 0 0 0
nvme01-part1 ONLINE 0 0 0
nvme02-part1 ONLINE 0 0 0
cache
sdc1 ONLINE 0 0 0
sdd1 ONLINE 0 0 0
spares
draid3-0-0 INUSE currently in use
draid3-0-1 AVAIL