First the build:
Supermicro X11SSL-F motherboard (bios updated to latest: 1.0b)
Intel i3 6100 CPU
32GB Crucial ECC UDIMM DDR4
LSI 9212-4i4e (updated to P19)
Supermicro 826E1-R800 chassis with SAS-826EL1 backplane (3Gb SAS expander
The issue:
I am seeing multiple drives sometimes become degraded. One of the mirrors I can't get a successful resilver. The degraded drives look perfectly fine from their Smart values with no reallocated sectors. The pool will be working normally, or resilvering, when all of a sudden any drive in use will start to show a large number of hard and transfer errors. Using IPMI to pull up the KVM console I see a large number of this error message:
scsi: /pci@0,0/pci8086,1905@1,1/pci1000,3060@0 (mpt_sas0)
Aborted_command!
Any idea how to narrow this down? Is this a firmware issue on my 9212? Could it be a bad backplane, cable, or HBA? The voltages from the PSU look okay from IPMI. Am I stuck with replacing different hardware till the error goes away?
Supermicro X11SSL-F motherboard (bios updated to latest: 1.0b)
Intel i3 6100 CPU
32GB Crucial ECC UDIMM DDR4
LSI 9212-4i4e (updated to P19)
Supermicro 826E1-R800 chassis with SAS-826EL1 backplane (3Gb SAS expander
The issue:
I am seeing multiple drives sometimes become degraded. One of the mirrors I can't get a successful resilver. The degraded drives look perfectly fine from their Smart values with no reallocated sectors. The pool will be working normally, or resilvering, when all of a sudden any drive in use will start to show a large number of hard and transfer errors. Using IPMI to pull up the KVM console I see a large number of this error message:
scsi: /pci@0,0/pci8086,1905@1,1/pci1000,3060@0 (mpt_sas0)
Aborted_command!
Any idea how to narrow this down? Is this a firmware issue on my 9212? Could it be a bad backplane, cable, or HBA? The voltages from the PSU look okay from IPMI. Am I stuck with replacing different hardware till the error goes away?
Attachments
-
199.3 KB Views: 20