Disable drive connected to SAS2308 HBA on a remote server

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

trumee

Member
Jan 31, 2016
245
16
18
55
Hello,
I have a drive die in a remote system which is causing the system boot time to increase a lot. The system sits around probing the disk at boot and ultimately times out. Here is a log,

Code:
Feb 02 00:59:31 box kernel: sd 0:0:3:0: [sdd] Unaligned partial completion (resid=6140, sector_sz=512)
Feb 02 00:59:31 box kernel: sd 0:0:3:0: [sdd] tag#5329 CDB: Read(10) 28 00 00 44 c6 00 00 00 f8 00
Feb 02 00:59:31 box kernel: sd 0:0:3:0: [sdd] tag#5329 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=3s
Feb 02 00:59:31 box kernel: sd 0:0:3:0: [sdd] tag#5329 CDB: Read(10) 28 00 00 44 c6 00 00 00 f8 00
Feb 02 00:59:31 box kernel: I/O error, dev sdd, sector 4507136 op 0x0:(READ) flags 0x80700 phys_seg 30 prio class 0
Feb 02 00:59:31 box kernel: sd 0:0:3:0: Power-on or device reset occurred
Feb 02 00:59:31 box kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Feb 02 00:59:31 box kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Feb 02 00:59:31 box kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Feb 02 00:59:31 box kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Feb 02 00:59:31 box kernel: sd 0:0:3:0: Power-on or device reset occurred
Feb 02 00:59:31 box kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Feb 02 00:59:31 box kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Feb 02 00:59:31 box kernel: sd 0:0:3:0: Power-on or device reset occurred
The HBA is a Hewlett Packard Enterprise H220i,
Code:
# lspci -nn -v -s 04:00.0
04:00.0 Serial Attached SCSI controller [0107]: Broadcom / LSI SAS2308 PCI-Express Fusion-MPT SAS-2 [1000:0087] (rev 05)
Subsystem: Hewlett Packard Enterprise H220i [1590:0041]
Flags: bus master, fast devsel, latency 0, IRQ 17
I/O ports at c000 [size=256]
Memory at dd540000 (64-bit, non-prefetchable) [size=64K]
Memory at dd500000 (64-bit, non-prefetchable) [size=256K]
Expansion ROM at dd400000 [disabled] [size=1M]
Capabilities: [50] Power Management version 3
Capabilities: [68] Express Endpoint, IntMsgNum 0
Capabilities: [d0] Vital Product Data
Capabilities: [a8] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [c0] MSI-X: Enable+ Count=16 Masked-
Capabilities: [100] Advanced Error Reporting
Capabilities: [1e0] Secondary PCI Express
Capabilities: [1c0] Power Budgeting <?>
Capabilities: [190] Dynamic Power Allocation <?>
Capabilities: [148] Alternative Routing-ID Interpretation (ARI)
Kernel driver in use: mpt3sas
Kernel modules: mpt3sas
I am able to bring up the AVAGO menu by pressing Ctrl+C, but there is no option to disable the drive. Is it possible to disable the drive?
 

BLinux

cat lover server enthusiast
Jul 7, 2016
2,723
1,111
113
artofserver.com
If you don't want the controller to scan the drives during POST, disable the "boot support" in the card and it will not scan during POST and hopefully complete that step faster. If you disable "boot support", of course, this assumes you are not booting off the H220 card. If you are, then another possibility is to manually configure the primary and secondary boot drives and *maybe* that will prevent it from scanning all the drives.
 

trumee

Member
Jan 31, 2016
245
16
18
55
The slow down occurs once the linux kernel start the boot (after the EFI menu).

How do I disable the boot support on the HBA? I don't remember such an option coming up in the AVAGO menu.
 
Last edited:

trumee

Member
Jan 31, 2016
245
16
18
55
Disabled the "boot support" and rebooted. The POST seems quicker, but the boot is still slow,

1738640545203.png
 

trumee

Member
Jan 31, 2016
245
16
18
55
In that image can you not select "SAS Topology" and drill down to the bad drive and mark it "offline"?
I dont see any option to mark the drive offline. In the device properties (using Alt+D), i see option to "Format" and "Verify".