HBA LSI 9400-16i errors in kernel log

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

bugacha

Active Member
Sep 21, 2024
382
102
43
I have LSI 9400-i16 on TrueNAS Scale 24.10.2 and in general it runs fine, disks and everything works.

I recently discovered that log file is full of errors :

Code:
mpt3sas_cm0: log_info(0x300301e1): originator(IOP), code(0x03), sub_code(0x01e1)

# egrep "sub_code\(0x01e1\)" kern.log | wc -l
222293
Same error repeats many times a second without any particular pattern.

Does anyone know what they are ?

Controller is running latest available firmware, and I guess its in IT mode

Code:
# storcli64 /c0 show nolog

CLI Version = 007.3103.0000.0000 Aug 22, 2024
Operating system = Linux 6.6.44-production+truenas
Controller = 0
Status = Success
Description = None

Product Name = HBA 9400-16i
Serial Number = SPC0726330
SAS Address =  500605b0120d9490
PCI Address = 00:01:00:00
System Time = 02/17/2025 08:04:56
FW Package Build = 24.00.00.00
FW Version = 24.00.00.00
BIOS Version = 09.47.00.00_24.00.00.00
NVDATA Version = 24.00.00.22
PSOC FW Version = 0x0001
PSOC Part Number = 05689
Driver Name = mpt3sas
Driver Version = 43.100.00.00
Bus Number = 1
Device Number = 0
Function Number = 0
Domain ID = 0
Vendor Id = 0x1000
Device Id = 0xAC
SubVendor Id = 0x1000
SubDevice Id = 0x3000
Board Name = HBA 9400-16i
Board Assembly = 03-50008-15012
Board Tracer Number = SPC0726330
Security Protocol = None
 
Last edited:

bugacha

Active Member
Sep 21, 2024
382
102
43
maybe this might help? lsi-decode-loginfo.py
Thank you!

Code:
bin/lsi-decode-loginfo.py 0x300301e1

0x300301e1 = Error code being matched

0x00010000 IOP_LOGINFO_CODE_INVALID_SAS_ADDRESS
0x00020000 IOP_LOGINFO_CODE_UNUSED2
0x00030000 IOP_LOGINFO_CODE_CONFIG_INVALID_PAGE
0x00000001 PL_LOGINFO_SUB_CODE_OPEN_FAIL_NO_DEST_TIME_OUT
0x00020000 PL_LOGINFO_CODE_INVALID_SGL
0x00030000 PL_LOGINFO_CODE_WRONG_REL_OFF_OR_FRAME_LENGTH
0x00000100 PL_LOGINFO_SUB_CODE_OPEN_FAILURE
0x00000101 PL_LOGINFO_SUB_CODE_OPEN_FAILURE_NO_DEST_TIMEOUT
0x00000120 PL_LOGINFO_SUB_CODE_TARGET_BUS_RESET
0x00010000 IR_LOGINFO_RAID_ACTION_ERROR
0x00020000 IR_LOGINFO_CODE_UNUSED2
0x00010001 IR_LOGINFO_VOLUME_CREATE_INVALID_LENGTH
0x00010020 IR_LOGINFO_PHYSDISK_CREATE_TOO_MANY_DISKS
0x00010021 IR_LOGINFO_PHYSDISK_CREATE_INVALID_LENGTH

gives too many random error codes.

I dont even use Raid functionality of the controller, TrueNAS runs ZFS raidz2 array.
 

bugacha

Active Member
Sep 21, 2024
382
102
43
Running it on sub-code

Code:
$ bin/lsi-decode-loginfo.py 0x01e1

0x01e1 = Error code being matched

0x00000001 PL_LOGINFO_SUB_CODE_OPEN_FAIL_NO_DEST_TIME_OUT
0x00000100 PL_LOGINFO_SUB_CODE_OPEN_FAILURE
0x00000101 PL_LOGINFO_SUB_CODE_OPEN_FAILURE_NO_DEST_TIMEOUT
0x00000120 PL_LOGINFO_SUB_CODE_TARGET_BUS_RESET

No idea what those mean
 

bugacha

Active Member
Sep 21, 2024
382
102
43
Tried another script on internet : lsi_decode_loginfo/lsi_decode_loginfo.py at master · baruch/lsi_decode_loginfo

This gives just one error :

Code:
bin/lsi_decode_loginfo_2.py 0x300301e1

Value           300301E1h
Type:           30000000h       SAS
Origin:         00000000h       IOP
Code:           00030000h       IOP_LOGINFO_CODE_CONFIG_INVALID_PAGE
Sub Code:       00000100h       IOP_LOGINFO_CODE_CONFIG_INVALID_PAGE_RT Route Table Entry not found
unknown         000000E1h       unknown

Seems pretty benign to me ?