Megaraid IBM M5016 Power on, reset, or bus device reset occurred / Diagnostics failed for PD/ PD error

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

pablocool

Member
Mar 13, 2012
31
1
8
38
Poland
Hi,

One of my RAID6 array disks (WD30EFRX) went offline because of "Power on, reset, or bus device reset occurred / Diagnostics failed for PD / PD Error". I checked media errors, SMART and everything looks fine. I made this disk online again imported foreign configuration, and for while everything was fine. But after a minute disk again gone offline.
What may it be? PSU? Failing controller? Disk?

raid6.PNG

Code:
ID = 219
SEQUENCE NUMBER = 44326
TIME = 06-07-2020 02:40:46
LOCALIZED MESSAGE = Controller ID:  0   Foreign Configuration Imported

ID = 232
SEQUENCE NUMBER = 44325
TIME = 06-07-2020 02:40:46
LOCALIZED MESSAGE = Controller ID:  0   Replaced Missing on array:   -:-:19      Array   0  Row   0

ID = 114
SEQUENCE NUMBER = 44324
TIME = 06-07-2020 02:40:46
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:19  Previous   =   Unconfigured Good      Current   =   Offline

ID = 218
SEQUENCE NUMBER = 44323
TIME = 06-07-2020 02:40:45
LOCALIZED MESSAGE = Controller ID:  0   Foreign Configuration Detected

ID = 218
SEQUENCE NUMBER = 44322
TIME = 06-07-2020 02:40:37
LOCALIZED MESSAGE = Controller ID:  0   Foreign Configuration Detected

ID = 114
SEQUENCE NUMBER = 44321
TIME = 06-07-2020 02:40:27
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:19  Previous   =   Unconfigured Bad      Current   =   Unconfigured Good

ID = 7
SEQUENCE NUMBER = 44320
TIME = 06-07-2020 02:37:13
LOCALIZED MESSAGE = Controller ID:  0   Alarm disabled by user

ID = 285
SEQUENCE NUMBER = 44319
TIME = 06-07-2020 02:35:35
LOCALIZED MESSAGE = Controller ID:  0  PD FRU is:    PD :   -:-:19  FRU :

ID = 247
SEQUENCE NUMBER = 44318
TIME = 06-07-2020 02:35:35
LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   19

ID = 91
SEQUENCE NUMBER = 44317
TIME = 06-07-2020 02:35:35
LOCALIZED MESSAGE = Controller ID:  0   PD inserted:   -:-:19

ID = 114
SEQUENCE NUMBER = 44316
TIME = 06-07-2020 02:35:29
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:19  Previous   =   Failed      Current   =   Unconfigured Bad

ID = 248
SEQUENCE NUMBER = 44315
TIME = 06-07-2020 02:35:29
LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   19

ID = 112
SEQUENCE NUMBER = 44314
TIME = 06-07-2020 02:35:29
LOCALIZED MESSAGE = Controller ID:  0   PD removed:   -:-:19

ID = 114
SEQUENCE NUMBER = 44313
TIME = 06-07-2020 02:35:28
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:19  Previous   =   Configured - shielded      Current   =   Failed

ID = 401
SEQUENCE NUMBER = 44312
TIME = 06-07-2020 02:35:28
LOCALIZED MESSAGE = Controller ID:  0  Diagnostics failed on PD:   -:-:19

ID = 368
SEQUENCE NUMBER = 44311
TIME = 06-07-2020 02:35:28
LOCALIZED MESSAGE = Controller ID:  0  Power state change failed on PD   =   -:-:19  Previous   =   On  Current   =   Powersave

ID = 250
SEQUENCE NUMBER = 44310
TIME = 06-07-2020 02:35:27
LOCALIZED MESSAGE = Controller ID:  0  VD is now PARTIALLY DEGRADED   VD   1

ID = 81
SEQUENCE NUMBER = 44309
TIME = 06-07-2020 02:35:27
LOCALIZED MESSAGE = Controller ID:  0   State change on VD:   1      Previous   =   Optimal  Current   =       Partially Degraded

ID = 114
SEQUENCE NUMBER = 44308
TIME = 06-07-2020 02:35:27
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:19  Previous   =   Online      Current   =   Configured - shielded

ID = 87
SEQUENCE NUMBER = 44307
TIME = 06-07-2020 02:35:27
LOCALIZED MESSAGE = Controller ID:  0   PD Error:   -:-:19      ( Critical   250)

ID = 268
SEQUENCE NUMBER = 44306
TIME = 06-07-2020 02:35:26
LOCALIZED MESSAGE = Controller ID:  0  PD Reset:   PD       =   -:-:19,   Critical       =   3,   Path   =       0x5005076028C2916A

ID = 113
SEQUENCE NUMBER = 44302
TIME = 05-07-2020 18:52:25
LOCALIZED MESSAGE = Controller ID:  0   Unexpected sense:   PD       =   -:-:19Power on, reset, or bus device reset occurred,   CDB   =    0x8a 0x00 0x00 0x00 0x00 0x00 0x06 0x27 0x4b 0x00 0x00 0x00 0x01 0x00 0x00 0x00    ,   Sense   =    0x70 0x00 0x06 0x00 0x00 0x00 0x00 0x0a 0x00 0x00 0x00 0x00 0x29 0x00 0x00 0x00 0x00 0x00
 

Stefan75

Member
Jan 22, 2018
96
10
8
48
Switzerland
Did you ever find out what was going on here?

I have a LSI hardware raid6 with 17 1TB SATA disks in a Netapp DS4243.
Only the two Samsung HD103SI cause the "Power on, reset, or bus device reset occurred".
In my case the raid doesn't break though :)