Samsung PM983 dropped out of ESXi 7

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

986box

Active Member
Oct 14, 2017
239
43
28
44
Picked up Samsung PM983 recently and used it for a few months. It dropped out of ESXi 7.0 yesterday without warning.
CrystalDisk reported good health as before but space reported is off. It should be 960GB.

What can caused it to drop off?
 

Attachments

oneplane

Well-Known Member
Jul 23, 2021
846
484
63
It's health is not Good, but CrystialDisk is not smart enough to figure it out. Your SSD is dead and running of the fallback bootrom, or using an unconfigured bootrom because it can't load the device-specific configuration.

As for if it can be repaired or revived, I don't know. What your'e seeing on your screen is basically an SSD that says: "my CPU is working, but that's about it".
 

oneplane

Well-Known Member
Jul 23, 2021
846
484
63
None of the consumer tools will help you here. Either drive firmware recovery tools or manufacturer tools, neither of which are publicly available.
 

986box

Active Member
Oct 14, 2017
239
43
28
44
What would cause the problem? overheating? Trash the drive? I have another on the way.
 

oneplane

Well-Known Member
Jul 23, 2021
846
484
63
Can be anything. Bad NAND, bad DRAM, bad controller, bad PCB, bad firmware. It's impossible to tell since you can no longer communicate with anything in the drive except the bootrom and that won't tell you anything. This requires physical inspection.
 

gb00s

Well-Known Member
Jul 25, 2018
1,197
603
113
Poland
None of the consumer tools will help you here. Either drive firmware recovery tools or manufacturer tools, neither of which are publicly available.
That's not quite accurate >> PC 3000 with all its variants, as one of them, is available from here

EDIT: Added price list if someone is interested
 
Last edited:
  • Like
Reactions: oneplane

oneplane

Well-Known Member
Jul 23, 2021
846
484
63
That's not quite accurate >> PC 3000 with all its variants, as one of them, is available from here
I suppose the commercial data and drive recovery tools might have some options, but even Flash Extractor, Arvika's database, Atoll, MRTLab, Rusolut, DFL etc (and probably a bunch more that used to be mechanical only, like SalvationData?) all need a flash map, firmware modules and either chip off board adapter or special interface card to do anything.

I did wonder if maybe the drive has a simple serial interface (most do) where some more details as to what is exactly broken can be gathered (i.e. unable to even talk to the NAND vs. talking to the NAND but some data being corrupted). Maybe it's something dumb like a memory overflow in a SMART table. Or maybe the bootrom can't find the main firmware and stops loading and it can be fixed by simply re-flashing the common firmware, leaving the device specific configuration and runtime data as-is. But it's all guessing at this stage o_O And I don't think 986box has a serial interface and logic probe or logic analyser on hand to find communication pins:p would be a fun exercise tho.
 
  • Like
Reactions: Stephan and gb00s

986box

Active Member
Oct 14, 2017
239
43
28
44
And I don't think 986box has a serial interface and logic probe or logic analyser on hand to find communication pins:pwould be a fun exercise tho.
Yeah, I do not have want to use whatever resources needed to figure this out. Unless it’s a quick 10 mins and free s/w.

in addition, I don’t have important VMs on this storage. So the loss was not a big deal.

luckily,the rest are on the consumer Samsung 970 I bought off here a few years ago.
 

oneplane

Well-Known Member
Jul 23, 2021
846
484
63
Yeah, I do not have want to use whatever resources needed to figure this out. Unless it’s a quick 10 mins and free s/w.

in addition, I don’t have important VMs on this storage. So the loss was not a big deal.

luckily,the rest are on the consumer Samsung 970 I bought off here a few years ago.
In that case: trash it.
 

Stephan

Well-Known Member
Apr 21, 2017
945
714
93
Germany
I commend Samsung for teaching alot of people all over the world lately about the value of having a working backup solution. /s
 

111alan

Active Member
Mar 11, 2019
291
109
43
Haerbing Institution of Technology
Yeah, I do not have want to use whatever resources needed to figure this out. Unless it’s a quick 10 mins and free s/w.

in addition, I don’t have important VMs on this storage. So the loss was not a big deal.

luckily,the rest are on the consumer Samsung 970 I bought off here a few years ago.
If you still want the drive, try security erasing the drive as is using nvmecli. There is a chance that the software fault may be cleared and the drive may come back to life.

Then you can doe some full drive reads and writes to see if you can trust it with data.
 

986box

Active Member
Oct 14, 2017
239
43
28
44
I do have backup running. The VMs on the drive were inactive anyway.
Had pulled the drive off the server to check on a different machine. Will give the nvmecli when I have some time.

Had been looking at picking up a few SSDs to add. May pickup extra one for datastore.

Original idea for the post is understand what causes the failure. So not to repeat again.