ES Xeon Discussion


Wasmachineman_NL

Wittgenstein the Supercomputer FTW!
Aug 7, 2019
2,263
841
113
Are there any noteworthy ES Broadwell Xeons for X99? I refuse to pay **** you money for a 6950X that gets curbstomped by a 7500F.
 

RolloZ170

Well-Known Member
Apr 24, 2016
9,404
3,015
113
germany
Are there any noteworthy ES Broadwell Xeons for X99? I refuse to pay **** you money for a 6950X that gets curbstomped by a 7500F.
Production steppings:
QK3M 1650v4 ES2 (partly unlocked, ratio OC to 4200 all-core) (80-100 CNY on Goofish)
QK3S 1660v4 ES2 (partly unlocked, ratio OC to 4200 all-core) (160-404 CNY on Goofish)
These work on Gigabyte and ASRock boards and on the ASUS Rampage V Extreme.

B0 stepping:
QHVJ 1660v4 ES1 (99 CNY on Goofish)
Works on Gigabyte and ASRock boards.
 

Wasmachineman_NL

Wittgenstein the Supercomputer FTW!
Aug 7, 2019
2,263
841
113
Production steppings:
QK3M 1650v4 ES2 (partly unlocked, ratio OC to 4200 all-core) (80-100 CNY on Goofish)
QK3S 1660v4 ES2 (partly unlocked, ratio OC to 4200 all-core) (160-404 CNY on Goofish)
These work on Gigabyte and ASRock boards and on the ASUS Rampage V Extreme.

B0 stepping:
QHVJ 1660v4 ES1 (99 CNY on Goofish)
Works on Gigabyte and ASRock boards.
So nothing noteworthy and I'm better off getting a 6900K or 6950X instead. Great. Thanks Intel /s
 

liaobme

New Member
Aug 11, 2025
1
0
1
I got the same boards. This is the first revision, with only a Skylake BIOS. The later revision, C2030 (Cerberus), supports Cascade Lake, so maybe we can use the C2030 BIOS. Mine has two different BIOS images:
BIOS0:
50650,50651,50652,50654
BIOS1:
50650,50651,50652,50653,50654
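Those entries look like CPUID signatures written in hex (that is an assumption, the post does not say so). Under that assumption, here is a small Python sketch that splits each one into family, model, and stepping and shows what differs between the two images. The helper name `decode_cpuid` is made up for illustration.

```python
def decode_cpuid(sig_hex: str) -> dict:
    """Split a CPUID signature (leaf 1 EAX, given as hex text) into its fields."""
    eax = int(sig_hex, 16)
    stepping = eax & 0xF
    base_model = (eax >> 4) & 0xF
    base_family = (eax >> 8) & 0xF
    ext_model = (eax >> 16) & 0xF
    ext_family = (eax >> 20) & 0xFF
    # The extended family is only added when the base family is 0xF.
    family = base_family + ext_family if base_family == 0xF else base_family
    # The extended model is only used for base family 6 or 0xF.
    if base_family in (0x6, 0xF):
        model = (ext_model << 4) | base_model
    else:
        model = base_model
    return {"family": family, "model": model, "stepping": stepping}


# Lists as posted above, assumed to be hex CPUID signatures.
BIOS0 = ["50650", "50651", "50652", "50654"]
BIOS1 = ["50650", "50651", "50652", "50653", "50654"]

only_in_bios1 = sorted(set(BIOS1) - set(BIOS0))

for sig in BIOS1:
    fields = decode_cpuid(sig)
    print(sig, "family", fields["family"],
          "model", hex(fields["model"]), "stepping", fields["stepping"])
print("only in BIOS1:", only_in_bios1)
```

Read this way, every entry is family 6, model 0x55, steppings 0 to 4, and the only difference between the images is 50653. As far as I know, Cascade Lake uses the same family and model with higher steppings, which would fit the "Skylake only" observation.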
 

sersmile

New Member
Jun 7, 2025
5
1
3
Hello guys, looking for some help here. First, please forgive my stupidity.
I am running dual 8368 ES (QWAT) and have been getting MCEs for a week now. The error log in Linux looks like this:
Code:
Aug 10 05:26:41 astraTZ kernel: mce: [Hardware Error]: CPU 38: Machine Check: 0 Bank 26: 80000040020000b1
Aug 10 05:26:41 astraTZ kernel: mce: [Hardware Error]: TSC 29abee5a339 PPIN 5d4fa8cb73c77cc0
I asked Gemini what this means, and it told me that Bank 26 is related to a memory issue. I am not sure Gemini is correct, but I do have some memory issues on this platform. Please forgive my ignorance here. The issue I ran into is quite complex:
1. When I was first installing memory, I got distracted and accidentally broke one of the slots. I left that slot empty and populated every other slot.
2. After that, I could only get 14 memory slots working. It should have been 15, since only one slot is missing a module. I used a tool called Cockpit to inspect the memory info and found that 15 of 16 slots are reported as populated, but the module in one of the slots has an unknown size, which explains why my server only detects 14.
3. I was still unable to locate the real issue at that point. But since I started getting MCEs over the last week, two more of my memory modules can't be detected. I am starting to suspect that the issue is with the CPU (since the OS is reporting MCEs on the CPU at the same time).
Below is an image of my memory info captured by cockpit.
Also, there is a separate incident that might be related. Through my own carelessness, I bent ONE of the pins in the socket where the problem CPU is installed (this happened before I assembled the system). I paid a guy to repair the socket; he used tools to bend the pin back, but its angle is not exactly identical to the original. I think it is fine, since it is "almost" perfect and definitely not touching any other pins in the socket. Could this one imperfect pin cause my problem?
Any help is appreciated :)
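
For anyone reading the log above: the 64-bit value after "Bank 26:" is that bank's machine check status register, and its architectural fields can be unpacked by hand. Below is a minimal Python sketch, assuming the field layout from the Intel SDM (VAL in bit 63, UC in bit 61, corrected error count in bits 52:38, MCA error code in bits 15:0, with memory controller errors encoded as 000F 0000 1MMM CCCC). The name `decode_mci_status` is made up for illustration, the count field only means something on CPUs that report it, and an ES part may not follow the public documentation exactly.

```python
# Memory transaction types for the MMM field of a memory controller error code.
MEM_OPS = {
    0b000: "generic undefined request",
    0b001: "memory read",
    0b010: "memory write",
    0b011: "address/command",
    0b100: "memory scrubbing",
}


def decode_mci_status(status: int) -> dict:
    """Unpack the architectural fields of a machine check status value."""
    mcacod = status & 0xFFFF
    info = {
        "valid": bool((status >> 63) & 1),
        "overflow": bool((status >> 62) & 1),
        "uncorrected": bool((status >> 61) & 1),
        "addr_valid": bool((status >> 58) & 1),
        "context_corrupt": bool((status >> 57) & 1),
        "corrected_count": (status >> 38) & 0x7FFF,  # bits 52:38
        "mscod": (status >> 16) & 0xFFFF,            # model-specific code
        "mcacod": mcacod,
        "memory_error": None,
    }
    # Memory controller errors match 000F 0000 1MMM CCCC (F is ignored here).
    if mcacod & 0xEF80 == 0x0080:
        info["memory_error"] = {
            "operation": MEM_OPS.get((mcacod >> 4) & 0x7, "reserved"),
            "channel": mcacod & 0xF,  # 0xF means channel not specified
        }
    return info


# Status value copied from the kernel log above.
result = decode_mci_status(0x80000040020000B1)
for key, value in result.items():
    print(f"{key}: {value}")
```

On this value the sketch reports a valid, corrected (not uncorrected) error whose code falls in the memory controller class, address/command type, channel 1. That would be consistent with a memory-side problem, though what bank 26 maps to is model-specific and is not decoded here.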
 


sersmile

New Member
Jun 7, 2025
5
1
3
I was still unable to locate the real issue at that point. But since I started getting MCEs over the last week, two more of my memory modules can't be detected.
guys this is so weird. I rebooted my computer and now it detects 15 memory modules…
 

RolloZ170

Well-Known Member
Apr 24, 2016
9,404
3,015
113
germany
guys this is so weird. I rebooted my computer and now it detects 15 memory modules…
Some ECC errors can accumulate over time (against a threshold) and result in that slot being mapped out (until maintenance/replacement).
Clear CMOS.
Early ES steppings B0/C0 are known to lose memory channels over time (they do not come back). I hope this issue does not apply to D0s.
 

GadflyII

New Member
Aug 14, 2025
7
3
3
I have never messed with ES Xeons before (only retail units professionally), and I foolishly made a purchase off eBay.

I purchased two Xeon 8592+ ES "Q2SR+" and a Gigabyte MS73-HB1 motherboard. In reading this thread, it is clear that I should have done a bit more research before I clicked the buy button. Oh well, live and learn, right? I guess I didn't think that some of the features or accelerators would be locked or downgraded to a lower level. I also didn't really understand the steppings, so I won't know exactly what steppings I am going to get until they show up, as I am guessing that the "Q2SR+" means that they could be any stepping after the Q2SR, and that is not a stepping.

I don't care too much about most features. I am going to use this as a workstation for running and training some AI models (not LLMs, image/video creation, etc.) for a new project. I went with these CPUs for the on-die AMX and other accelerators, and I guess all I really want is that running on as many cores as possible, but the more features and accelerators I can unlock the better.

So, how boned am I here? I know... I'm a dumbass who clicked buy before doing my research. Is this salvageable with a modded BIOS, etc., or do I need to go shopping again?
 

Andrix

New Member
Mar 15, 2025
14
11
3
I have never messed with ES Xeons before (only retail units professionally), and I foolishly made a purchase off eBay.

I purchased two Xeon 8592+ ES "Q2SR+" and a Gigabyte MS73-HB1 motherboard. In reading this thread, it is clear that I should have done a bit more research before I clicked the buy button. Oh well, live and learn right?
Well, you are clearly struck by buyer's remorse and are likely to change your mind about the purchase at least once.

I am guessing that the "Q2SR+" means that they could be any stepping after the Q2SR, and that is not a stepping.
"...Q2SR+ Gigabyte MS73-HB1..." most likely lacks a space and should be read as "...Q2SR and Gigabyte MS73-HB1...". You get to see quite a bit of sloppiness from hardware sellers on eBay. In other words, I tend to believe that you bought specifically Q2SR and that there is no such thing as Q2SR+.

I went with these CPUs for the on-die AMX
And you should be getting it with Q2SRs unless they are broken. I elaborated on it in another thread.
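
Once the CPUs are in a running Linux box, one quick way to see whether AMX is exposed is to look for the `amx_tile`, `amx_bf16`, and `amx_int8` flags in `/proc/cpuinfo`. A minimal Python sketch follows; the function name `amx_flags_present` is hypothetical, and a flag showing up only means the kernel sees the feature, not that a given framework will actually use it.

```python
from pathlib import Path

# CPU feature flags the Linux kernel reports for AMX-capable x86 CPUs.
AMX_FLAGS = ("amx_tile", "amx_bf16", "amx_int8")


def amx_flags_present(cpuinfo_text: str) -> dict:
    """Return which AMX flags appear on any 'flags' line of cpuinfo text."""
    flags = set()
    for line in cpuinfo_text.splitlines():
        # Lines look like "flags\t\t: fpu vme ... amx_tile ..."
        if line.startswith("flags") and ":" in line:
            flags.update(line.split(":", 1)[1].split())
    return {name: name in flags for name in AMX_FLAGS}


# Made-up sample text so the sketch runs anywhere.
sample = "processor\t: 0\nflags\t\t: fpu sse2 avx512f amx_bf16 amx_tile amx_int8\n"
print(amx_flags_present(sample))

# On a real Linux machine, read the live file instead.
cpuinfo = Path("/proc/cpuinfo")
if cpuinfo.exists():
    print(amx_flags_present(cpuinfo.read_text()))
```

If all three come back `False` on the real hardware, that would point at the feature being fused off or hidden by the BIOS rather than at a software problem.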

So, how boned am I here? I know... I'm a dumbass who clicked buy before doing my research. Is this salvageable with a modded BIOS, etc., or do I need to go shopping again?
I wouldn't be surprised if the motherboard arrives with an already modified BIOS and the combination works fine as is. If it doesn't, you're sorted out apparently. :)
 

RolloZ170

Well-Known Member
Apr 24, 2016
9,404
3,015
113
germany
I wouldn't be surprised if the motherboard arrives with an already modified BIOS and the combination works fine as is.
I want to make clear that a modded BIOS is not required; it is only needed if you want to use the current official BIOS.
The modded BIOS or the how-to should have arrived in China by now, though.
I have not kept it secret, but I delayed it to keep the EMR A0 ES prices low for as long as possible.
 

GadflyII

New Member
Aug 14, 2025
7
3
3
Well, you are clearly struck by buyer's remorse and are likely to change your mind about the purchase at least once.

"...Q2SR+ Gigabyte MS73-HB1..." most likely lacks a space and should be read as "...Q2SR and Gigabyte MS73-HB1...". You get to see quite a bit of sloppiness from hardware sellers on eBay. In other words, I tend to believe that you bought specifically Q2SR and that there is no such thing as Q2SR+.

And you should be getting it with Q2SRs unless they are broken. I elaborated on it in another thread.

I wouldn't be surprised if the motherboard arrives with an already modified BIOS and the combination works fine as is. If it doesn't, you're sorted out apparently. :)
After speaking to @RolloZ170, I am much more hopeful. I think this is going to work really well, and if it does, I am immediately buying a second setup. Each will let me run 6 GPUs with 128 cores of AMX, and two servers would let me run one training batch every ~22 days. For contrast, running one training batch on hosted setups would take ~30 days, and each batch costs about $50,000 - $80,000 in hardware fees (which I obviously can't afford :D).

If anyone has suggestions on where to find some cheap used memory, I would appreciate it. I am looking for at least 8 modules to start and will expand to 16 later. I will most likely go with 48GB modules. I could probably get away with 8x 32GB per socket, but it might be a little tight.