I'll be doing this same upgrade when my Q2SR gets here. I'm super excited for this build. What is the bios mod that you needed to do?
@RolloZ170 helped me. disable acm. exist second variant with enable acm but must be older version.I'll be doing this same upgrade when my Q2SR gets here. I'm super excited for this build. What is the bios mod that you needed to do?
@RolloZ170 is the most educated about all of this Xeon stuff hah.@RolloZ170 helped me. disable acm. exist second variant with enable acm but must be older version.
Do you happen to have a copy of those instructions to disable acm?@RolloZ170 helped me. disable acm. exist second variant with enable acm but must be older version.
Emr is a nice upgrade from spr. Too bad they couldn't be run on motherboards asus w790 sage ace. But if you don't have a problem with c741 boards, this is an opportunity to get very good cpu for little money.
2400mhzWhat is the sustained all-core AVX512 frequency for Q2SR?
Turbo Ratio Limits - IA/SSE, Fused: 40x (1-32c), 34x (33-48c), 27x (49-58c), 26x (59-64c)
Turbo Ratio Limits - IA/SSE, Resolved: 40x (1-32c), 34x (33-48c), 27x (49-58c), 26x (59-64c)
Turbo Ratio Limits - AVX2, Fused: 38x (1-32c), 32x (33-48c), 26x (49-54c), 25x (55-64c)
Turbo Ratio Limits - AVX2, Resolved: 38x (1-32c), 32x (33-48c), 26x (49-54c), 25x (55-64c)
Turbo Ratio Limits - AVX-512, Fused: 35x (1-32c), 29x (33-48c), 25x (49-54c), 24x (55-64c)
Turbo Ratio Limits - AVX-512, Resolved: 35x (1-32c), 29x (33-48c), 25x (49-54c), 24x (55-64c)
Turbo Ratio Limits - TMUL, Fused: 35x (1-32c), 29x (33-48c), 23x (49-54c), 22x (55-64c)
Turbo Ratio Limits - TMUL, Resolved: 35x (1-32c), 29x (33-48c), 23x (49-54c), 22x (55-64c)
Thanks, but have you actually verified that the core clock reaches 2400mhz in an AVX512 workload? It doesn't work this way for QYFS, at least not with the standard power limit settings. A hwinfo screenshot shows2400mhz
Code:Turbo Ratio Limits - AVX-512, Fused: 35x (1-32c), 29x (33-48c), 25x (49-54c), 24x (55-64c) Turbo Ratio Limits - AVX-512, Resolved: 35x (1-32c), 29x (33-48c), 25x (49-54c), 24x (55-64c)
Turbo Ratio Limits - AVX-512, Fused: 35x (1-28c), 29x (29-42c), 24x (43-50c), 23x (51-56c)
Turbo Ratio Limits - AVX-512, Resolved: 35x (1-28c), 29x (29-42c), 24x (43-50c), 23x (51-56c)
$ grep MHz /proc/cpuinfo | sort -n -k 4
cpu MHz : 1739.738
cpu MHz : 1797.958
cpu MHz : 1797.960
...
cpu MHz : 1798.842
cpu MHz : 1799.059
cpu MHz : 1799.105
cpu MHz : 1800.179
cpu MHz : 2573.705
cpu MHz : 2800.000
limit is only if all cores run AVX512 which is rarely happen. during AVX512 heavy workload there is no space for core clock asking thought,The turbo limit should be 2.3 GHz, whereas I see the following during a linpack benchmark run:
| CPU | linpack (GFLOPS) | mp_linpack (GFLOPS) | AVX512 freq. (GHz) | Base freq. (GHz) |
| Q2SR (64c) | 3800 | 4000 | 1.9-2.2 | 1.7 |
| QYFS (56c) | 2740 | 3070 | 1.8 | 1.9 |
| 8480+ (56c) | 2920 | 3120 | 1.9 | 2.0 |
power limit for Xeon's is strictly TDP. TDP can be exceeded for a limited time(max.448 sec.).The CPU power consumption (the turbostat reading) initially reaches 380W staying like this for a while and then reduces to 350W. If anyone can explain this, your comment would be very welcome.

PL1 TimeWindow e.g. 128 is not one shoot forever. with some healing (internal calculator) the time can start again, or a fraction of.The workload consists of 4 smaller runs about 40s each. The PL1 time window is set to 128s, and I sketched what I believe was the first one. The average power was 350W (TDP). I assume the next window should start right after those 128s. But then those 380W should have lasted a bit longer in the second window. Does the temperature enter the conversation here or is it completely irrelevant?
I'm not sure if it even has AVX512 for all x64 cores, as far as I know normally Intel CPUs (most Xeons) have two AVX512 execution units per CPU (hardly per tile/chiplet)limit is only if all cores run AVX512 which is rarely happen
you maybe mix that with TMUL / QAT Accelerators.I'm not sure if it even has AVX512 for all x64 cores, as far as I know normally Intel CPUs (most Xeons) have two AVX512 execution units per CPU (hardly per tile/chiplet)
edit: specification for 8592+ clearly states:
# of AVX-512 FMA Units = 2
Indeed, thanks for the link.you maybe mix that with TMUL / QAT Accelerators.
AVX512 is per core.
Indeed, thanks for the link.
"Intel Xeon Scalable processors have two FMA units per core to combine multiplication and addition into a single operation and accelerate computation speeds."
note that there may be two AVX512 FMA units. but all have AVX512 instruction set.normally Intel CPUs (most Xeons) have two AVX512 execution units per CPU
Thanks for the reminder. I saw even a slight gain (~50 GFLOPS) after disabling hyperthreads.As I was informed - we have not seen a drop in the GFLops rating when the HT was disabled.
So even 1 thread on 1 core can fully utilize the vector unit(s) present there.
Yeah, FMA is quite important as it doubles instructions per cycle resulting in 32 FLOPs per cycle per core.note that there may be two AVX512 FMA units. but all have AVX512 instruction set.
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WC00R2R2 80000 1536 2 1 7.18 4.75480e+04
WC00R2R2 80000 1536 2 1 7.17 4.75755e+04
WC00R2R2 80000 1536 2 1 7.20 4.74291e+04
WC00R2R2 120000 1536 2 1 22.20 5.18854e+04
WC00R2R2 160000 1536 2 1 49.83 5.48036e+04
WC00R2R2 200000 1536 2 1 96.42 5.53152e+04
WC00R2R2 200000 1536 2 1 95.54 5.58245e+04
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WC00R2R2 80000 1536 1 1 10.47 3.26018e+04
WC00R2R2 80000 1536 1 1 10.30 3.31345e+04
WC00R2R2 80000 1536 1 1 10.67 3.19953e+04
WC00R2R2 120000 1536 1 1 36.01 3.19959e+04
WC00R2R2 120000 1536 1 1 33.63 3.42528e+04
WC00R2R2 120000 1536 1 1 33.93 3.39565e+04
WC00R2R2 160000 1536 1 1 72.98 3.74185e+04
WC00R2R2 160000 1536 1 1 74.39 3.67090e+04
WC00R2R2 160000 1536 1 1 76.39 3.57455e+04
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WC00R2R2 80000 1536 1 1 9.87 3.45798e+04
WC00R2R2 80000 1536 1 1 9.94 3.43259e+04
WC00R2R2 120000 1536 1 1 29.18 3.94775e+04
WC00R2R2 120000 1536 1 1 30.00 3.83982e+04
WC00R2R2 160000 1536 1 1 67.36 4.05389e+04
WC00R2R2 160000 1536 1 1 66.23 4.12296e+04