Hello,
I've done quite a few sever builds over the years, but this is my first Supermicro and first Xeon 3rd gen. This is a new home NAS/App server build and I'm still at base level (no cards or drives added) so I've got:
My main mystery -- CPU overheating:
The CPU will overheat and literally hit thermal shutdown just sitting in setup (!) on the Supermicro. Fans are all operational and I feel good airflow. The CPU cooler has a good interface w/ thermal paste -- it definitely is pulling heat and gets hot. The two 80mm fans are under 2" away and exhausting fine.
For some reason I don't see any CPU temp stats in setup (am I missing something?) and they don't seem to be available via the BMC while in setup -- so basically I'm blind, but it will thermal and shutdown if left for several minutes, with an event logged in the BMC.
So, to get sensor stats available via the BMC I decided to just get it to boot up to something, so I connected TrueNAS install media and just let it sit at the main install menu. As soon as that boots up, I can see the temperature start (very very high like >90C) but then quickly fall back down to reasonably normal idle temps (44-46C with case cover off idling).
So it's acting CPU PM is basically non-existent until booted into some form of OS controlling it(?).
What I've tried:
Anyway, any thoughts would be really appreciated here as I'm out of ideas. Obviously I didn't buy it to sit in setup and it appears it should work fine when booted, but the idea that I have a CPU heat time bomb outside a booted OS doesn't sit well at all.
Minor question:
In trying to figure this out, I noticed the RAM was running at some lower speed (rather than 3200Mhz) when set to auto -- which I've seen before on other builds. So I tried setting it to 3200Mhz manually. In the BMC I see it running at 2666Mhz though. Not a huge deal, but I'm curious why that might be. Just falling back because it doesn't see it as capable or potentially some other setting?
Sorry this got long, and thanks in advance for any thoughts.
-James
I've done quite a few sever builds over the years, but this is my first Supermicro and first Xeon 3rd gen. This is a new home NAS/App server build and I'm still at base level (no cards or drives added) so I've got:
- Supermicro X12SPL-LN4F (LGA-4189)
- Intel Xeon Silver 4310 (12 Core)
- Dynatron N8 passive CPU cooler
- 2 x Noctua NF-R8 80mm PWM fans (less than 2" away exhausting heat)
- 128GB (4x32G) NEMIX DDR4-3200 ECC RDIMM 2Rx4 in slots A,C,E,G
My main mystery -- CPU overheating:
The CPU will overheat and literally hit thermal shutdown just sitting in setup (!) on the Supermicro. Fans are all operational and I feel good airflow. The CPU cooler has a good interface w/ thermal paste -- it definitely is pulling heat and gets hot. The two 80mm fans are under 2" away and exhausting fine.
For some reason I don't see any CPU temp stats in setup (am I missing something?) and they don't seem to be available via the BMC while in setup -- so basically I'm blind, but it will thermal and shutdown if left for several minutes, with an event logged in the BMC.
So, to get sensor stats available via the BMC I decided to just get it to boot up to something, so I connected TrueNAS install media and just let it sit at the main install menu. As soon as that boots up, I can see the temperature start (very very high like >90C) but then quickly fall back down to reasonably normal idle temps (44-46C with case cover off idling).
So it's acting CPU PM is basically non-existent until booted into some form of OS controlling it(?).
What I've tried:
- Pulled the CPU just to make sure the interface was good - seemed fine and re-seated everything very carefully
- Starting with optimized defaults in Supermicro setup (reset several times)
- Updated the BMC firmware to latest
- Updated the BIOS firmware to latest
- A few settings related to CPU PM, although this is definitely beyond my knowledge
Anyway, any thoughts would be really appreciated here as I'm out of ideas. Obviously I didn't buy it to sit in setup and it appears it should work fine when booted, but the idea that I have a CPU heat time bomb outside a booted OS doesn't sit well at all.
Minor question:
In trying to figure this out, I noticed the RAM was running at some lower speed (rather than 3200Mhz) when set to auto -- which I've seen before on other builds. So I tried setting it to 3200Mhz manually. In the BMC I see it running at 2666Mhz though. Not a huge deal, but I'm curious why that might be. Just falling back because it doesn't see it as capable or potentially some other setting?
Sorry this got long, and thanks in advance for any thoughts.
-James