Dell Precision 7920 & 7820: Unfathomable Miscreants

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

DecentSystems

New Member
Jan 29, 2024
8
1
3
Words fail me in dealing with these damn things. Here follows a choice selection of merely the most perplexing and baffling of issues...

1) You may never again see your screen light up after a RAM or CPU upgrade.

2) RAM errors can be CPU-dependent; simply leaving you screenless on one chip, but present a blinking led on another. Now and then.

3) Sometimes your system will start and shut down, a few times... because you added a PCIE card... other times it won't care all RAM and CPU's have been upgraded.

4) When you connect the power you MUST disconnect lest it belches at you with the intrusion alarm, it will power on, spin up drives... and then shut down, after all. It merely completes those steps during which hardware failures are most likely, and quits. Each time. Well - almost ;) Two times (out of about 11 or so) it did NOT spring to life upon inserting the power cable. Why? Dude... if you need to ask, this PC ain't for you my friend!

5) You will not know there is a problem, what it might be, and why. Like now.... NumLock is on (and I can indeed switch it on and off), the machine responds to the power button right away (it is, I daresay, not crashed!) but there is no display on the Quadro M4000.

6) Random guesswork to just "try something" is not an option - it's the only thing left to do aside of throwing the damn thing (or yourself) out of the window.

7) I happen to have both; Now - the 7820... I bought as presumed faulty, so I can cut it some slack... well - until I replaced its motherboard. Behavior became somewhat more predictable... but still bonkers. Add a second CPU? Nope - this will fail for reasons unknowable.

8) The 7920 has been sitting here all this time from when I started to write this post. It is busily blinking what I think is the HDD led, and for all intents and purposes it looks like it is in Windows just without display; But when I hit the power button - 4 secs and it shuts down. This itself is a random response because the previous time it shut down instantly. No, it's not sleeping, the led is not "breathing"...not this time, for some reason unknown.

9) Whoever implemented DisplayPort really needs to explain how even by version 1.4a the cable still must be connected during power on, or else you can forget getting any video/signal until the next restart. It is the single most enraging regression imaginable. Not to mention guessing which port is purportedly the first one. No, nVidia has NO "rule". M4000 it's the bottom one. but on other cards it can be at the top. Or the one on the left-hand slot... this time!

10) Intrusion alarm. I switched the sound off, "Silent Mode" or somesuch... or so I was led to believe... apparently not - it still switches on, even (briefly) with the power unplugged... The T7920 is filled to the brim with thermometers (2 for each drive cage, and more) - but actually using them to detect if there's a thermal issue - no, that... that's too difficult. C'mon now, that would make SENSE! Preposterous reasonable nonsense! These are not AMD Durons with fragile crumbling cores which, when deprived of a heatsink, shall turn to ash in seconds; I think they can handle the case being open for a moment or two. The SuperMicro equivalent on which I have no choice but to continue working can deal with it... why not this overpriced Dell?

11) Apparently it's of great importance whether or not Secure Boot is enabled, as this can cause the machine to "decide" some security violation has come to pass. Like using a non-Dell video card. The horror! Indeed it is so horrible we shall not tell anyone - let 'em guess on Reddit and see what happens. That'll teach them to try and save $2000 and deny us our RRP sucker premium! Just as well, the machine can't.. you know - figure out that, heyyy I suspect the user might not be entirely satisfied with the screen not coming back on - maybe we should offer an emergency mode in which we temporarily switch it off to alert the customer that therein the problem lies...?

12) I may not, and this is true, use DIMMs of different sizes. Such a configuration is not officially supported. 1x16 and 2x32 you say??! Heresy! Burn at the stake ye will! You shall order your Official Dell Memory Modules for $499 per MB from US, oh peasant, and don't you even think of keeping your existing modules, insolent cheapskate!

1716532349187.png

But most of all - who in their right mind, when designing the... what... 8th? 10th? generation of their FLAGSHIP desktop Product for professionals... would go with "Yeah, let's make it... go dark now and then for reasons we will not explain. The error, whatever it might happen to be, shall not be logged, either. We shall thus make sure the dumb sucker - I mean, the customer ;) has a feeling of dread, insecurity and stressful ignorance... make him have thoughts like - hmm, well, that LGA3647 socket IS very fragile, maybe I... bent a pin...? Perhaps I should reseat (and again run the risk of damaging) this ultra-fidgety CPU...?"

"What if I put the video card in a different slot?"
THE VERY FACT THIS IS EVEN A STEP TO BE CONTEMPLATED is a STRONG indicator *someone* designed things VERY badly. This is not the era of 486 with VESA local bus cards which actually require exact timings lest they literally can't function....
This was 2019-2020, machines so advanced they'd make a T1000 blush... and THIS is whence we have regressed?? And this is not (at the time) cutting edge PCIE 4... no no... even the 2nd Generation Intel scalable was still all PCIE 3. Tried and tested, cut and dried. And yet... the web's abrim with prayers to Randomitor, Overlord of Happenstance, hoping that [the other slot] will appease its capricious dictates.

And finally...
Why DID Intel make PIN1 so hard to readily identify? Like, are you telling me THIS was the ONLY feasible socket design....? Not a SINGLE pin could have been arranged differently... to make the orientation readily self-evident? They got half a dozen deviations from the pin pattern on the socket... but replicate them on the CPU...? - no way! That would, like, totally take away the mystery, dude!

Anyway - I am going back to it. I am honestly believing it might have helped that it got to... "rest a little now"... maybe it will be in a better mood, who knows.
 
Last edited:
  • Like
Reactions: SnJ9MX

DecentSystems

New Member
Jan 29, 2024
8
1
3
Well hallelujah! After another dozen or so assorted attempts and variations, it does now appear I have been graced by the Dell with a display. The Quadro M4000 gave no dice - upper slot (so on CPU2), lower slot 4.. but the K2200 in slot 2 (I think) - yes, that, somehow worked...which is weird coz the K2200 did not work the previous times, neither in the T7820 nor in this T7920....

How am I to trust this machine with my daily workload, all my data, etc.. Hey - at least I got the PERC H730P 2GB working (in the previous single CPU setup with a Silver 4114, so I guess you never know...) so I guess I can move the disk system in its entirety if the 7920 will start having attitude problems again ;)

As things stand, we got ourselves here a 2x Xeon 6140 with 4x 16GB DDR4-2666 RAM (came originally in T7820)

Alas, 64GB ram is for toddlers :D Has anyone managed to get a T7920 to work with more than one DIMM size inserted? I updated the Bios to the newest one (April this year?) so it should be in its best possible condition. On the X11DAI-N I've had 320GB installed (8x32, 4x16) which is ideal!

In terms of deductions/impressions...

1) I suspect it is "good" sign if, after inserting/removing [some hardware] or [changing slot] of existing hardware, the system shuts itself down and starts afresh. I assume this is what one might call a successful reconfiguration;
2) This need not, however, cause a PC not showing any video to start doing so;
3) Changing the video card induces a more "thorough" reset than merely shuffling (other) stuff around, or changing video card slots;
4) The video card is a special case which does not impact boot like other cards might;
5) These systems can end up in a kind of "unyielding" state, which appears to be induced when one:
- does not shut it down correctly and/or forces the intrusion alarm to do it;
- introduces multiple new hardware components in one hit (especially if that happens after "a"...)
6) Above-mentioned state is characterized by an absence of self-induced reboots after hardware changes;
7) One needs to be patient with these damn things, but anything above, say, 2-3 minutes is never correct and indicates a problem;
 
Last edited:

DecentSystems

New Member
Jan 29, 2024
8
1
3
Getting the whole gamut of Mystery & Chance events here... but, overall, ze Machine is doing well. Upped the RAM to 4x32 Samsung - the Lordship was pleased and gracefully made the DIMMs its own, the HPE sticker proving to be no obstacle. Next up, reactivated PERC 730P card, which the computing apparatus duly initialized and allowed to be.

However, we are not het at happy bunnies, rainbows and smiling unicorns. I bought a couple expansion bays (the connector part, the ones for U.2/PCIE drives) and attached the fan and thermometer from them... (they were sold as "fans" so I got them for next to nothing) and unfortunately we now have a FB4 Fan malfunction, even though, clearly, there IS no FB4 fan connected. Found this error elsewhere, in that case it was caused by the intrusion alarm shutting the system down, presenting it upon restart. I'll remove the thermometer, see if it'll help later.

What's left...? Swap video card (to nVidia RTX A2000, now in SuperMicro machine), add SoundBlaster Z SE and a dual port Mellanox 40GBe card because of course no Network can hope to existing without 40Gbit connections.

--------------------------------

Meanwhile I also realized this is in fact the most advanced and perhaps, possibly, the LAST Dual CPU workstation Dell has or will ever make! The current crop of Precisions is all single socket... although Lenovo did release the ThinkStation PX which does have dual Xeons... so who knows :)
 

fcbrants

New Member
Jul 9, 2024
2
0
1
Silly question for you - as it appears from reading your posts that you've seen this beastie (Precision 7820) in various forms of dress and undress, I would like to ask if you've encountered the following scenario:

Upon downloading & installing the following BIOS versions:

1. Current (2.41.0)
2. One version past currently installed version (currently installed is 2.31.0, so tried 2.31.1),
3. An older but apparently significant update (2.6.3)

... and trying every means of installing said BIOS updates:

1. Support Assist
2. Direct Download & Install from within Windows 10
3. F12 Boot Options -> BIOS Update

I am greeted by (your favorite...):

1. Blank Screen
2. Normal (white, no flashes) power button
3. A VERY slowly flashing (like ~ 30 seconds...) numlock light on the keyboard.

This machine is a new-to-me refurb machine from Dell Refurbished - I have changed literally nothing (RAM arrives tomorrow...) & it's a fresh install of Windows 10 for Workstations.

I have made sure Bitlocker is NOT enabled (it wasn't) & reset the BIOS to defaults.

Please let me know if anybody has seen and / or resolved this problem.

Thank you!!

Franko

Getting the whole gamut of Mystery & Chance events here...

<Snip>
 

RolloZ170

Well-Known Member
Apr 24, 2016
7,773
2,454
113
And finally... Why DID Intel make PIN1 so hard to readily identify? Like, are you telling me THIS was the ONLY feasible socket design....? Not a SINGLE pin could have been arranged differently... to make the orientation readily self-evident?
sorry for the late reply:
there is a golden triangle on the CPU. and a triangle on the socket assembly, but you don't need to know
there are two bolts. one big, one small. the CPU in the Carrier mounted on the Cooler doesn't let you install it wrong.
 

edggeee

New Member
Sep 10, 2024
2
1
1
I had the same issue with my Dell 7920. I believe I had to:
- Utilize only a single GPU on CPU 0​
- Installed a fan converter cable on CPU 1 for Noctua Heatsink(this was the biggest hurdle since fan pin is different from DELL)​
- Clear CMOS and remove power cord for 30 minutes​
- Reinstall windows in UEFI (make sure legacy is disabled)​
Afterwards, you can install a secondary GPU on GPU1. I currently have an RTX 2080 super, and RTX 2070 super.
 

Attachments

Last edited:

BusError

Member
Jul 17, 2024
45
9
8
I've been running a 7910 for a few years and it's been working very well! Was pondering wether it needed upgrading but reading all of this seems to have cured this particular bout of Upgradisis :)

Mine has 768GB of ram, 8*1.8TB SAS disks, the odd PCI card (HBA, NVME..), 10Gbe and is a linux compilation farm.
 

Dev_Mgr

Active Member
Sep 20, 2014
167
61
28
Texas
I've got a 7920 with 2 x 6138 and 8 x 32GB for running VMs for labs to learn more VMware stuff (next project is learning NSX).
 

aural

New Member
Feb 2, 2025
1
0
1
I had the same issue with my Dell 7920. I believe I had to:
- Utilize only a single GPU on CPU 0​
- Installed a fan converter cable on CPU 1 for Noctua Heatsink(this was the biggest hurdle since fan pin is different from DELL)​
- Clear CMOS and remove power cord for 30 minutes​
- Reinstall windows in UEFI (make sure legacy is disabled)​
Afterwards, you can install a secondary GPU on GPU1. I currently have an RTX 2080 super, and RTX 2070 super.
Can you let me know which noctua is used, and if anything needs to be mod in order to use it? Thank you!