Stephan submitted a new resource:
Machine Check Exception (mce) workaround - Don't throw away that ebay Xeon just yet
You bought a cheap off-roadmap Intel Xeon CPU from somewhere, but the hardware crashes and reboots, even when idle. You realize the CPU might have been thrown out of the hyperscaler's datacenter for a reason. That reason?
Luckily, your CPU has extensive diagnostics and your Linux distribution supports "pstore" crash saving. In the saved dmesg* and mce* files under /sys/fs/pstore/ you find something like this:
Code:
mce: [Hardware Error]: CPU 2: Machine Check...
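If you'd rather not open each pstore file by hand, the lookup described above can be sketched in a few lines of Python. This is only a rough sketch under some assumptions: the function name find_hardware_errors is made up for illustration, the files are assumed to be readable as plain text (you will usually need root), and on some distributions a service may already have moved the records out of /sys/fs/pstore/ to another location.

```python
from pathlib import Path


def find_hardware_errors(pstore_dir="/sys/fs/pstore"):
    """Return (filename, line) pairs for each '[Hardware Error]' line found.

    Hypothetical helper: scans only the dmesg* and mce* files mentioned above.
    """
    hits = []
    root = Path(pstore_dir)
    if not root.is_dir():
        return hits  # pstore not mounted, or nothing saved

    for pattern in ("dmesg*", "mce*"):
        for path in sorted(root.glob(pattern)):
            try:
                # Assumption: the record is plain text; undecodable bytes
                # are replaced instead of raising an error.
                text = path.read_text(errors="replace")
            except OSError:
                continue  # unreadable (permissions) or not a regular file
            for line in text.splitlines():
                if "[Hardware Error]" in line:
                    hits.append((path.name, line.strip()))
    return hits


if __name__ == "__main__":
    for name, line in find_hardware_errors():
        print(f"{name}: {line}")
```

Run it as root after a crash and reboot; every line it prints is a candidate for closer inspection in the full file.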