Dear community,
I am really hoping to get some advice on this forum, as my troubleshooting efforts are not leading me to any solution.
Background:
I have recently set up a cluster of 3 Dell Optiplex 3060 micro devices in Proxmox. The cluster ran fine for about 20 days.
They have been working great so far; however, 1 of the devices has started acting weird over the past 3 days.
Issue description:
The node has been randomly rebooting - sometimes after running 1 minute, sometimes after 20.
The uptime before each reboot is spread fairly evenly between 1 and 25 minutes.
The reboots have happened before reaching the BIOS, during BIOS configuration, and after loading the OS - so it doesn't seem related to the OS.
Troubleshooting efforts so far:
I have changed the external power brick - no change in behaviour
I have changed the RAM slot - no change in behaviour
I have changed the RAM module - no change in behaviour
I have changed the CMOS battery - no change in behaviour
I have re-pasted the CPU - no change in behaviour
I have run memtest86 via the Proxmox boot menu - the test didn't finish (the node rebooted during it)
I have run memtest86 via a live USB - the test didn't finish (the node rebooted during it)
I have run the Dell SupportAssist function - the test failed 2 times (rebooted during the test) - and finished 1 time successfully (without any errors detected)
I was unable to complete the full diagnostic test built into the Dell BIOS - the node rebooted during the test
I am unsure what steps I have missed, but would really appreciate some advice, as I am going nuts.
Thanks a lot in advance for any help!