Hi all. I'm hoping someone here can tell me if I've got a hardware error or if I just have a configuration issue. I'm building a FreeNAS system to replace an old 4-bay Synology NAS, so I just bought a used Supermicro server off eBay. The specs are:
- SM 846 case
- SM X9DRI-LNF4 motherboard
- 2x E5-2640L v1
- 4x4GB Samsung DDR3 RDIMM
The system came fully assembled minus hard drives, and has a 30 day warranty. I started by firing the system up and running memtest86, only to have the system completely lock up after a few minutes. The IPMI logs reported multiple single bit memory errors on DIMMC1, so I started troubleshooting. After a day or so of swapping DIMMs sand testing, here's what I've found:
- As delivered, system had RAM in slots A1 and C1 (CPU1) and E1 and G1 (CPU2). In this config it is very unstable and crashes with IPMI errors on DIMMC1 C1
- First, I removed RAM from C1 and E1, switching to just a single DIMM per CPU. In this config the system seems stable. Runs memtest for 1hr+. I ran this test with all four dimms, so I think my RAM is good.
-Next, I swapped the cpus and put the RAM back in the A1/C1,E1/G1 config. In this config the system crashed with C1 single bit errors, so I don't think one of the CPUs is the issue.
- Next, i removed one CPU and ran a single CPU with all four DIMMs in A1,B1,C1,D1. System seemed stable, it was able to run memtest86 for a couple of hours without crashing.
After all of this, it seems likely to me that the motherboard and/or CPU didn't like running the ram in slots A1 and C1. I have rearranged the DIMMs to slots A1,B1, E1 and F1, and that appears stable. It ran memtest86 for 2 hours and it currently 12 hours into running Prime95 for a 24 hour burn in. (I'll run a long duration memtest86 after I finish with Prime95)
What I am looking for is someone to confirm my testing. I want to make sure that I don't have two issues here. Is it common to see Single Bit RAM errors if you aren't running the dual channel RAM in the correct slots? Or is it possible that I have an issue with the motherboard as well? I am planning on upgrading the RAM at some point down the line, so I'd like to know now if I need a new Mobo (before the 30 warranty expires)
- SM 846 case
- SM X9DRI-LNF4 motherboard
- 2x E5-2640L v1
- 4x4GB Samsung DDR3 RDIMM
The system came fully assembled minus hard drives, and has a 30 day warranty. I started by firing the system up and running memtest86, only to have the system completely lock up after a few minutes. The IPMI logs reported multiple single bit memory errors on DIMMC1, so I started troubleshooting. After a day or so of swapping DIMMs sand testing, here's what I've found:
- As delivered, system had RAM in slots A1 and C1 (CPU1) and E1 and G1 (CPU2). In this config it is very unstable and crashes with IPMI errors on DIMMC1 C1
- First, I removed RAM from C1 and E1, switching to just a single DIMM per CPU. In this config the system seems stable. Runs memtest for 1hr+. I ran this test with all four dimms, so I think my RAM is good.
-Next, I swapped the cpus and put the RAM back in the A1/C1,E1/G1 config. In this config the system crashed with C1 single bit errors, so I don't think one of the CPUs is the issue.
- Next, i removed one CPU and ran a single CPU with all four DIMMs in A1,B1,C1,D1. System seemed stable, it was able to run memtest86 for a couple of hours without crashing.
After all of this, it seems likely to me that the motherboard and/or CPU didn't like running the ram in slots A1 and C1. I have rearranged the DIMMs to slots A1,B1, E1 and F1, and that appears stable. It ran memtest86 for 2 hours and it currently 12 hours into running Prime95 for a 24 hour burn in. (I'll run a long duration memtest86 after I finish with Prime95)
What I am looking for is someone to confirm my testing. I want to make sure that I don't have two issues here. Is it common to see Single Bit RAM errors if you aren't running the dual channel RAM in the correct slots? Or is it possible that I have an issue with the motherboard as well? I am planning on upgrading the RAM at some point down the line, so I'd like to know now if I need a new Mobo (before the 30 warranty expires)