Hi, I've been troubleshooting a new build (everything but the RAM was bought used, sold as working) and am looking for others' input on these unusual issues. (I have built and managed 25+ Supermicro (SM) setups over the years, so I have already done all the troubleshooting I know of here.)
Quick question / issue:
(Throughout, I'm following the SM manual for RAM population order, unless I'm intentionally testing something.)
If I use CPU1's blue C1 and D1 slots in any way, I quickly get all kinds of memory-related issues (even with just 1 CPU installed).
If I only use CPU1's A1 and B1, and CPU2's E1, F1, G1, H1, E2, F2, G2, H2 (i.e. 10 sticks), I get no issues: 28-hour memtest with 0 errors, multiple boots, no hint of a problem.
If I change the BIOS memory speed setting from AUTO (which runs the memory at 1333, as it should) to 1066, I have no issues and can use CPU1's blue C1 and D1 slots.
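For anyone wondering what the 1066 workaround costs, here is a rough back-of-the-envelope sketch (my own arithmetic, not from the SM manual). It assumes the standard 64-bit (8-byte) DDR3 data bus per channel and only looks at theoretical peak numbers, so real-world impact will differ:

```python
# Theoretical peak bandwidth per DDR3 channel = transfers per second * 8 bytes.
# Assumption: standard 64-bit (8-byte) data bus per channel, ECC bits not counted.
BUS_BYTES = 8

def peak_mb_per_s(mega_transfers: float) -> float:
    """Peak MB/s for one channel at the given data rate in MT/s."""
    return mega_transfers * BUS_BYTES

auto = peak_mb_per_s(1333.33)   # DDR3-1333 (PC3-10600)
slow = peak_mb_per_s(1066.67)   # DDR3-1066 (PC3-8500)
loss_pct = (auto - slow) / auto * 100

print(f"1333: {auto:.0f} MB/s, 1066: {slow:.0f} MB/s, loss: {loss_pct:.0f}%")
```

So forcing 1066 gives up roughly a fifth of the peak per-channel bandwidth, which is why I would rather find the real cause than live with the workaround.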
(When I say issues/problems, I mean the BIOS will lock up or loop (the IPMI event log shows CATERR at the freeze), or, if I can get into my UEFI memtest 7.2 Pro, it quickly shows ECC errors, and the IPMI event log shows ECC errors on CPU1's C1 and D1 slots. I have ONLY ever seen ECC errors on the C1 and D1 slots.)
(I have reseated the CPUs, found no bent pins on either socket, and inspected the whole board for visible damage. All testing is with no disks, no PCIe slots occupied, and just a USB stick with my memtest boot.) I have even swapped CPU1 with CPU2 (while running just 1 CPU) to verify the issue is not CPU1's fault. I also reset the CMOS per the manual, and tried running on a single PSU (tried each PSU on its own).
(I'm never mixing memory types/makes here; I have 3 separate sets of memory so I can rule out the memory itself.)
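To make the isolation logic explicit, here is a small sketch that encodes the results above and works out which slots show up in failing runs but never in a passing run at the same speed. The names (`Run`, `suspects`) are just made up for illustration, and the 4-slot C1/D1 configuration is an assumed representative case, since I tried C1/D1 in several combinations:

```python
from typing import NamedTuple

class Run(NamedTuple):
    slots: frozenset   # DIMM slots populated for this test
    speed: int         # memory speed in MT/s
    ok: bool           # True if no lockups / ECC errors were seen

runs = [
    # 10 sticks, C1/D1 left empty, AUTO (1333): clean
    Run(frozenset({"A1", "B1", "E1", "F1", "G1", "H1",
                   "E2", "F2", "G2", "H2"}), 1333, True),
    # Assumed representative case: C1/D1 populated at 1333: errors
    Run(frozenset({"A1", "B1", "C1", "D1"}), 1333, False),
    # Same slots forced to 1066: clean
    Run(frozenset({"A1", "B1", "C1", "D1"}), 1066, True),
]

def suspects(runs, speed):
    """Slots common to every failing run at this speed, minus slots
    that have been seen working in a passing run at the same speed."""
    failing = [r.slots for r in runs if r.speed == speed and not r.ok]
    passing = [r.slots for r in runs if r.speed == speed and r.ok]
    if not failing:
        return set()
    common = frozenset.intersection(*failing)
    cleared = frozenset().union(*passing)
    return set(common - cleared)

print(sorted(suspects(runs, 1333)))
print(sorted(suspects(runs, 1066)))
```

With the runs as listed, only C1 and D1 are left as suspects at 1333 and nothing is left at 1066, which matches what I'm seeing: the problem follows those two slots (or their channels) at the higher speed, not the sticks or the CPU.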
Some specs:
CPU: 2x E5-2667 (2.9GHz, 3.5GHz, 6C + HT, L3:xM, 130W, MB microcode patch = 710)
MB: X9DR3-LN4F+ rev 1.10 (read from the MB sticker) (BIOS = 3.2, latest as of Jun 2018)
PSU: 2x SM 720W
Case/Chassis: CSE-825 (2U, 8x 3.5" bays)
RAM options I'm using in the above:
8x sticks of (from a working server):
Elpida EBJ41HE4BAFA-DJ-E - 4GB PC3-10600 DDR3-1333H (9-9-9) ECC Registered CL9 240-Pin DIMM Dual Rank
(EBJ41HE4BAFA-DJ-E datasheet: see ALLDATASHEET)
10x sticks of (from a working server):
Hynix HMT151R7BFR4C-H9-DB - 4GB PC3-10600 DDR3-1333 ECC Registered CL9 240-Pin DIMM Dual Rank
https://www.skhynix.com/product/filedata/fileDownload.do?seq=2931
4x sticks of:
HMT151R7BFR8C-G7 DB AA-C (DDR3-1066 ECC, QUAD rank)
https://www.skhynix.com/product/filedata/fileDownload.do?seq=2937
A good link someone else pointed out, the Hynix memory P/N "decoder":
https://www.skhynix.com/static/filedata/fileDownload.do?seq=190
So, a pretty basic SM setup, but with weird memory issues (only on the C1 and D1 slots). I realize these RAM sticks are not the exact part numbers on the SM QVL for this MB, but their specs match the QVL-listed part numbers exactly (except for the 4x 1066 sticks), and I'm pretty confident all these sticks are good (you can never say 100% good with RAM, LOL).
Any input or ideas, guys? Thanks!