ok folks, i dunno if this is the place to post this request because i dunno if it's hardware or software that's causing my problems.. it's a doozy, so here goes.
for starters, this is a bare metal truenas core (latest stable) build.
the hardware is as follows:
supermicro x11ssh-f (latest firmware, i believe.. flashed it all very recently after downloading it from supermicro, so..)
I3-7100T (hyperthreading is disabled in the bios, so it's just 2 cores = 2 threads).
32Gigs of nemix ecc udimm (as per the spec that supermicro posted, it should work. i ran a memtest64 on it recently for 4 days+ straight, no errors).
LSI 9211-8i hba
intel x710-da2 nic
cpu cooler is complete over-kill as is the heatsink with fan on the hba.. basically, before anyone asks, heat is NOT a problem.. i can promise you that..
2 samsung ssd's for the OS (256Gig... had them laying around.. they're not the problem)
2 samsung 870 evo's (2.5 SATAIII) @500GB.. here's where the problem starts.
5 spinners (not a problem).
as previously stated, this is a bare metal nas and does NOT have any plugin's installed, and probably never will. it's only job will be for A) important file storage (resumes, financial, legal documentation, basic very important stuff for me), and media (movies, tv shows, music, less important, but important no less).
I want it configured as follows:
2x 500Gig SSD's mirrored to hold my ultra important files.
5x spinners will hold my less important media.
so here we go with the problem:
my SSD's are connected directly to the motherboard, while the spinners are connected to the LSI HBA.
during testing, when i'm performing large file copies to the mirrored data ssd's it starts off fine but after a few seconds the file copy slows to a crawl.. after a few more seconds the ssd data pool spits out errors and i get a warning saying that the pool has been degraded. from there, the only way i can get the system back is to connect to the IPMI and do a hard reset as attempting to reboot via the truenas core webgui doesn't work, the cli just shows random pids saying "waiting for whatever"
and at that point the SSDs are HOT.. truenas core claims 45C but they seem far warmer than that.. i've never felt a 2.5 SATA SSD get that warm.
the first time i experienced this, i thought "ok, an ssd died, just replace it.." so that's what i did.. after replacing it with a brand new one, POOF!! same thing after only about 15 minutes of use.
plugging all ssds into a usb caddy and scanning them on my win10 pro pc yields no problems whatsoever. (one drive is brand new, the other 2 were within a year old and were hardly used).
furthermore, i swapped in a pair of brand-new crucial 1TB mx500 SSD's and received the same results..
FURTHERMORE, i tried connecting the SSD's to different SATA ports on the motherboard, SAME PROBLEM!!
i tried connecting the SSD's to the HBA, SAME PROBLEM!!
copying the same 110GB movie folder to the 5 spinners (raidz1) is perfect.. copying the same folder to the nvme (in 2x mode), is perfect..
i can't imagine it being caused by an overloaded cpu that just can't keep up with data writes.. it doesn't happen with the spinners, it doesn't happen with the nvme, and it happens regardless if i send the files across the 10G nic or the 1G nic. none of this makes any sense.. and yes, i reloaded truenas core to see if that was it.. same problem. the only difference is if i send the files across the 10G nic, this blows up after 10-15 seconds, whereas if i send it across the 1G nic, it makes it to the end before blowing up..
if you've made it this far, thank you for taking the time!
for starters, this is a bare metal truenas core (latest stable) build.
the hardware is as follows:
supermicro x11ssh-f (latest firmware, i believe.. flashed it all very recently after downloading it from supermicro, so..)
I3-7100T (hyperthreading is disabled in the bios, so it's just 2 cores = 2 threads).
32Gigs of nemix ecc udimm (as per the spec that supermicro posted, it should work. i ran a memtest64 on it recently for 4 days+ straight, no errors).
LSI 9211-8i hba
intel x710-da2 nic
cpu cooler is complete over-kill as is the heatsink with fan on the hba.. basically, before anyone asks, heat is NOT a problem.. i can promise you that..
2 samsung ssd's for the OS (256Gig... had them laying around.. they're not the problem)
2 samsung 870 evo's (2.5 SATAIII) @500GB.. here's where the problem starts.
5 spinners (not a problem).
as previously stated, this is a bare metal nas and does NOT have any plugin's installed, and probably never will. it's only job will be for A) important file storage (resumes, financial, legal documentation, basic very important stuff for me), and media (movies, tv shows, music, less important, but important no less).
I want it configured as follows:
2x 500Gig SSD's mirrored to hold my ultra important files.
5x spinners will hold my less important media.
so here we go with the problem:
my SSD's are connected directly to the motherboard, while the spinners are connected to the LSI HBA.
during testing, when i'm performing large file copies to the mirrored data ssd's it starts off fine but after a few seconds the file copy slows to a crawl.. after a few more seconds the ssd data pool spits out errors and i get a warning saying that the pool has been degraded. from there, the only way i can get the system back is to connect to the IPMI and do a hard reset as attempting to reboot via the truenas core webgui doesn't work, the cli just shows random pids saying "waiting for whatever"
and at that point the SSDs are HOT.. truenas core claims 45C but they seem far warmer than that.. i've never felt a 2.5 SATA SSD get that warm.
the first time i experienced this, i thought "ok, an ssd died, just replace it.." so that's what i did.. after replacing it with a brand new one, POOF!! same thing after only about 15 minutes of use.
plugging all ssds into a usb caddy and scanning them on my win10 pro pc yields no problems whatsoever. (one drive is brand new, the other 2 were within a year old and were hardly used).
furthermore, i swapped in a pair of brand-new crucial 1TB mx500 SSD's and received the same results..
FURTHERMORE, i tried connecting the SSD's to different SATA ports on the motherboard, SAME PROBLEM!!
i tried connecting the SSD's to the HBA, SAME PROBLEM!!
copying the same 110GB movie folder to the 5 spinners (raidz1) is perfect.. copying the same folder to the nvme (in 2x mode), is perfect..
i can't imagine it being caused by an overloaded cpu that just can't keep up with data writes.. it doesn't happen with the spinners, it doesn't happen with the nvme, and it happens regardless if i send the files across the 10G nic or the 1G nic. none of this makes any sense.. and yes, i reloaded truenas core to see if that was it.. same problem. the only difference is if i send the files across the 10G nic, this blows up after 10-15 seconds, whereas if i send it across the 1G nic, it makes it to the end before blowing up..
if you've made it this far, thank you for taking the time!