Created an account to mention... I have the exact same problem with different equipment. I am running a 480TB array on an AMD 2700X on an Asus Prime X470 with 32GB of 2666MHz RAM (non-ECC), running Ubuntu 18.04 LTS. I have 2 large arrays. I also have Thunar File Manager installed. I am using an LSI SAS 12G card, a 9300 I think. I have a 16-bay RAID machine and a Supermicro 24-bay. I originally had the 16 x 8TB IronWolf drives in the RAID machine array. The 24-bay wasn't used for much originally (I need more money) LOL!
I had the same problem as mentioned by the OP.
Eventually I purchased 9 x 16TB Exos drives. I zero-wipe all drives using dd. Ubuntu runs on a 1TB WD Blue SSD.
Later I purchased 10 more 10TB Exos drives. I swapped the arrays around so the Supermicro became the main array I use, and it only holds Exos drives (by the way, I LOVE Exos drives compared to IronWolf drives).
The problem persisted.
Upon bootup and SOMETIMES randomly throughout the week, I would go to access the server over the network and it would appear to freeze... Usually it would copy from Windows to the server for a few seconds and then the speed would drop to 0 for a minute or so. During this time, ALL the lights of ALL the drives in the array being accessed would be blinking like crazy doing "something".
dmesg gives me the exact error structure as the OP...
Just 4 hours ago I upgraded the BIOS to the latest version. Problem still occurring. BUT, after reading these posts, it seems it could be caused by an older mdadm/kernel.
So, I will try to figure out how to upgrade those. I am running a Plex server and a massive backup for my decades of "collecting things" and running a computer business.
I'll try to figure out which kernel I am using and which mdadm version I have. I did just clone the SSD today so I could try the OS upgrade, but I was scared to do it as I have Plex, Apache, NodeJS, and a WOTLK private server (just for me, it's inside VMM).
At least if I butcher anything, I can just swap the SSDs back. I made the clone using dd. I always run the system from the freshly made clone, pull the original out and label it "backup", and swap them again later as part of a small multi-drive rotation of backups.
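For anyone wanting to check the same two versions, here is a minimal sketch in Python. It assumes mdadm is installed and on the PATH; the helper names `kernel_release` and `mdadm_version` are made up for illustration. `platform.release()` reports the same string that `uname -r` prints, and the script simply runs `mdadm --version` and shows whatever it prints.

```python
import platform
import shutil
import subprocess


def kernel_release():
    """Return the running kernel release (the string `uname -r` prints)."""
    return platform.release()


def mdadm_version():
    """Return the text printed by `mdadm --version`, or a note if mdadm is missing."""
    if shutil.which("mdadm") is None:
        return "mdadm not found on PATH"
    # Merge stderr into stdout, since some builds print the version on stderr.
    result = subprocess.run(
        ["mdadm", "--version"],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    return result.stdout.strip() or "mdadm printed no version text"


if __name__ == "__main__":
    print("kernel:", kernel_release())
    print("mdadm: ", mdadm_version())
```

The two strings it prints are what you would compare against the versions mentioned earlier in the thread.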
I'll update here when I get around to trying that and upgrading my system. It would be really great to finally fix this annoyance. I was always scared it was memory, motherboard, LSI card, or something else causing the problem... I have tested this system to death with Prime and other tests and they all come out perfect! The ONLY issue I have with this home server is the one the OP had...
Also, sometimes when copying files the system would start off fast, drop to 0... then speed up, then drop to 0... but, usually, once it finishes its "blinking of the lights or whatever it's doing", the system screams. I also copy from array to array all the time doing backups to the 16-bay RAID 6.
I also have a mix of RAID 5 and RAID 6 arrays in the Supermicro 24-bay, and it only SEEMS to happen on the RAID 6 arrays. So, who knows.
I can't wait to update my Ubuntu. I plan on buying an Epyc or Threadripper to play with in the future, with ECC memory, now that ECC memory speeds are around 3200MHz these days.
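To make it easier to note which array is affected when a stall happens, here is a rough Python sketch that lists each md array and its RAID level from the text of /proc/mdstat. The function name `array_levels` and the `SAMPLE` text are made up for illustration (the sample is not output from this server), and the parsing only assumes the usual "mdN : active raidX ..." line layout.

```python
import os
import re

# Tokens that name an md personality, e.g. raid5, raid6, linear.
LEVEL_RE = re.compile(r"^(raid\d+|linear)$")

# Hypothetical sample text, only here to show the expected layout.
SAMPLE = """\
Personalities : [raid6] [raid5] [raid4]
md0 : active raid6 sdb[0] sdc[1] sdd[2] sde[3]
      15627788288 blocks super 1.2 level 6, 512k chunk, algorithm 2 [4/4] [UUUU]
md1 : active raid5 sdf[0] sdg[1] sdh[2]
      15627788288 blocks super 1.2 level 5, 512k chunk, algorithm 2 [3/3] [UUU]
unused devices: <none>
"""


def array_levels(mdstat_text):
    """Map each md device name to its RAID level, or "unknown" if none is listed."""
    levels = {}
    for line in mdstat_text.splitlines():
        name, sep, rest = line.partition(" : ")
        if not sep or not name.startswith("md"):
            continue  # skip the Personalities line, detail lines, and the footer
        level = next((tok for tok in rest.split() if LEVEL_RE.match(tok)), "unknown")
        levels[name] = level
    return levels


if __name__ == "__main__":
    path = "/proc/mdstat"
    if os.path.exists(path):
        with open(path) as handle:
            text = handle.read()
    else:
        text = SAMPLE  # fall back to the made-up sample on non-Linux machines
    for name, level in sorted(array_levels(text).items()):
        print(name, level)
```

Comparing that list with the array being accessed during a freeze would show whether only the RAID 6 arrays are involved.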
THANK you for posting your problems. It really helps to see this thread and know that at least someone else found a solution to this. Also, I wonder if Thunar file manager is causing the problem by hooking into something in the background?! Who knows. Weird, considering Thunar isn't even open when this happens. I'm copying from Windows 10 on a Ryzen 5950X with a 1Gb Asus NIC across the network to my AMD server... and the exact same thing happens from many other machines I have... 3950X, 5600G, 2400G, 2700X, 1700X, etc... and I have Ubiquiti network equipment (router, switch, wifi, etc.) with no network errors logged on any of it. I have replaced all my cables with Cat 7/8 and redid the rest in Cat 6. Overkill for me, but I like to learn and tinker.
Will update when I get around to doing this upgrade. And I HOPE my problem is resolved like the OP...
I wanted to comment on here because I am running a clean (non-upgraded, non-expanded) mdadm RAID 6 array on a non-Intel system and I have the exact same problem.
Thank you for your time.
David Perry
Perry Computer Services