Sorry took a break from this project after a few hundred failed attempts to get things working smoothly. A few updates with what Ive been doing and what didnt work.
1. Installed a P4510 NVMe in each chassis, fans went to 100% and got a critical fault on every blade that the NVMe was overheating (it wasnt). Updated the firmware with Intels utility, same issue. Then went down the rabbit hole that is Cisco vs OEM firmware and tried a few thousand things. The HUU for the S3260 does NOT contain firmware for the P4510 because it was never offered on this platform, only the C480M5 HUU (and a few others) have the firmware needed to get these drives to show up correctly to CIMC. So, lets try a few things:
a. Tried to modify the HUU ISO to include the correct firmware and edit the Catalog.json file to detect this drive. No dice, using 6 different ISO editing tools resulted in either no boot at all, partial boot or (best so far) it boots all the way up then throws an error after the HUU is loaded that the image cant be mounted. Two likely issues here, the ISO is built using some tool im unfamiliar with and re-writing it is breaking something. The HUU is checking file consistency somehow (more on that later).
b. Instead of editing the ISO with a utility, I hex edit the C480M5 HUU ISO directly and change the machine string to match my S3260M4. Still fails at the intial boot that the HUU is only valid for X when Y detected.
c. Dig further into the ISO and find out that the Catalog.json file is a red herring and isn't actually used. That file does nothing and the real Catalog.json is compressed inside initrd0.gz, I am unable to edit that one because anytime I re-write the ISO i run into corruption issues with HUU when it tries to launch.
d. OK, fine, ill extract the firmware files directly. Extract the Intel firmware from the ISO, but unable to find a utility that can write the NVMe firmware directly. Only the Intel utility but it comes with the firmware baked-in to a DLL file thats signed. Great, I don't know of any utility that can write NVMe firmware outside of the vendor. Decoding how the HUU does it, it uses a utility that comes with it but I can't find a reliable way to launch it.
e. I have a bunch of non-Cisco 10TB HE10 drives. HUH721010AL4204. These are the same as the Cisco HUH721010AL42C0. Tried to use several tools to write the cisco firmware to them, ended up with 1 paperweight that I'm trying to recover now...
So far, using non-Cisco 12TB drives has not had any bad experience. I am seeing some "Unexpected Sense" in MegaRAID but the drives are happily churning along copying my data over for a load test. I have one S3260 set up 50/50 with 30 drives per blade. One blade is fully populated with 30x 12TB SAS HUH721212AL4201 drives, and the other blade has been my test bed for tinkering with a handful off oddball SAS drives. Had a bit of a whoopsie-daisy with this setup as Node1 was copying data at ~500MBs, i installed the paperweight drive from the above comment into the SECOND blade mapped port, so not mapped to the first node at all that was doing the copy, should only impact the second node right? Wrong, backplane had some kind of hiccup or brain freeze that knocked about half the 12TB drives offline on the other server and threw a huge fit. Rebuilt the array, started over. Removed said dead drive from the second node, again, first node has storage explosion. Long story short, these servers are bit sensitive to faulted drives, working on one can absolutely impact the other!!
To answer your questions, the fan control does exactly what it wants, you can tell it to chill out but it will ignore your preference if it detects any anomaly with the devices. E.g. an NVMe reporting incorrect temperature, a HDD not reporting temperature at all will absolutely cause the fans to ramp up and there is no override for the override.
@jtaj it looks like your node(s) are not being detected properly by the chassis so it cant make changes to the SAS expander mapping. Boot both blades up, make sure they are ALL on the same firmware, refresh the CIMC and try again. You can run into odd behavior if you have one blade powered off and make changes to it "offline". I just booted them both up, hit F2 to go into BIOS and then went into CIMC to tinker. At that point all the devices will be initialized and CIMC can talk to the SIOCs/SAS expanders/RAID cards to do any changes needed.