I'm unsure if that would be helpful, but I'll describe my journey with SN2100 recovery.
I got my unit for cheap (cheap enough to risk) as it was sold as "not working, doesn't turn on" for a home lab. After poking with a multimeter around it, I saw that there was 220V on the input but nothing on the output of the PSU when the mains cable was plugged in. And that was on both PSUs. (I initially thought that only on one and another was there no input voltage, but later on, I realised that the mains cable there needed to be pushed way harder than I was doing.)
So I've replaced PSU with Meanwell RPS-300-12 (Mellanox comes with Delta mds300apb-12, but I ordered Meanwell, and probably that was a small mistake, but it was half the price of Delta and had a better reputation among everybody whom I've asked for advice). One problem with Meanwell as a replacement is that neither screw sizes nor screw positions match what Delta have (I plan to model a small replacement plate that can be used, I will share if I'll ever do it), as well as connectors are different. In my case, someone tried to fix the switch before, and the sensing rail (the one that reports PWR_OK and other useful signals) was broken anyway (the cable was damaged).
However, I screwed in the PSU (put it on some insulator), and the switch started to boot. I bought a random RJ45->DB9 cable and plugged in my Prolific serial adapter (I have had it for a very long time), and I saw a BIOS screen, Grub and then some old firmware (3.6.2102). BIOS's password was not `admin`, so no luck there. After boot, the input was not working at all; I have no idea why.
I've bought a new M.2 SATA SSD (I had only one available in the local store - which was 512GB Transcend 430, which should be good enough for OS) and with fresh SSD, it happily booted from Flash and allowed me to embed ONIE first and then from "Install ONIE" on disk to flash it (I've put 3.9.3202 recovery firmware on HTTP server that was running on my router and it happily downloaded it).
During that time, input over serial was not working very well, like something was broken (sometimes characters resulted in garbage being sent); I'm not sure if that is just a PL2303 problem or something else.
After recovery was reflashed, it happily booted and reported a relatively healthy state (I still have one PSU, so it thinks that the second one is faulty, but SN2100 is happy even when PWR_OK reporting is missing completely and the first PSU is healthy from its point of view) and seems to be working just fine.
Check your baud rate. If it's an older BIOS, you may need 9600, but newer BIOS wants 115200. I saw some weird characters when connecting at 9600 on my SN2700M, but that all cleared up when connected at 115200. Perhaps your issue is something else though.
1. MOST IMPORTANT STEP: REPLACE SSD - I recommend Transcend MSA452T-I 128GB
2. Download ONIE recovery image, flash to USB stick, insert USB stick into switch
3. Power Cycle switch, hammer CTRL+B to get into BIOS, if password required, try „admin“ (without quotes)
4. boot from USB (UEFI Mode!!), select „Embed ONIE“ once ONIE has booted
5. Download Onyx 3.10 installer from mega.nz link in first page (should work - if your switch was on ancient firmware before, use the 3.9 instead)
6. Copy the Onyx installer to a FAT32 formatted USB, rename installer file to onie-installer (no extension)
7. insert usb stick with onyx installer into switch, power cycle switch, on ONIE prompt select „Install OS“
8. let the installer finish. Switch should come up ready to use
I found a pair of HPE SN2700M [HPE branded SN2700] switches for a somewhat reasonable price ($1200 each). It's a lot of money for a home lab, but I wanted a step up from my SX6036 switches to play with 100GbE. I have some DAC's and NIC's that all run 56Gb Ethernet and I'm happy to report that the SN2700 supports this speed just like the SX6036 does. The Nvidia documentation was unclear to me, so hopefully this will help others.
After reading this thread, I also ordered two Transcend mSATA SSD's immediately [64GB TS64GMSA452T] for $20 each. They showed up before the switches did. I cannot find the industrial variant here in the US, but these seem to work just fine.
After changing my COM port speed to 115200 in Windows device manager and then connecting via serial in Putty, I saw each unit boot up. The first one was throwing the following errors in the console and rebooting over and over.
The other switch had Cumulous installed. I had planned to remove the factory SSD's anyway and the error was resolved after a new SSD & a fresh install.
Using the post above was key for me to understand the recovery process for an SN2700. It's very simple.
1) Install a new SSD into the switch
2) Boot into the BIOS with CTRL + B (PW: admin worked for me)
3) Disable CSM & reboot
4) Boot from USB using the recovery image (I used Rufus to install 'onie-recovery-x86_64-mlnx_x86-r0.iso' [in the 2020.11-5.3.0005-115200 folder]
5) Select Embed ONIE, which formats the new SSD and installs the bootloader. After this is complete, remove the USB & reboot. You should see 'Install ONIE'.
6) On your PC, delete all partitions on your USB drive created earlier by Rufus. Manually create a new partition and format with FAT32. Copy the file 'X86_64-3.9.3202-installer.bin' onto the FAT32 partition (don't use Rufus). Rename 'X86_64-3.9.3202-installer.bin' to 'onie-installer' without a file extension.
7) With the new installer file on your USB drive, insert the USB drive and reboot the switch. Select 'Install ONIE' and wait for completion.
8) After the install is complete, remove the USB drive while rebooting. Leaving it in will cause the switch to not boot and give you this error: 'disk `hd0,4' not found'. This was confusing to me. You no longer need this USB drive.
9) Make sure the switch boots up. Connect to the mgmt0 web interface from your PC and login. Browse to System --> Onyx Upgrade.
10) Rename 'onyx-X86_64-3.10.4404.zip' to 'onyx-X86_64-3.10.4404.img'. Select Install from local file and upgrade the switch to 3.10.4404 via the browser.
This worked well for me on two of these switches. As previously mentioned in this thread, you might not be able to jump to 3.10.4404 directly if you need a BIOS upgrade and/or ASIC FW upgrade. You might want to do it one version at a time. It's comforting to know that you can just start over though, if you make a mistake.
Check your baud rate. If it's an older BIOS, you may need 9600, but newer BIOS wants 115200. I saw some weird characters when connecting at 9600 on my SN2700M, but that all cleared up when connected at 115200. Perhaps your issue is something else though.
Apparently, due to bad packaging and excessive shock / vibration, one of the connectors of the SE Harness (flat cable connecting MNG and switch board, carrying I²C signals and other stuff) came loose.
I didn't see it, because it was still halfway plugged in (the connector was tilted, so some pins had connection and some not)
Reseated the connector and everything works perfectly fine now.
After few tries, we made a clone off the old mSATA SSD with dd under linux to a new one, checked filesystem with gparted and put the new one back in Switch. Reboot and 3x Upgrades later it was still working fine. Now its running for over a week without Problems.
Apparently, due to bad packaging and excessive shock / vibration, one of the connectors of the SE Harness (flat cable connecting MNG and switch board, carrying I²C signals and other stuff) came loose.
I didn't see it, because it was still halfway plugged in (the connector was tilted, so some pins had connection and some not)
Reseated the connector and everything works perfectly fine now.
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.