Mellanox Switches - Tips & Tricks

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

dbTH

Member
Apr 9, 2017
173
64
28
I have an SN2100. I have both a 100GBASE-PSM4 and a 100GBase-SR4 transceiver and would like to do a 4x 10G breakout. I've been able to configure the ports to do the breakout, but I can only get a link speed of 1G and not 10G. I've tried forcing the 10G speed, but the link just won't come up.

I tried moving the SMF and MMF QSFP28 transceivers to my Mikrotik CRS-504-4XQ-IN, and when I do that, they magically auto-negotiate to 10G. Because of that, I'm only using it essentially as a converter.

In fae mode, I ran the following:
Code:
mlxlink -d /dev/mst/mt52100_pci_cr0 -p 15 -m

Operational Info
----------------
State                           : Polling
Physical state                  : ETH_AN_FSM_ABILITY_DETECT
Speed                           : N/A
Width                           : N/A
FEC                             : N/A
Loopback Mode                   : No Loopback
Auto Negotiation                : ON

Supported Info
--------------
Enabled Link Speed              : 0x68b5f141 (100G,56G,50G,40G,25G,10G,1G)
Supported Cable Speed           : 0x2024a101 (100G,56G,50G,40G,25G,10G,1G)

Troubleshooting Info
--------------------
Status Opcode                   : 2
Group Opcode                    : PHY FW
Recommendation                  : Negotiation failure

Tool Information
----------------
Firmware Version                : 13.2010.4406
MFT Version                     : mft 4.22.1-11

Module Info
-----------
Identifier                      : QSFP28
Compliance                      : 100G PSM4 Parallel SMF
Cable Technology                : 1310 nm EML
Cable Type                      : Optical Module (separated)
OUI                             : Other
Vendor Name                     : FS
Vendor Part Number              : QSFP28-PIR4-100G
Vendor Serial Number            : S2202328654
Rev                             : 01
Wavelength [nm]                 : 1310
Transfer Distance [m]           : 0
Attenuation (5g,7g,12g) [dB]    : N/A
FW Version                      : N/A
Digital Diagnostic Monitoring   : Yes
Power Class                     : 4.5 W max
CDR RX                          : ON
CDR TX                          : ON
LOS Alarm                       : N/A
Temperature [C]                 : 36 [-10..80]
Voltage [mV]                    : 3286.4 [3000..3600]
Bias Current [mA]               : 35.358 [7..65]
Rx Power Current [dBm]          : -40 [-14..6]
Tx Power Current [dBm]          : 0 [-6..4]
I should probably try not using a breakout and just go straight from QSFP28 to SFP+, but it seems silly to just utilize entire QSFP28 ports for such low speeds. Does anyone have ideas on how to make 10G work when using the breakout?
If I read your post correctly, you were trying to do a 4x10 split with a 100GBASE-PSM4 or a 100GBASE-SR4 transceiver? It won't work that way. You need a compatible breakout cable. And you can read this Mellanox document for the cables P/N and configuration details:
 

amlai

New Member
Aug 25, 2023
10
2
3
Based on this document, it seems like it should work:

Maybe I should try to get a 40GbE optical transceiver and try the breakout? It’s odd as in the CLI and in Onyx it lists 10GbE (25, 10, and 1) as an available speed for each breakout.
 

NablaSquaredG

Bringing 100G switches to homelabs
Aug 17, 2020
1,835
1,212
113
you were trying to do a 4x10 split with a 100GBASE-PSM4 or a 100GBASE-SR4 transceiver? It won't work that way.
The transceivers themselves support breakout. I'm just not sure whether they support 10G link speed, it is possible that those transceivers can only do 25G link speed per lane.

Maybe I should try to get a 40GbE optical transceiver and try the breakout?
Yeah, maybe that's the way to go. 40G-SR4 or 40G-PLR4 / 40G-PLRL4 are quite affordable.
 

dbTH

Member
Apr 9, 2017
173
64
28
Based on this document, it seems like it should work:

Maybe I should try to get a 40GbE optical transceiver and try the breakout? It’s odd as in the CLI and in Onyx it lists 10GbE (25, 10, and 1) as an available speed for each breakout.
For upto 100G Meallnox switch, only passive copper breakout cables for 2x50G, 4x25G or 4x10G have been listed as compatible by Nvidia/Mellanox, though there are other non-compatible AOC breakout cables available.
 

NablaSquaredG

Bringing 100G switches to homelabs
Aug 17, 2020
1,835
1,212
113
For upto 100G Meallnox switch, only passive copper breakout cables for 2x50G, 4x25G or 4x10G have been listed as compatible by Nvidia/Mellanox, though there are other non-compatible AOC breakout cables available.
Doesn't matter, I've used breakout transceiver on my own Mellanox switches (both SwitchX and Spectrum 1) a lot and I can assure that it works.
 

dbTH

Member
Apr 9, 2017
173
64
28
The transceivers themselves support breakout. I'm just not sure whether they support 10G link speed, it is possible that those transceivers can only do 25G link speed per lane.


Yeah, maybe that's the way to go. 40G-SR4 or 40G-PLR4 / 40G-PLRL4 are quite affordable.
It would be quite interesting if it will actually work by adding a 40G transceiver to 100G switch port and then split.
 

NablaSquaredG

Bringing 100G switches to homelabs
Aug 17, 2020
1,835
1,212
113
It would be quite interesting if it will actually work by adding a 40G transceiver to 100G switch port and then split.
Of course it will work... I am running this exact same setup with both 40G and 100G transceivers
 

dbTH

Member
Apr 9, 2017
173
64
28
Doesn't matter, I've used breakout transceiver on my own Mellanox switches (both SwitchX and Spectrum 1) a lot and I can assure that it works.
I guess it dpends what cables and the attached transceivers you were using. Currently, I have a working 2x50G split setup in my SN2010 switch setup, but with a compatible passive copper cable. Had tried the non-compatiable AOC cable split (with pre-attached transceivers) before, and it didn't work. It complained about the incompatible transceiver (pre-attached) voltage with the switch
 
Last edited:

tsteine

Active Member
May 15, 2019
182
92
28
Had tried the non-compatiable AOC cable split (with pre-attached transceivers) before, and it didn't work. It complained about the incompatible transceiver (pre-attached) voltage with the switch
I am running 100G -> 4x 25G with an AOC on SN2700. QSFP-100G-4SAO15
The only thing worth mentioning about this setup is that I had to set FEC to Reed Solomon on the other end as the other switch defaulted to Fire Code, while the spectrum switch uses RS by default.
 

amlai

New Member
Aug 25, 2023
10
2
3
It would be quite interesting if it will actually work by adding a 40G transceiver to 100G switch port and then split.
My 40G-PLR4 adapter finally showed up and I was able to get it to link up at 10G. I bought the cheapest transceiver I could find online (Approved Networks JNP-QSFP-4X10GE-LR-A 40GBASE-PLR4 SMF 1310nm 10km Transceiver), and it worked. The only downside is that the instant I plugged it in, the fans in the switch spun up a lot higher - maybe 100%? Looking at the logs, it seems like the temperature readings weren't coming through from the transceiver.

I forced the fans to a lower speed by running:
fae mlxi2c set_fan /MGMT/FAN1 1 30
fae mlxi2c set_fan /MGMT/FAN2 1 30
fae mlxi2c set_fan /MGMT/FAN3 1 30
fae mlxi2c set_fan /MGMT/FAN4 1 30
 
  • Like
Reactions: dbTH

dbTH

Member
Apr 9, 2017
173
64
28
My 40G-PLR4 adapter finally showed up and I was able to get it to link up at 10G. I bought the cheapest transceiver I could find online (Approved Networks JNP-QSFP-4X10GE-LR-A 40GBASE-PLR4 SMF 1310nm 10km Transceiver), and it worked. The only downside is that the instant I plugged it in, the fans in the switch spun up a lot higher - maybe 100%? Looking at the logs, it seems like the temperature readings weren't coming through from the transceiver.

I forced the fans to a lower speed by running:
fae mlxi2c set_fan /MGMT/FAN1 1 30
fae mlxi2c set_fan /MGMT/FAN2 1 30
fae mlxi2c set_fan /MGMT/FAN3 1 30
fae mlxi2c set_fan /MGMT/FAN4 1 30
Some brands of optical transceivers generate more heat than other even idle. In addition, the type of transceiver modules also matters. LR SMF 1310nm (10km) transceiver requires more power than SR MMF 850nm. In my opinion, it is a wasteful of power resource to use LR SMF 1310nm 10km Transceiver in a homelab setup or shorter distance. So, I prefer to use SR MMF 850nm in my setup.
 

BoGs

Active Member
Feb 18, 2019
165
39
28
Some brands of optical transceivers generate more heat than other even idle. In addition, the type of transceiver modules also matters. LR SMF 1310nm (10km) transceiver requires more power than SR MMF 850nm. In my opinion, it is a wasteful of power resource to use LR SMF 1310nm 10km Transceiver in a homelab setup or shorter distance. So, I prefer to use SR MMF 850nm in my setup.
I disagree with your statement, sure its not efficient but I would not say wasteful. There is 5? MMF versions and one SMF - you can standardize on one and use the same optics, cables and all working all the time. There is no need to worry. Yes I will not have to worry about shooting light 1km+ down the road but I do have light shooting 300m which gets into the maybe MMF maybe not.
 

dbTH

Member
Apr 9, 2017
173
64
28
I disagree with your statement, sure its not efficient but I would not say wasteful. There is 5? MMF versions and one SMF - you can standardize on one and use the same optics, cables and all working all the time. There is no need to worry. Yes I will not have to worry about shooting light 1km+ down the road but I do have light shooting 300m which gets into the maybe MMF maybe not.
My opinion about wasteful of power resource was from homelab setup perspective and not from DC deployment. I would prefer to have gears which are energy efficient. And I'm wondering how many of us actually need to have optical cable run more than 100m in a homlab setup?
 

amlai

New Member
Aug 25, 2023
10
2
3
I started with MMF. However, after I decided I wanted to run more fiber, I went with SMF because I could get 100G optics for $5 a piece. In addition, when I ran fiber from the basement to the attic, I could only get 1 MTP/MPO terminated cable and a cat 6A cable through the hole I drilled down from the attic. (And drilling that single run was a royal pain.) If I went with MMF, that would mean that I could only get 1 fiber trunk through. With SMF, I could run 1 24 fiber trunk assembly, go into a break out cassette and have more capacity than I could ever know what to do with. Given all of that, it just made sense to standardize on SMF and only use MMF for specific scenarios or when I already had the optics.
 

BoGs

Active Member
Feb 18, 2019
165
39
28
I started with MMF. However, after I decided I wanted to run more fiber, I went with SMF because I could get 100G optics for $5 a piece. In addition, when I ran fiber from the basement to the attic, I could only get 1 MTP/MPO terminated cable and a cat 6A cable through the hole I drilled down from the attic. (And drilling that single run was a royal pain.) If I went with MMF, that would mean that I could only get 1 fiber trunk through. With SMF, I could run 1 24 fiber trunk assembly, go into a break out cassette and have more capacity than I could ever know what to do with. Given all of that, it just made sense to standardize on SMF and only use MMF for specific scenarios or when I already had the optics.
You can get MTP cables for MMF just like SMF, but the costs of the MMF cables start to grow astronomically after 3-5M. I needed to run 300M to my shed when I was building it back 4 years ago, SMF 12 MTP was ~1k armored and such, MMF would of been 3.5k for the exact same cable. Optics are cheaper, but the cables get expensive fast. So I moved to SMF everywhere and get optics used.