PCIe bridge link downgraded, device link status normal, but measured bandwidth is lower than expected

tientuvo

New Member
Apr 27, 2024
Hello,

I'm using a Gigabyte G293-Z42 server with a PCIe bifurcation setup.
After inserting the GPUs, I checked the PCIe link status at the device level. It shows Gen4 x4, which is expected with the bifurcation setup:

Code:
25:00.0 Processing accelerators: Device 0302 (rev 01)
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 56
        NUMA node: 0
        Region 0: Memory at c5380000 (32-bit, non-prefetchable) [size=256K]
        Region 1: Memory at c5340000 (32-bit, non-prefetchable) [size=256K]
        Region 2: Memory at 6dc70000000 (64-bit, prefetchable) [size=64M]
        Region 4: Memory at 6dc60000000 (64-bit, prefetchable) [size=256M]
        Capabilities: [40] Express (v2) Endpoint, MSI 00
                DevCap: MaxPayload 512 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ SlotPowerLimit 25.000W
                DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq-
                        RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+ FLReset-
                        MaxPayload 512 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr+ NonFatalErr- FatalErr- UnsupReq+ AuxPwr- TransPend-
                LnkCap: Port #0, Speed 16GT/s, Width x4, ASPM L0s L1, Exit Latency L0s unlimited, L1 unlimited
                        ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
                LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk-
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 16GT/s (ok), Width x4 (ok)
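
When many devices have to be checked, the LnkCap and LnkSta lines in output like the above can be compared by script instead of by eye. This is only a minimal sketch, assuming the text comes from `sudo lspci -vv -s <device>`; `parse_link` and `is_downgraded` are hypothetical helper names, not part of lspci or any library.

```python
import re

# Matches "LnkCap: ... Speed 16GT/s, Width x4" and "LnkSta: Speed 16GT/s (ok), Width x4 (ok)".
# The colon right after Cap/Sta keeps LnkCap2/LnkSta2/LnkCtl lines out.
LNK_RE = re.compile(r"Lnk(Cap|Sta):.*?Speed\s+([\d.]+)GT/s.*?Width\s+x(\d+)")


def parse_link(lspci_text):
    """Return {'cap': (speed_gt_s, width), 'sta': (speed_gt_s, width)} for one device."""
    link = {}
    for line in lspci_text.splitlines():
        match = LNK_RE.search(line)
        if match:
            kind = match.group(1).lower()  # 'cap' or 'sta'
            link[kind] = (float(match.group(2)), int(match.group(3)))
    return link


def is_downgraded(link):
    """True if the trained speed or width is below what the port advertises."""
    cap_speed, cap_width = link["cap"]
    sta_speed, sta_width = link["sta"]
    return sta_speed < cap_speed or sta_width < cap_width
```

Fed the LnkCap/LnkSta lines of one device, `is_downgraded(parse_link(text))` reports whether the trained link is below the advertised capability of that port. It says nothing about the links further upstream.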
Here is the lspci tree:

Code:
 +-[0000:40]-+-00.0
 |           +-00.3
 |           +-01.0
 |           +-01.4-[41-4e]----00.0-[42-4e]--+-00.0-[43-48]----00.0-[44-48]--+-10.0-[45]--
 |           |                               |                               +-14.0-[46]--
 |           |                               |                               +-18.0-[47]--
 |           |                               |                               \-1c.0-[48]--
 |           |                               \-04.0-[49-4e]----00.0-[4a-4e]--+-00.0-[4b]----00.0
 |           |                                                               +-04.0-[4c]--
 |           |                                                               +-08.0-[4d]--
 |           |                                                               \-0c.0-[4e]--
 |           +-02.0
 |           +-03.0
 |           +-04.0
 |           +-05.0
 |           +-07.0
 |           \-07.1-[4f]--+-00.0
 |                        \-00.1
 +-[0000:20]-+-00.0
 |           +-00.3
 |           +-01.0
 |           +-01.4-[21-2e]----00.0-[22-2e]--+-00.0-[23-28]----00.0-[24-28]--+-10.0-[25]--+-00.0
 |           |                               |                               |            \-01.0
 |           |                               |                               +-14.0-[26]--+-00.0
 |           |                               |                               |            \-01.0
 |           |                               |                               +-18.0-[27]--+-00.0
 |           |                               |                               |            \-01.0
 |           |                               |                               \-1c.0-[28]--+-00.0
 |           |                               |                                            \-01.0
 |           |                               \-04.0-[29-2e]----00.0-[2a-2e]--+-00.0-[2b]--
 |           |                                                               +-04.0-[2c]--
 |           |                                                               +-08.0-[2d]--
 |           |                                                               \-0c.0-[2e]--
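
The same chain of bridges shown in the tree can also be walked through sysfs, which makes it easier to see at which hop the link is narrower than its maximum. A rough sketch, assuming a Linux kernel that exposes the `current_link_speed`, `current_link_width`, `max_link_speed` and `max_link_width` attributes under `/sys/bus/pci/devices/<domain:bus:dev.fn>`; `read_attr` and `link_chain` are hypothetical helper names.

```python
import os
import re

# A sysfs PCI directory name such as 0000:25:00.0 (domain:bus:device.function).
BDF_RE = re.compile(r"^[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]$")


def read_attr(dev_dir, name):
    """Read one sysfs attribute; return None if it is missing or unreadable."""
    try:
        with open(os.path.join(dev_dir, name)) as f:
            return f.read().strip()
    except OSError:
        return None


def link_chain(dev_dir):
    """List (bdf, cur_speed, cur_width, max_speed, max_width) from the device up to the root port."""
    path = os.path.realpath(dev_dir)  # resolves the /sys/bus/pci/devices symlink
    chain = []
    while BDF_RE.match(os.path.basename(path)):
        chain.append((
            os.path.basename(path),
            read_attr(path, "current_link_speed"),
            read_attr(path, "current_link_width"),
            read_attr(path, "max_link_speed"),
            read_attr(path, "max_link_width"),
        ))
        path = os.path.dirname(path)  # parent directory is the upstream bridge
    return chain

# Example call on the machine itself:
#   for row in link_chain("/sys/bus/pci/devices/0000:25:00.0"):
#       print(row)
```

Each row reflects that function's own link registers, so a row whose current width is below its maximum width marks a hop worth looking at.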

However, at the bridge level the PCIe link is reported as downgraded. The last time this happened on the same server model, it was caused by how the PCIe card was inserted.
This time I inserted the card very carefully, but it still happens. Is there any way to confirm whether it comes from the BIOS, the driver, or the hardware setup?

Is it normal that the PCI bridge link runs at only Gen5 x4 while the four devices connected to it each support Gen4 x4?
My PCIe bandwidth measurement shows degraded bandwidth: the result is what I would expect from a Gen3 x4 device link rather than Gen4 x4.
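
To put numbers on that question, here is a back-of-the-envelope calculation of the theoretical link ceilings. It is only a sketch under assumptions: 128b/130b encoding (Gen3 and later), one direction, no TLP/DLLP protocol overhead, and all four devices transferring through the x4 bridge link at the same time. `link_gbytes_per_s` is a hypothetical helper name.

```python
# Raw per-lane signalling rate in GT/s; Gen3 and later all use 128b/130b encoding.
GT_PER_S = {3: 8.0, 4: 16.0, 5: 32.0}


def link_gbytes_per_s(gen, width):
    """Theoretical one-direction ceiling in GB/s, ignoring packet overhead."""
    return GT_PER_S[gen] * width * (128 / 130) / 8


uplink = link_gbytes_per_s(5, 4)      # bridge link as trained: about 15.75 GB/s
per_device = link_gbytes_per_s(4, 4)  # each device link: about 7.88 GB/s
demand = 4 * per_device               # four devices at full rate: about 31.5 GB/s
fair_share = uplink / 4               # per device if all four are busy: about 3.94 GB/s
gen3_x4 = link_gbytes_per_s(3, 4)     # about 3.94 GB/s

print(f"uplink {uplink:.2f} GB/s, demand {demand:.2f} GB/s")
print(f"share per device {fair_share:.2f} GB/s, Gen3 x4 {gen3_x4:.2f} GB/s")
```

Under those assumptions the Gen5 x4 bridge link carries half of what four Gen4 x4 devices could ask for together, and an equal split per device equals the Gen3 x4 ceiling. If the measurement was taken with only one device active, this arithmetic alone would not account for the drop.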

Code:
$ sudo lspci -vvs 21:00.0
21:00.0 PCI bridge: Broadcom / LSI Device c034 (rev a0) (prog-if 00 [Normal decode])
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 56
        NUMA node: 0
        Region 0: Memory at c4000000 (32-bit, non-prefetchable) [size=16M]
        Bus: primary=21, secondary=22, subordinate=2e, sec-latency=0
        I/O behind bridge: [disabled]
        Memory behind bridge: c5000000-c53fffff [size=4M]
        Prefetchable memory behind bridge: 000006dc00000000-000006dc73ffffff [size=1856M]
        Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR-
        BridgeCtl: Parity- SERR+ NoISA- VGA- VGA16+ MAbort- >Reset- FastB2B-
                PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
        Capabilities: [40] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [48] MSI: Enable- Count=1/8 Maskable+ 64bit+
                Address: 0000000000000000  Data: 0000
                Masking: 00000000  Pending: 00000000
        Capabilities: [68] Express (v2) Upstream Port, MSI 00
                DevCap: MaxPayload 2048 bytes, PhantFunc 0
                        ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ SlotPowerLimit 75.000W
                DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq-
                        RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 512 bytes, MaxReadReq 128 bytes
                DevSta: CorrErr+ NonFatalErr- FatalErr- UnsupReq+ AuxPwr- TransPend-
                LnkCap: Port #0, Speed 32GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <4us, L1 <32us
                        ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
                LnkCtl: ASPM Disabled; Disabled- CommClk-
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 32GT/s (ok), Width x4 (downgraded)
                        TrErr- Train- SlotClk- DLActive- BWMgmt- ABWMgmt-

$ sudo lspci -vvs 41:00.0
41:00.0 PCI bridge: Broadcom / LSI Device c034 (rev a0) (prog-if 00 [Normal decode])
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 37
        NUMA node: 0
        Region 0: Memory at f0000000 (32-bit, non-prefetchable) [size=16M]
        Bus: primary=41, secondary=42, subordinate=4e, sec-latency=0
        I/O behind bridge: [disabled]
        Memory behind bridge: f1000000-f27fffff [size=24M]
        Prefetchable memory behind bridge: 000003c000000000-000003e041ffffff [size=132128M]
        Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR-
        BridgeCtl: Parity- SERR+ NoISA- VGA- VGA16+ MAbort- >Reset- FastB2B-
                PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
        Capabilities: [40] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [48] MSI: Enable- Count=1/8 Maskable+ 64bit+
                Address: 0000000000000000  Data: 0000
                Masking: 00000000  Pending: 00000000
        Capabilities: [68] Express (v2) Upstream Port, MSI 00
                DevCap: MaxPayload 2048 bytes, PhantFunc 0
                        ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ SlotPowerLimit 75.000W
                DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq-
                        RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 256 bytes, MaxReadReq 128 bytes
                DevSta: CorrErr+ NonFatalErr- FatalErr- UnsupReq+ AuxPwr- TransPend-
                LnkCap: Port #0, Speed 32GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <4us, L1 <32us
                        ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
                LnkCtl: ASPM Disabled; Disabled- CommClk-
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 32GT/s (ok), Width x4 (downgraded)