EDIT 1 : post #3, ESXI is not the culprit here. Launching anything bare metal do exactly the same.
Hi,
I'm finally updating my homelab network to 10GB, and I have some issues with my ConnectX-2 cards.
The server is an R710, with ESXI 6.5 u1. I want to passtrough the card to a Windows Server 2016 VM, or a Debian.
They are the HP version of the ConnectX-2 MNPA19-XTR. The firmware was old (2.9.1000), and needed an upgrade before going further. The process is common, well documented, and I flashed the generic firmware 2.10.720. In the process, I backuped the original firmware and config, just in case.
After that, the Windows VM crash the entire host when booting. In fact, even a new VM with the install ISO do the same before showing anything. With the Debian VM, it's fine... but the Mellanox driver say something is wrong (something about bad IRQ mapping, the last message is "BIOS or ACPI interrupt routing problem?"), and there is no adapter available.
So I tried to flash the 2.9.8350 firmware. Now, I can't even boot a linux VM with the card, and the host just hang, without even rebooting.
Some details :
- Tested with two cards, exactly the same result on both.
- The VM had all the CPUs, reserved memory and was the only one launched.
- Latest driver was installed on Windows, without improvement.
- With 2.9.1000 and 2.9.1200 firmware, Windows boot correctly, without crashing the host. Sadly, I didn't verified the result with mlx4_core on Linux.
- I will try to reflash the original firmware, with the recovery jumper.
- I did not replace the HP PXE rom boot with the default, because it's more recent. But the PXE boot work well, and I have seen examples of people keeping the HP one anyway.
- There is no difference between the HP ini file and the default ones.
My questions is simple : What is going on ? It's a common OEM card, a common procedure, an very common host, vanilla OSes. There is even examples here of people with the same configuration.
Thank you !
Hi,
I'm finally updating my homelab network to 10GB, and I have some issues with my ConnectX-2 cards.
The server is an R710, with ESXI 6.5 u1. I want to passtrough the card to a Windows Server 2016 VM, or a Debian.
They are the HP version of the ConnectX-2 MNPA19-XTR. The firmware was old (2.9.1000), and needed an upgrade before going further. The process is common, well documented, and I flashed the generic firmware 2.10.720. In the process, I backuped the original firmware and config, just in case.
After that, the Windows VM crash the entire host when booting. In fact, even a new VM with the install ISO do the same before showing anything. With the Debian VM, it's fine... but the Mellanox driver say something is wrong (something about bad IRQ mapping, the last message is "BIOS or ACPI interrupt routing problem?"), and there is no adapter available.
So I tried to flash the 2.9.8350 firmware. Now, I can't even boot a linux VM with the card, and the host just hang, without even rebooting.
Some details :
- Tested with two cards, exactly the same result on both.
- The VM had all the CPUs, reserved memory and was the only one launched.
- Latest driver was installed on Windows, without improvement.
- With 2.9.1000 and 2.9.1200 firmware, Windows boot correctly, without crashing the host. Sadly, I didn't verified the result with mlx4_core on Linux.
- I will try to reflash the original firmware, with the recovery jumper.
- I did not replace the HP PXE rom boot with the default, because it's more recent. But the PXE boot work well, and I have seen examples of people keeping the HP one anyway.
- There is no difference between the HP ini file and the default ones.
My questions is simple : What is going on ? It's a common OEM card, a common procedure, an very common host, vanilla OSes. There is even examples here of people with the same configuration.
Thank you !
Last edited: