1) Did you follow the setup config licensing guide v2 in the first post of this thread?Is anyone able to help me get to the bottom of the issues I'm having with [probably] the switch? I need to know if I should be asking for a refund for the switch, or other parts, or if there's really nothing wrong anywhere and I'm just an idiot.
k.1. Yes
2. Yes
3. No. I was asked to do this, and I do not know how. I asked how to do this, and got no reply. I can SSH into the switch. What do I need to do to supply that information?
Regarding Intel being picky about DACs. I did the xl710_unlocker thing (which is supposed to be for optics, right?). Regardless, I think people keep glossing over the fact that only 1 of the 40 gig ports show any sign of life, and the one that does is riddled with nothing but errors. I don't mean to be rude about that last part, but please tell me what I specifically need to do on the switch to provide the required information.
show run
Current configuration:
!
ver 08.0.30uT7f3
!
stack unit 1
module 1 icx6610-24p-poe-port-management-module
module 2 icx6610-qsfp-10-port-160g-module
module 3 icx6610-8-port-10g-dual-mode-module
stack disable
!
!
!
!
vlan 1 name DEFAULT-VLAN by port
router-interface ve 1
!
!
!
!
!
aaa authentication web-server default local
aaa authentication login default local
hostname switcheroo
ip dhcp-client disable
!
no telnet server
username root password .....
!
!
!
!
!
!
!
!
!
interface ethernet 1/1/3
inline power
!
interface ethernet 1/1/6
inline power
!
interface ethernet 1/1/7
inline power
!
interface ethernet 1/3/1
speed-duplex 10G-full
!
interface ethernet 1/3/2
speed-duplex 10G-full
!
interface ethernet 1/3/3
speed-duplex 10G-full
!
interface ethernet 1/3/4
speed-duplex 10G-full
!
interface ethernet 1/3/5
speed-duplex 10G-full
!
interface ethernet 1/3/6
speed-duplex 10G-full
!
interface ethernet 1/3/7
speed-duplex 10G-full
!
interface ethernet 1/3/8
speed-duplex 10G-full
!
interface ve 1
ip address 192.168.1.5 255.255.255.0
!
!
!
!
!
!
!
!
!
end
with the QSFP DAC plugged into each of the left side connections please provide the switch's show interface output. It may also be helpful to show status output (from your OS) as well for each port but that is up to you.Excellent, thank you. Here it is:
with the QSFP DAC plugged into each of the left side connections please provide the switch's show interface output. It may also be helpful to show status output (from your OS) as well for each port but that is up to you.
recommend using the same cable for each interface test.
SSH@switcheroo>show interface ethernet 1/2/1
40GigabitEthernet1/2/1 is up, line protocol is up
Port up for 1 hour(s) 17 minute(s) 51 second(s)
Hardware is 40GigabitEthernet, address is 748e.f8fe.93f6 (bia 748e.f8fe.940f)
Interface type is 40Gig Fiber
Configured speed 40Gbit, actual 40Gbit, configured duplex fdx, actual fdx
Configured mdi mode AUTO, actual none
Member of L2 VLAN ID 1, port is untagged, port state is FORWARDING
BPDU guard is Disabled, ROOT protect is Disabled, Designated protect is Disabled
Link Error Dampening is Disabled
STP configured to ON, priority is level0, mac-learning is enabled
Openflow is Disabled, Openflow Hybrid mode is Disabled, Flow Control is enabled
Mirror disabled, Monitor disabled
Mac-notification is disabled
Not member of any active trunks
Not member of any configured trunks
No port name
MTU 1500 bytes, encapsulation ethernet
300 second input rate: 224 bits/sec, 0 packets/sec, 0.00% utilization
300 second output rate: 1976 bits/sec, 2 packets/sec, 0.00% utilization
7685 packets input, 826324 bytes, 0 no buffer
Received 272 broadcasts, 7404 multicasts, 9 unicasts
33 input errors, 0 CRC, 0 frame, 0 ignored
0 runts, 30 giants
502846 packets output, 56757049 bytes, 0 underruns
Transmitted 181923 broadcasts, 319738 multicasts, 1185 unicasts
0 output errors, 0 collisions
Relay Agent Information option: Disabled
Egress queues:
Queue counters Queued packets Dropped Packets
0 502861 0
1 0 0
2 0 0
3 0 0
4 0 0
5 2 0
6 0 0
7 0 0
tim@fileserver:~$ ethtool enp98s0
Settings for enp98s0:
Supported ports: [ FIBRE ]
Supported link modes: 40000baseCR4/Full
Supported pause frame use: Symmetric Receive-only
Supports auto-negotiation: Yes
Supported FEC modes: Not reported
Advertised link modes: 40000baseCR4/Full
Advertised pause frame use: No
Advertised auto-negotiation: Yes
Advertised FEC modes: Not reported
Speed: 40000Mb/s
Duplex: Full
Port: Direct Attach Copper
PHYAD: 0
Transceiver: internal
Auto-negotiation: off
Cannot get wake-on-lan settings: Operation not permitted
Current message level: 0x00000007 (7)
drv probe link
Link detected: yes
SSH@switcheroo>show interface ethernet 1/2/6
40GigabitEthernet1/2/6 is up, line protocol is up
Port up for 32 second(s)
Hardware is 40GigabitEthernet, address is 748e.f8fe.93f6 (bia 748e.f8fe.9414)
Interface type is 40Gig Fiber
Configured speed 40Gbit, actual 40Gbit, configured duplex fdx, actual fdx
Configured mdi mode AUTO, actual none
Member of L2 VLAN ID 1, port is untagged, port state is FORWARDING
BPDU guard is Disabled, ROOT protect is Disabled, Designated protect is Disabled
Link Error Dampening is Disabled
STP configured to ON, priority is level0, mac-learning is enabled
Openflow is Disabled, Openflow Hybrid mode is Disabled, Flow Control is enabled
Mirror disabled, Monitor disabled
Mac-notification is disabled
Not member of any active trunks
Not member of any configured trunks
No port name
MTU 1500 bytes, encapsulation ethernet
300 second input rate: 216 bits/sec, 0 packets/sec, 0.00% utilization
300 second output rate: 1416 bits/sec, 1 packets/sec, 0.00% utilization
10 packets input, 964 bytes, 0 no buffer
Received 0 broadcasts, 10 multicasts, 0 unicasts
0 input errors, 0 CRC, 0 frame, 0 ignored
0 runts, 0 giants
64 packets output, 6290 bytes, 0 underruns
Transmitted 20 broadcasts, 44 multicasts, 0 unicasts
0 output errors, 0 collisions
Relay Agent Information option: Disabled
Egress queues:
Queue counters Queued packets Dropped Packets
0 68 0
1 0 0
2 0 0
3 0 0
4 0 0
5 0 0
6 0 0
7 0 0
tim@fileserver:~$ ethtool enp98s0
Settings for enp98s0:
Supported ports: [ FIBRE ]
Supported link modes: 40000baseCR4/Full
Supported pause frame use: Symmetric Receive-only
Supports auto-negotiation: Yes
Supported FEC modes: Not reported
Advertised link modes: 40000baseCR4/Full
Advertised pause frame use: No
Advertised auto-negotiation: Yes
Advertised FEC modes: Not reported
Speed: 40000Mb/s
Duplex: Full
Port: Direct Attach Copper
PHYAD: 0
Transceiver: internal
Auto-negotiation: off
Cannot get wake-on-lan settings: Operation not permitted
Current message level: 0x00000007 (7)
drv probe link
Link detected: yes
...I don't really know what's going on and I don't know how to articulate it.
STOP again! Please don't wipe your 14 year maintained system. If it has been maintained that long I'd either consider it "prod" or of significant sentimental value. Put it back on your network the way it was and leave it alone until you've finished testing the switch.What was in the machine before was a 10G SFP+ NIC. It was enp98s0. So I put this thing in (XL710) and it takes the same name. I've since put in a different 10G NIC and it took the name of enp129s0. All is fine and works there. Got its MAC address in my router for a static lease (192.168.1.2 is the IP of the machine I desire). When I mess around with this XL710 on enp98s0 and I go to kill it with ifdown, it says "DHCPRELEASE of 192.168.1.2 on enp98s0 to 192.168.1.1 port 67 (xid=0x7d5d72e3)" This is after assigning it a static IP in /etc/network/interfaces. I really don't get what's going
on, and I am about to just wipe the OS install (which I've had maintained for like 14 years now, and would be a HUGE pain in the butt to start over).
management-vrf MGMT
which was accepted fine, and when I check with 'show management-vrf' it does indeed appear to have worked.management-[B]vrf
?SSH connections (inbound):
1 established, client ip address 192.168.1.50, server hostkey RSA, user is USER, privilege super-user, in config mode
using vrf default-vrf.
you are connecting to this session
10 second(s) in idle
sh ip int
Interface IP-Address OK? Method Status Protocol VRF
Eth mgmt1 192.168.1.171 YES manual up up default-vrf
Ve 10 192.168.1.173 YES manual up up MGMT
VRF MGMT, default RD 4:10, Table ID 1
Configured as management-vrf
IP Router-Id: 10.10.10.10
Interfaces:
ve10
Address Family IPv4
Max Routes: 200
Number of Unicast Routes: 1
When I tell it to use a static IP is when I can actually form a connection with it, but it's then doing all of those RX errors. And using a static IP is also when it says that thing with the old IP address and DHCP when I go to ifdown the connection. I am going to force it to use the old "eth" names and do some more poking. Later I will stick the card in another server I just got and installed 22.04 onto. Honestly, the 14 year old install has been a pain when I've gone and upgraded it. It's pretty janky at this point, which is probably why I'm having so many issues. I will report back some stuff later. Thank you....
STOP! Take a deep breath.
I reviewed your previous posts. I'd like to get some clarity first. I'm going to give you some suggestions in a bit so bear with me.
Your interface output looks "fine" even the small number of giants (hoping they're just 1522 and not something else - which could be coming form something else on your network). I'm going with the test on the 1/2/6 was simply brief as the uptime on it is short. - please confirm
based on the show int I'd say both 40gbe ports are fine.
for each interface test did you have good link / status "blinky lights" on your NIC?
You previously reported you had no electrical activity on the bottom port - is that still the case?
STOP again! Please don't wipe your 14 year maintained system. If it has been maintained that long I'd either consider it "prod" or of significant sentimental value. Put it back on your network the way it was and leave it alone until you've finished testing the switch.
You are experimenting and troubleshooting - use something else for this process.
Beg, borrow, steal another system. Throw an OS on it (clean install) and let's troubleshoot the rest of the ports on your switch. yeah?
seriously. Don't EFF around with the system you've been using. Use something else to finish testing out the switch.
Next, I saw a comment that made me think you want to rely on DHCP for testing. Bad idea right now.
80% of most network issues are physical layer.
If you rely on DHCP you may question whether DHCP is behaving or you have something else going on. Remove DHCP from the equation. Use static IP's for the remainder of the testing and on a CLEAN system.
that way you can focus on do I have a bad port, do i have a bad cable.
simplify your troubleshooting by simplifying the number of variables in any given test.
this. Makes me think you may have a conflict on your network with your static or some other thing unrelated to the switch (L2).When I tell it to use a static IP is when I can actually form a connection with it, but it's then doing all of those RX errors. And using a static IP is also when it says that thing with the old IP address and DHCP when I go to ifdown the connection. I am going to force it to use the old "eth" names and do some more poking.
yep.Later I will stick the card in another server I just got and installed 22.04 onto. Honestly, the 14 year old install has been a pain when I've gone and upgraded it. It's pretty janky at this point, which is probably why I'm having so many issues. I will report back some stuff later. Thank you.
Changed the naming to eth*. No change. I don't understand why, when I use a static IP address (under /etc/network/interfaces) and I go to ifdown it, it mentions releasing the old IP address of 192.168.1.2. The static IP I'm assigning it is 192.168.1.17. When I do an ifconfig, I get something like the following:this. Makes me think you may have a conflict on your network with your static or some other thing unrelated to the switch (L2).
yep.
please do report back as to whether the blinkies were working correctly on your NIC when connected to BOTH 40Gbe interfaces on the switch.
eth0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
inet 192.168.1.17 netmask 255.255.255.0 broadcast 192.168.1.255
inet6 fe80::3efd:feff:feb2:cd30 prefixlen 64 scopeid 0x20<link>
ether 3c:fd:fe:b2:cd:30 txqueuelen 1000 (Ethernet)
RX packets 3392 bytes 504447 (504.4 KB)
RX errors 3819 dropped 0 overruns 0 frame 3819
TX packets 5184 bytes 637121 (637.1 KB)
TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
If it's set to use DHCP it just hangs on the system looking for a DHCP lease forever and ever. On static it can send and receive pings, but it has no real connectivity. Pings only work within the LAN. It can't get any DNS info for some reason.@ramicio
dumb question. setting it back to DHCP or with the server assigned to .17 turned off - you can't ping .17 right? just checking. Same thing happen if you pick some other address for a static. one you've tested for no replies before assigning it?
you got some other funk going on in your network.
If it were me I'd at least finish testing the breakout ports (assuming you have a DAC or AOC 40gbe to 10gbe breakout... the breakouts are probably fine but would be ncie to know all the ports are good before your return window closes...
dont' feel like a liar. Having someone else help you - often is nothing more than a different set of eyes giving you a (slightly even) different process which forces you out of behaviors which may have contributed to you false negative testing of the ports. its why we help each other.
besides you re-learned how to pull a config and look at port stats on the switch. sounds like a winner afternoon of network geekdom to me.
the fact that you can ping the local lan with a static IP leads me to believe that dhcp packets aren't making it. ping with default is a very small packet. Have you looked at the interface statistics to see if there are errors (incrementing) on the connected switch port?If it's set to use DHCP it just hangs on the system looking for a DHCP lease forever and ever. On static it can send and receive pings, but it has no real connectivity. Pings only work within the LAN. It can't get any DNS info for some reason.
I really don't think it has anything to do with anything on my network. Why is it these 2 ports with this specific NIC, and I have zero problem with my 10 gig stuff?
I don't have any kind of breakout cable and I don't plan on buying one for any reason.
I need to know if it's the switch, the DAC, or the NIC. How can I determine this? I'm honestly about to just junk it all, because I'm willing to bet it's all a problem with the XL710 and it's probably never going to work, and I refuse to buy a Mellanox or any other brand.
I'm a hobbyist, so I don't have extra stuff, and won't have extra stuff.the fact that you can ping the local lan with a static IP leads me to believe that dhcp packets aren't making it. ping with default is a very small packet. Have you looked at the interface statistics to see if there are errors (incrementing) on the connected switch port?
Have you tried traceroute to a known ip - say 8.8.8.8 - again very small packets, would not involve dns if you disable ip to name resolution (traceroute option) it will even go relatively quickly. If it goes slowly then I'm inclined to believe packet corruption is occurring.
typically when I see a problem like this I suspect the cable first, then the nic, then the switch port. Just my experience is all that leads me to things in that order. not saying its right or wrong to look at it other ways. If you have another DAC, AOC, set of optics with patch, you may want to try that. If that doesn't work and you have another nic then try that. You may have to build a matrix of tests and results to ultimately weed out what is wrong. could be the switch too, could be the port on the switch, the nic, the DAC all of the above in combination. Key thing here is change 1 thing at a time and try again. record the result.
regarding your last post - it is very clear you are quite frustrated - and I say this with all due respect (and not knowing what other efforts and time have been spent, nor your skill, knowledge level or experience history)
This may not be the best solution set for you. Only you can decide if the return on your investment - time and money - is worth the return that you will get from making this work. for some people its the end result, for some folks its the end result as well as what they learned getting there. I don't know the answer for you. only you do.