Arista 7050QX - hard reboot and packet-storm

Notice: Page may contain affiliate links for which we may earn a small commission through services like Amazon Affiliates or Skimlinks.

Stril

Member
Sep 26, 2017
191
12
18
41
Hi!

I really need help...

In the last 48h one of my Arista 7050QX-32 rebootet two times. In the same time, there was a packet-storm and some recalculation of RSTP. The Arista is part of a MLAG with another unit, and linked to another Arista 7050QX.

There are NO other devices connected to the unit!

The logs are not very meaningful:

c40-0002-1#show reload cause
Reload Cause 1:
-------------------
The system rebooted due to a watchdog

Reload Time:
------------
Reload occurred at Thu Sep 20 08:25:44 2018 CEST.

Recommended Action:
-------------------
This may indicate a software or hardware problem.
Contact your customer support representative.

Debugging Information:
----------------------
None available.​


The "next" Arista shows:

Sep 20 08:22:12 c40-01312-1 Stp: %SPANTREE-6-STABLE_CHANGE: Stp state is now not stable
Sep 20 08:22:12 c40-01312-1 PortSec: %ETH-4-HOST_FLAPPING: Host 00:0e:1e:5f:8a:c0 in VLAN 310 is flapping between interface Ethernet30 and interface Port-Channel1
Sep 20 08:22:18 c40-01312-1 PortSec: %ETH-4-HOST_FLAPPING: Host 44:37:e6:c0:2b:d1 in VLAN 1 is flapping between interface Ethernet30 and interface Port-Channel1 (message repeated 2004 times in 5.76267 secs)
Sep 20 08:22:23 c40-01312-1 PortSec: %ETH-4-HOST_FLAPPING: Host 90:e2:ba:61:14:b4 in VLAN 310 is flapping between interface Ethernet30 and interface Port-Channel1 (message repeated 514 times in 5.03287 secs)
Sep 20 08:22:28 c40-01312-1 PortSec: %ETH-4-HOST_FLAPPING: Host ce:a7:a9:e5:a6:58 in VLAN 1 is flapping between interface Ethernet30 and interface Port-Channel1 (message repeated 434 times in 5.02307 secs)
Sep 20 08:22:36 c40-01312-1 Ebra: %LINEPROTO-5-UPDOWN: Line protocol on Interface Ethernet30, changed state to down
Sep 20 08:22:36 c40-01312-1 Stp: %SPANTREE-6-INTERFACE_DEL: Interface Ethernet30 has been removed from instance MST0
Sep 20 08:22:36 c40-01312-1 PortSec: %ETH-4-HOST_FLAPPING: Host 44:37:e6:e3:9e:d7 in VLAN 1 is flapping between interface Port-Channel12 and interface Port-Channel1 (message repeated 1271 times in 7.80836 secs)
Sep 20 08:23:06 c40-01312-1 Stp: %SPANTREE-6-STABLE_CHANGE: Stp state is now stable​


--> Ethernet30 is connected to the "rebootet" Arista
--> PortChannel1 is the MLAG-Link of this switch.


Did you ever see something like that?
What do you think - is it a hardware-problem?

Thank you for your hints!
 

zedascuras

New Member
Feb 15, 2015
12
1
3
39
Accordingly to what I was able to search on google, seems like hardware related:

"The reload cause shown in the command ‘show reload cause’ indicates that the device reloaded due to a watchdog. This means that the watchdog feature on the switch detected that software was unresponsive for over 45 seconds and to correct this state the switch was reloaded.

I would suggest getting in touch with our support (support@arista.com) providing the show tech output collected from the affected device so we can further investigate the reason for the reboot. "

From here:
Arista EOS Central - watchdog reason
 

Stril

Member
Sep 26, 2017
191
12
18
41
what version of EOS? I know older ones had issues related to storms and the management cpu getting held up
Hi!

I am using eos 4.18.5M
Is there any newer version out there for the QX?

I can't contact Arista support, as the units cone from eBay...

Thank you
Stril