Switching


Ask questions and share experiences about EX and QFX portfolios and all switching solutions across your data center, campus, and branch locations.

Two switches showing disconnected multiple times per day but no errors on port

  • 1.  Two switches showing disconnected multiple times per day but no errors on port

    Posted 05-14-2025 16:11

    I have two EX2300 switches doing the same thing at two different school sites. They lose connection to the Mist dashboard, but I can still connect via SSH with MobaXterm. One has a fiber uplink; I checked its optic power and it's fine. I checked for errors on the port and there were none, not even drops. The other switch has a DAC cable to the primary switch in the rack and also shows no errors on the port. I do see topo changes generated on the uplink port on both switches. I also checked whether both were connecting on port 443 as a system connection, and they were.

    Other than scheduling a "have you tried turning it off and on again" switch reboot, I can't think of what to check next. Any suggestions?

     



    ------------------------------
    JOHN WILLIAMSON
    ------------------------------


  • 2.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 9 days ago

    Sounds like a spanning tree issue with the topo change.

    Potential storm? Have you looked at the logs:
    show log messages

    Suggest setting up spanning tree root with a lower priority and setting up access ports as edge ports. See if that helps. 
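
    To illustrate why a lower priority matters: spanning tree elects as root the bridge with the lowest bridge ID, comparing priority first and MAC address second. The sketch below is only a toy model of that comparison. The switch names, priorities, and MAC addresses are made up for illustration (the MACs come from the documentation range) and are not taken from this network.

```python
# Hypothetical bridge IDs as (priority, MAC address); all values are illustrative.
bridges = {
    "core-fiber": (4096, "00:00:5e:00:53:01"),
    "idf-a": (32768, "00:00:5e:00:53:0a"),
    "idf-b": (32768, "00:00:5e:00:53:0b"),
}


def bridge_id(priority, mac):
    """Return a sortable bridge ID: priority first, then the MAC as raw bytes."""
    return (priority, bytes.fromhex(mac.replace(":", "")))


def elect_root(candidates):
    """Pick the bridge with the numerically lowest bridge ID."""
    return min(candidates, key=lambda name: bridge_id(*candidates[name]))


print(elect_root(bridges))
```

    If the core switch were missing or left at the default priority, the election would fall through to the lowest MAC address, which is why an explicitly low priority on the intended root is the usual recommendation.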



    ------------------------------
    Chris Floom
    ------------------------------



  • 3.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 8 days ago

    I'd go with what Chris is saying. Check which ports are in the root role on the two devices.



    ------------------------------
    TEDD
    ------------------------------



  • 4.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 8 days ago

    The root for these switches, and for all the switches at each school site, is the core fiber switch at the site. The fiber switch has a much better priority than any other switch. Many of the sites have few enough IDFs that each switch is a home run to the fiber switch. Of the two switches in this topic, one connects to a primary switch in the IDF and one is the only switch in the rack. I will have to check whether the same MAC address is the root bridge for all the switches at a site. I suspect it is.



    ------------------------------
    JOHN WILLIAMSON
    ------------------------------



  • 5.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 7 days ago

    Would be interested to know if the logs can shed more light. 



    ------------------------------
    Chris Floom
    ------------------------------



  • 6.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 6 days ago

    I can certainly post a snippet of the message log from around the time of the events on the uplink ports of the problem switches, if you like. It shows the topo changes generated on the port. Interestingly, for a pair of switches not doing much, the log file only holds 2 days' worth of events before it archives. There are a lot of these on one of the switches:

    Jun 10 20:25:45  RVNL-IDFFS02 mgd[24736]: UI_CHILD_EXITED: Child exited: PID 24738, status 1, command '/sbin/snapshot'

    And a lot of port-learning messages for all the ports on the other switch:

    Jun 10 10:33:40  SRNL-IDFJS01 dc-pfe[20667]: LBCM-L2,brcm_ifl_l2_init(),2898:(brcm_ifl_l2_init:2898) Setting port learning to sw learning :mge-0/0/19, filter_count 0, enable 1
    Jun 10 10:33:40  SRNL-IDFJS01 dc-pfe[20667]: LBCM-L2,brcm_ifl_l2_init(),2907:Setting port learning config: mge-0/0/19, learn_flg 2, move_flg 2
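
    One way to see which messages are filling the log is to tally them by host, process, and message tag. The following is a minimal sketch, assuming the log has been copied off the switch as plain text. It runs on the three sample lines above; the regular expression and the `tally` helper are illustrative and only cover the line layout shown here.

```python
import re
from collections import Counter

# Sample lines copied from this post; a real run would read the saved log file.
SAMPLE_LOG = """\
Jun 10 20:25:45  RVNL-IDFFS02 mgd[24736]: UI_CHILD_EXITED: Child exited: PID 24738, status 1, command '/sbin/snapshot'
Jun 10 10:33:40  SRNL-IDFJS01 dc-pfe[20667]: LBCM-L2,brcm_ifl_l2_init(),2898:(brcm_ifl_l2_init:2898) Setting port learning to sw learning :mge-0/0/19, filter_count 0, enable 1
Jun 10 10:33:40  SRNL-IDFJS01 dc-pfe[20667]: LBCM-L2,brcm_ifl_l2_init(),2907:Setting port learning config: mge-0/0/19, learn_flg 2, move_flg 2
"""

# Layout assumed: timestamp, hostname, process[pid]:, then the message text.
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+"
    r"(?P<proc>[\w.-]+)\[\d+\]:\s+(?P<msg>.*)$"
)


def tally(log_text):
    """Count log lines per (host, process, tag); lines that do not match are skipped."""
    counts = Counter()
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue
        # Use the text before the first ':' or ',' as a rough message tag.
        tag = re.split(r"[:,]", match["msg"], maxsplit=1)[0]
        counts[(match["host"], match["proc"], tag)] += 1
    return counts


counts = tally(SAMPLE_LOG)
for (host, proc, tag), n in counts.most_common():
    print(f"{n:4d}  {host}  {proc}  {tag}")
```

    Running the same tally on a log from a switch that is behaving normally would give a baseline to compare against.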

     



    ------------------------------
    JOHN WILLIAMSON
    ------------------------------



  • 7.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 6 days ago

    So in your problem statement, you say you do see topo changes, and we are assuming from the wording that there is only one uplink to the "fiber" switch.

    That tells me there cannot be any loops, so if you're using spanning tree on all VLANs and the primary switch has the lowest priority (assuming there is a primary switch at each school), then something else is triggering the topo changes. You also mention losing connection to Mist. That could trigger a topo change if your internet connection is somehow using STP as well.

    I am also assuming each school install stands by itself and is not connected back to a central site. If all locations go back to a central site, then that primary needs to be root, and ALL other devices need a higher priority value to keep them pointed to ONE location.

    Can you provide a model of one of the problem locations, and/or a snippet of the configuration, especially the protocols and VLAN configs?



    ------------------------------
    TEDD
    ------------------------------



  • 8.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 6 days ago

    All the schools are set up the same: a core EX4600 fiber switch that all the IDFs feed into, and the 4600 is the site router and gateway to the data center for all outbound traffic. Since only 2 of the 350 switches in the inventory are showing this issue, I think I can feel safe that it is not a WAN configuration issue.

    Each school site is a star topology from each IDF to the fiber switch. I don't currently have any dual-connected switches that would enable a secondary port if the primary went down. Our WAN is a star topology, and the lines from each school site use BGP to the data center.



    ------------------------------
    JOHN WILLIAMSON
    ------------------------------



  • 9.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 6 days ago

    Here is a sparse definition of the LBCM syslog:

    The syslog message "LBCM-L2,brcm_ifl_l2_init()" is related to the initialization of Layer 2 interfaces on devices using Broadcom chipsets. This message typically appears in the context of control plane operations and does not impact the forwarding plane, meaning transit traffic is not affected.

    I would think that if the logical interface (hence the ifl_l2 in the syslog) had an issue, it could be logically shut down or restarted at the chip level, which would trigger a topo change, as STP reacts extremely fast to any comms issue between switches.

    I would probably compare the hardware against a known-good 2300, or replace the SFPs between the primary and the IDF. I've had this issue before, and the SFP was at fault even though it ran fine 99% of the time.



    ------------------------------
    TEDD
    ------------------------------



  • 10.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 6 days ago

    In the case of the switch named RV, I swapped out a DAC cable for a new SM fiber and a pair of SFPs. This switch is daisy-chained to a primary switch in the IDF rack. This did not resolve the issue. The switch named SR has a single-mode fiber home run to the core switch. I haven't swapped anything out on this one yet.

    What gives me pause is that these two switches have almost nothing plugged into them (nothing in the case of the RV switch and only three devices in the case of the SR switch), and yet their logs fill up in 2 days, whereas other switches I looked at had about 7 days of log and did not have the load of messages I posted above.



    ------------------------------
    JOHN WILLIAMSON
    ------------------------------



  • 11.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 5 days ago

    A head scratcher for sure......

    I would compare the entire config of those two against a switch known not to have these issues.

    The RV switch appears to have auto-snapshot enabled, and judging by the syslog message it may be running too often. You can safely remove it if it is there.

    Also, make sure NTP associations are working.

    Compare Junos OS and firmware versions if you haven't already done so.

    Are you using RSTP or VSTP?

    What type of devices are on the SR?

    You can set those ports as "edge" ports to mitigate any BPDU/STP traffic.
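
    As a rough sketch of the config comparison suggested above, the two configurations could be exported in set format and diffed offline. The excerpts below are hypothetical stand-ins, not the actual configs of these switches, and the host names are invented. `config_diff` is an illustrative helper built on Python's standard `difflib`.

```python
import difflib

# Hypothetical set-format excerpts; real input would be the saved configs.
known_good = """\
set system host-name GOOD-IDF01
set protocols rstp interface ge-0/0/0 edge
""".splitlines()

suspect = """\
set system host-name SUSPECT-IDF01
set system auto-snapshot
set protocols rstp interface ge-0/0/0 edge
""".splitlines()


def config_diff(reference, candidate):
    """Return only the added (+) and removed (-) lines between two configs."""
    diff = difflib.unified_diff(
        reference, candidate,
        fromfile="known_good", tofile="suspect",
        lineterm="", n=0,
    )
    return [
        line for line in diff
        if line.startswith(("+", "-")) and not line.startswith(("+++", "---"))
    ]


changes = config_diff(known_good, suspect)
for line in changes:
    print(line)
```

    Host names and management addresses will always differ, so the lines worth reading are the ones that appear on only one side, such as the auto-snapshot statement in this made-up example.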



    ------------------------------
    TEDD
    ------------------------------



  • 12.  RE: Two switches showing disconnected multiple times per day but no errors on port

    Posted 2 days ago

    All the EX2300 switches in our inventory have the same firmware, 23.4R2-S4.11.

    We're using RSTP.

    There are three VoIP phones plugged into the EX2300 named SR. The profile they use has the standard data ports set as edge ports.

    The NTP settings are correct, as they are pushed out by the template, assuming you mean Network Time Protocol.



    ------------------------------
    JOHN WILLIAMSON
    ------------------------------