Switching

 View Only
last person joined: 2 days ago 

Ask questions and share experiences about EX and QFX portfolios and all switching solutions across your data center, campus, and branch locations.

PIM problems on two EX4600s

  • 1.  PIM problems on two EX4600s

    Posted 04-18-2022 18:47
    This should "just work", according to Juniper's documentation...but it doesn't, and I'd really appreciate some suggestions for how to get this working.

    I have two EX4600s at different campuses, connected via a VPN. All IPs can ping site-to-site, and all inter-site TCP, UDP, and ICMP traffic works fine.

    We have our phone PBX and paging server located at one campus, and IP speakers at both sites. Unicast paging works fine, but multicast paging does not.

    I set up PIM to allow the forwarding of multicast, using the following commands:
    set protocols pim traceoptions file pim.log
    set protocols pim traceoptions file size 5m
    set protocols pim traceoptions file world-readable
    set protocols pim traceoptions flag join detail
    set protocols pim traceoptions flag normal detail
    set protocols pim traceoptions flag register detail
    set protocols pim rp local family inet address 10.100.254.254
    set protocols pim rp auto-rp discovery
    set protocols pim rp static address 10.101.254.254
    set protocols pim interface all mode sparse
    set protocols pim interface fxp0.0 disable
    set protocols pim reset-tracking-bit
    set protocols pim propagation-delay 500
    set protocols pim override-interval 4000

    Identical config on the remote EX4600 (except IPs reversed, of course).

    LocalEx4600> show pim rps
    Instance: PIM.master

    address-family INET
    RP address      Type        Mode   Holdtime Timeout Groups Group prefixes
    10.101.254.254 static      sparse        0    None      0 224.0.0.0/4
    10.100.254.254 static      sparse      150    None      5 224.0.0.0/4

    address-family INET6

    BUT no neighbours:

    LocalEx4600> show pim neighbors
    B = Bidirectional Capable, G = Generation Identifier
    H = Hello Option Holdtime, L = Hello Option LAN Prune Delay,
    P = Hello Option DR Priority, T = Tracking Bit,
    A = Hello Option Join Attribute

    Instance: PIM.master

    AND apparently no route?

    LocalEx4600> show pim statistics

    PIM Message type        Received       Sent  Rx errors
    V2 Hello                       0     533652          0
    V2 Register                  303          0          0
    V2 Register Stop               0          0          0
    V2 Join Prune                  0          0          0
    V2 Bootstrap                   0          0          0
    V2 Assert                      0          0          0
    V2 Graft                       0          0          0
    V2 Graft Ack                   0          0          0
    V2 Candidate RP                0          0          0
    V2 State Refresh               0          0          0
    V2 DF Election                 0          0          0
    V1 Query                       0          0          0
    V1 Register                    0          0          0
    V1 Register Stop               0          0          0
    V1 Join Prune                  0          0          0
    V1 RP Reachability             0          0          0
    V1 Assert                      0          0          0
    V1 Graft                       0          0          0
    V1 Graft Ack                   0          0          0
    AutoRP Announce                0          0          0
    AutoRP Mapping                 0          0          0
    AutoRP Unknown type            0
    Anycast Register               0          0          0
    Anycast Register Stop          0          0          0

    Global Statistics

    Hello dropped on neighbor policy                  0
    Unknown type                                      0
    V1 Unknown type                                   0
    Unknown Version                                   0
    Neighbor unknown                                  0
    Bad Length                                        0
    Bad Checksum                                      0
    Bad Receive If                                    0
    Rx Bad Data                                       0
    Rx Intf disabled                                  0
    Rx V1 Require V2                                  0
    Rx V2 Require V1                                  0
    Rx Register not RP                              300
    Rx Register no route                              0
    Rx Register no decap if                           1
    Null Register Timeout                             0
    RP Filtered Source                                0
    Rx Unknown Reg Stop                               0
    Rx Join/Prune no state                            0
    Rx Join/Prune on upstream if                      0
    Rx Join/Prune for invalid group                   0
    Rx Join/Prune messages dropped                    0
    Rx sparse join for dense group                    0
    Rx Graft/Graft Ack no state                       0
    Rx Graft on upstream if                           0
    Rx CRP not BSR                                    0
    Rx BSR when BSR                                   0
    Rx BSR not RPF if                                 0
    ipv4 BSR pkt drop due to excessive rate           0
    ipv6 BSR pkt drop due to excessive rate           0
    Rx unknown hello opt                              0
    Rx data no state                              88132
    Rx RP no state                                    0
    Rx aggregate                                      0
    Rx malformed packet                               0
    Rx illegal TTL                                    0
    Rx illegal destination address                    0
    No RP                                             0
    No register encap if                              6
    No route upstream                                 1
    Nexthop Unusable                                  0
    RP mismatch                                       0
    RP mode mismatch                                  0
    RPF neighbor unknown                              0
    Rx Joins/Prunes filtered                          0
    Tx Joins/Prunes filtered                          0
    Embedded-RP invalid addr                          0
    Embedded-RP limit exceed                          0
    Embedded-RP added                                 0
    Embedded-RP removed                               0
    Rx Register msgs filtering drop                   0
    Tx Register msgs filtering drop                   0
    Rx Bidir Join/Prune on non-Bidir if               0
    Rx Bidir Join/Prune on non-DF if                  0
    (*,G)Join drop due to ssm range check             0
         
    Interfaces:

    LocalEx4600> show pim interfaces  

    Stat = Status, V = Version, NbrCnt = Neighbor Count,
    S = Sparse, D = Dense, B = Bidirectional,
    DR = Designated Router, DDR = Dual DR, DistDR = Distributed DR,
    P2P = Point-to-point link, P2MP = Point-to-Multipoint,
    Active = Bidirectional is active, NotCap = Not Bidirectional Capable

    Name               Stat Mode IP V State        NbrCnt JoinCnt(sg/*g) DR address
    irb.55             Up   S     4 2 DR,NotCap         0 143/0          10.100.155.1
    lo0.0              Up   S     4 2 DR,NotCap         0 0/0            10.100.254.254
    pimd.32771         Up   S     4 2 P2P,NotCap        0 0/0
    pime.32769         Up   S     4 2 P2P,NotCap        0 0/0

    Ping works fine:

    LocalEx4600> ping 10.101.254.254 source 10.100.254.254
    PING 10.101.254.254 (10.101.254.254): 56 data bytes
    64 bytes from 10.101.254.254: icmp_seq=0 ttl=62 time=11.640 ms
    64 bytes from 10.101.254.254: icmp_seq=1 ttl=62 time=27.534 ms
    64 bytes from 10.101.254.254: icmp_seq=2 ttl=62 time=24.173 ms
    ^C
    --- 10.101.254.254 ping statistics ---
    3 packets transmitted, 3 packets received, 0% packet loss
    round-trip min/avg/max/stddev = 11.640/21.116/27.534/6.839 ms

    A random sample of pim.log looks like this:

    Apr 17 21:24:28.614456 PIM resolving request for Group 239.1.1.5 Source 10.101.155.21 Interface pimd.32771
    Apr 17 21:24:34.464449 PIM resolving request for Group 239.1.1.5 Source 10.101.155.30 Interface pimd.32771
    Apr 17 21:24:35.767842 PIM resolving request for Group 239.1.1.5 Source 10.101.155.24 Interface pimd.32771
    Apr 17 21:24:36.271139 PIM resolving request for Group 239.1.1.5 Source 10.101.155.19 Interface pimd.32771
    Apr 17 21:24:36.738223 PIM resolving request for Group 239.1.1.5 Source 10.101.155.14 Interface pimd.32771
    Apr 17 21:24:36.738327 PIM resolving request for Group 239.1.1.5 Source 10.101.155.28 Interface pimd.32771
    Apr 17 21:24:36.786451 PIM resolving request for Group 239.1.1.5 Source 10.101.155.13 Interface pimd.32771
    Apr 17 21:24:38.218008 PIM resolving request for Group 239.1.1.4 Source 10.101.155.24 Interface pimd.32771
    Apr 17 21:24:38.218451 PIM.master 239.1.1.4, 10.101.155.24/64 RPF: looking up RPF nbr info towards 10.101.155.24
    Apr 17 21:24:38.218482 PIM.master 239.1.1.4, 10.101.155.24/64 RPF: RPF nbr info towards 10.101.155.24: 0.0.0.0 via unknown

    A few lines from pim join:
    LocalEx4600> show pim join 
    Group: 239.1.1.3
        Source: 10.100.155.4
        Flags: sparse,spt
        Upstream interface: irb.55                

    Group: 239.1.1.4
        Source: *
        RP: 10.100.254.254
        Flags: sparse,rptree,wildcard
        Upstream interface: Local                

    Group: 239.1.1.4
        Source: 10.101.155.12
        Flags: sparse
        Upstream interface: unknown (no neighbor)

    Group: 239.1.1.4
        Source: 10.100.155.4
        Flags: sparse,spt
        Upstream interface: irb.55                

    I can page individual speakers at both campuses (dial page server, then dial speaker - I think this uses unicast). 

    I can page speaker groups at the main campus (dial page server, then dial speaker group - this uses multicast).

    When I dial remote speaker groups, nothing happens (page server gives about 5 seconds of silence, then an error tone).

    I have also set up OSPF, IGMP, IGMP-Snooping, and MSDP:

    LocalEx4600> show msdp  
    Peer address    Local address   State       Last up/down Peer-Group   SA Count
    10.101.254.254 10.100.254.254 Established 3w1d 21:37:44             0/0

    Thanks in advance for any assistance!

    Julian