Hi There,
I have 2 Juniper MX 240 running MCLAG and ICCP with ICL interfaces. We have irbs configured and connected to the LAN side of the network. Both routers are using very little CPU. For the ICCP liveliness detection we have the following. Both routers have the same configs except the peer IPs
show protocols iccp
local-ip-addr 10.10.10.1;
peer 10.10.10.2 {
redundancy-group-id-list 1;
backup-liveness-detection {
backup-peer-ip 172.16.255.201;
}
liveness-detection {
minimum-receive-interval 1000;
multiplier 1;
transmit-interval {
minimum-interval 1000;
}
detection-time {
threshold 2000;
}
Router1
show bfd session
Detect Transmit
Address State Interface Time Interval Multiplier
10.10.10.1 Up 1.000 0.900 1
1 sessions, 1 clients
Cumulative transmit rate 1.1 pps, cumulative receive rate 1.0 pps
Client ICCP realm 10.10.10.2, TX interval 1.000, RX interval 1.000
Session up time 03:59:30, previous down time 00:00:02
Local diagnostic None, remote diagnostic None
Remote state Up, version 1
Session type: Multi hop BFD
Min async interval 1.000, min slow interval 1.000
Adaptive async TX interval 1.000, RX interval 1.000
Local min TX interval 1.000, minimum RX interval 1.000, multiplier 1
Remote min TX interval 1.000, min RX interval 1.000, multiplier 1
Threshold for detection time 2.000
Local discriminator 18, remote discriminator 18
Echo mode disabled/inactive
Multi-hop route table 0, local-address 10.10.10.1
Session ID: 0x0
1 sessions, 1 clients
Cumulative transmit rate 1.1 pps, cumulative receive rate 1.0 pps
Router 2
show bfd session
Detect Transmit
Address State Interface Time Interval Multiplier
10.10.10.2 Up 1.000 0.900 1
Client ICCP realm 10.10.10.1, TX interval 1.000, RX interval 1.000
Session up time 04:01:29, previous down time 00:00:01
Local diagnostic None, remote diagnostic None
Remote state Up, version 1
Session type: Multi hop BFD
Min async interval 1.000, min slow interval 1.000
Adaptive async TX interval 1.000, RX interval 1.000
Local min TX interval 1.000, minimum RX interval 1.000, multiplier 1
Remote min TX interval 1.000, min RX interval 1.000, multiplier 1
Threshold for detection time 2.000
Local discriminator 18, remote discriminator 18
Echo mode disabled/inactive
Multi-hop route table 0, local-address 10.10.10.2
Session ID: 0x0
1 sessions, 1 clients
Cumulative transmit rate 1.1 pps, cumulative receive rate 1.0 pps
The above BFD session is frequently flapping with the error that
Router 1 Same time
BFD Session 10.10.10.2 (IFL 0) state Up -> Down LD/RD(18/18) Up time:1w4d 08:08 Local diag: NbrSignal Remote diag: CtlExpire Reason: Received DO
WN from PEER.
Dec 27 08:18:40 edge1.da1 bfdd[3903]: BFDD_TRAP_MHOP_STATE_DOWN: local discriminator: 18, new state: down, peer addr: 10.10.10.2
Dec 27 08:18:40 edge1.da1 lacpd[24612]: mcae_icl_event_handle_icl_up_iccp_down_from_iccp_up: for mcae ae7 preferred_active is TRUE
Dec 27 08:18:40 edge1.da1 mib2d[3892]: SNMP_TRAP_LINK_DOWN: ifIndex 563, ifAdminStatus up(1), ifOperStatus down(2), ifName ae6.0
Dec 27 08:18:41 edge1.da1 rpd[3949]: RPD_OSPF_NBRDOWN: OSPF neighbor 216.52.72.9 (realm ospf-v2 irb.3312 area 0.0.0.0) state changed from Full to Init due to 1WayRcvd (event reason:
neighbor is in one-way mode)
Router 2: same time
BFD Session 10.10.10.1 (IFL 0) state Up -> Down LD/RD(18/18) Up time:1w4d 08:08 Local diag: CtlExpire Remote diag: None Reason: Detect Timer Expiry.
Any idea how to tune the BFD settings and make the BFD stable.
#mclag#BFD#iccp