I have a problem with high CPU and memory usage seen in routing engines of both EX4600 that are stacked in VC and both are on newest/recommended Junos version 21.4R3-S2.4
Result of this problem is slow response of ssh and very high latency when pinging this stack (900ms+) where traffic going through it is fine and without problems.
It doesn't do much traffic for now as it is a new installation and I have this problem from start. There are no loops and RSTP is enabled.
Below are the outputs of show commands where you can see that the fxpc process is hogging up all of this CPU and memory.
> show chassis routing-engine
Routing Engine status:
Slot 0:
Current state Master
Temperature 38 degrees C / 100 degrees F
CPU temperature 38 degrees C / 100 degrees F
DRAM 1920 MB
Memory utilization 77 percent
5 sec CPU utilization:
User 63 percent
Background 0 percent
Kernel 2 percent
Interrupt 0 percent
Idle 34 percent
1 min CPU utilization:
User 63 percent
Background 0 percent
Kernel 3 percent
Interrupt 0 percent
Idle 34 percent
5 min CPU utilization:
User 64 percent
Background 0 percent
Kernel 3 percent
Interrupt 0 percent
Idle 34 percent
15 min CPU utilization:
User 63 percent
Background 0 percent
Kernel 2 percent
Interrupt 0 percent
Idle 34 percent
Model EX4600-40F
Serial ID BUILTIN
Start time 2023-03-03 01:07:01 UTC
Uptime 6 days, 15 hours, 54 minutes, 41 seconds
Last reboot reason 0x1:power cycle/failure
Load averages: 1 minute 5 minute 15 minute
1.28 1.16 1.02
Routing Engine status:
Slot 1:
Current state Backup
Temperature 39 degrees C / 102 degrees F
CPU temperature 39 degrees C / 102 degrees F
DRAM 1920 MB
Memory utilization 56 percent
5 sec CPU utilization:
User 62 percent
Background 0 percent
Kernel 2 percent
Interrupt 0 percent
Idle 36 percent
Model EX4600-40F
Serial ID BUILTIN
Uptime 6 days, 15 hours, 55 minutes, 42 seconds
Last reboot reason 0x1:power cycle/failure
Load averages: 1 minute 5 minute 15 minute
0.88 0.87 0.90
{master:0}
> show system processes extensive
last pid: 17028; load averages: 1.07, 1.12, 1.01 up 6+15:57:43 17:04:14
171 processes: 5 running, 145 sleeping, 1 zombie, 20 waiting
Mem: 964M Active, 129M Inact, 458M Wired, 290M Cache, 69M Buf, 7808K Free
Swap: 1106M Total, 1106M Free
PID USERNAME PRI NICE SIZE RES STATE TIME WCPU COMMAND
1951 root 68 -1 1268M 589M RUN 102.8H 61.13% fxpc
10 root 155 52 0K 12K RUN 52.9H 34.52% idle
1941 root 40 0 127M 37932K select 61:26 0.10% chassisd
2185 root 4 0 55152K 41336K pfesta 22:55 0.05% mib2d
1951 root -52 -52 1268M 589M select 102.8H 0.00% fxpc
> show chassis fpc
Temp CPU Utilization (%) CPU Utilization (%) Memory Utilization (%)
Slot State (C) Total Interrupt 1min 5min 15min DRAM (MB) Heap Buffer
0 Online 39 64 0 0 0 0 1920 0 77
1 Online 40 65 0 0 0 0 1920 0 55
2 Empty
> show virtual-chassis status
Virtual Chassis ID: 9075.1b20.1236
Virtual Chassis Mode: Enabled
Mstr Mixed Route Neighbor List
Member ID Status Serial No Model prio Role Mode Mode ID Interface
0 (FPC 0) Prsnt TC37XXXXXXXX ex4600-40f 128 Master* N VC 1 vcp-255/0/26
1 vcp-255/0/27
1 (FPC 1) Prsnt TC37XXXXXXXX ex4600-40f 128 Backup N VC 0 vcp-255/0/26
0 vcp-255/0/27
Member ID for next new member: 2 (FPC 2)
I tried troubleshooting the problem . There are no system or chassis alarms, VC is Established with just "no-split-detection" option in configuration.
I think I found root of those problems as I see weird bugs after trying to vty to fpc0 and it just outputs errors regarding MACSEC which I doesn't plan of using and don't have anything in configuration for this.
> start shell
% vty fpc0
TOR platform (1500 Mhz Pentium processor, 511MB memory, 0KB flash)
TFXPC0( vty)# [Thu Mar 9 17:06:34.540 LOG: Debug] MMI op failed (op=0) address = 0x001000d9
[Thu Mar 9 17:06:50.410 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:50.410 LOG: Debug] MMI op failed (op=0) address = 0x00100108
[Thu Mar 9 17:06:50.751 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:50.751 LOG: Debug] MMI op failed (op=0) address = 0x001000d9
[Thu Mar 9 17:06:52.486 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:52.486 LOG: Debug] MMI op failed (op=0) address = 0x00100108
[Thu Mar 9 17:06:52.826 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:52.826 LOG: Debug] MMI op failed (op=0) address = 0x001000d9
[Thu Mar 9 17:06:53.063 LOG: Info] [DCBCM] dcbcm_eth_an_config: ifd - ge-0/0/5 AN spd = 1000000000, dpx = 2, an = 2, fc = 0
[Thu Mar 9 17:06:53.063 LOG: Info] [DCBCM] dcbcm_check_an_config_change: ifd - ge-0/0/5, OLD CFG spd = 1000000000, dpx = 2, an = 1, fc = 0 auto_speed:0 auto_mdix = 1 NEW CFG spd = 1000000000, dpx = 2, an = 1, fc = 0 auto_speed:0 auto_mdix = 1 ifd->ifd_media.ifm_speed = 100000000[Thu Mar 9 17:06:53.470 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:53.470 LOG: Debug] MMI op failed (op=0) address = 0x00100108
[Thu Mar 9 17:06:53.811 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:53.811 LOG: Debug] MMI op failed (op=0) address = 0x001000d9
[Thu Mar 9 17:06:54.475 LOG: Debug] MACSEC_MII_1 Error: Not Ready timeout
[Thu Mar 9 17:06:54.475 LOG: Debug] MMI op failed (op=0) address = 0x00100108
I can't do anything on the console of fpc as it constantly just outputs errors I provided above.
I thought, that problem is connected to my lastest/recommended software so I tried downgrading it but it fails with another, frist time for me, error - rcp which prevents me from upgrading this VC :(
> show version
fpc0:
--------------------------------------------------------------------------
Model: ex4600-40f
Junos: 21.4R3-S2.4
JUNOS Base OS boot [21.4R3-S2.4]
JUNOS Base OS Software Suite [21.4R3-S2.4]
JUNOS Online Documentation [21.4R3-S2.4]
JUNOS Crypto Software Suite [21.4R3-S2.4]
JUNOS Crypto Software Suite [21.4R3-S2.4]
JUNOS Kernel Software Suite [21.4R3-S2.4]
JUNOS Phone-Home Software Suite [21.4R3-S2.4]
JUNOS Packet Forwarding Engine Support (qfx-ex-x86-32) [21.4R3-S2.4]
JUNOS jsd [i386-21.4R3-S2.4-jet-1]
JUNOS Routing Software Suite [21.4R3-S2.4]
JUNOS SDN Software Suite [21.4R3-S2.4]
JUNOS Enterprise Software Suite [21.4R3-S2.4]
JUNOS Web Management Platform Package [21.4R3-S2.4]
JUNOS Openconfig [21.4R3-S2.4]
JUNOS py-base-i386 [21.4R3-S2.4]
JUNOS py-extensions-i386 [21.4R3-S2.4]
JUNOS Host Software [18.1R3-S6.1]
fpc1:
--------------------------------------------------------------------------
Model: ex4600-40f
Junos: 21.4R3-S2.4
JUNOS Base OS boot [21.4R3-S2.4]
JUNOS Base OS Software Suite [21.4R3-S2.4]
JUNOS Online Documentation [21.4R3-S2.4]
JUNOS Crypto Software Suite [21.4R3-S2.4]
JUNOS Crypto Software Suite [21.4R3-S2.4]
JUNOS Kernel Software Suite [21.4R3-S2.4]
JUNOS Phone-Home Software Suite [21.4R3-S2.4]
JUNOS Packet Forwarding Engine Support (qfx-ex-x86-32) [21.4R3-S2.4]
JUNOS jsd [i386-21.4R3-S2.4-jet-1]
JUNOS Routing Software Suite [21.4R3-S2.4]
JUNOS SDN Software Suite [21.4R3-S2.4]
JUNOS Enterprise Software Suite [21.4R3-S2.4]
JUNOS Web Management Platform Package [21.4R3-S2.4]
JUNOS Openconfig [21.4R3-S2.4]
JUNOS py-base-i386 [21.4R3-S2.4]
JUNOS py-extensions-i386 [21.4R3-S2.4]
JUNOS Host Software [18.1R3-S6.1]
> show system storage all-members
fpc0:
--------------------------------------------------------------------------
Filesystem Size Used Avail Capacity Mounted on
/dev/ad0s1a 1.6G 1.4G 140M 91% /
devfs 1.0K 1.0K 0B 100% /dev
/dev/md0 121M 121M 0B 100% /packages/mnt/jbase
/dev/md1 712M 712M 0B 100% /packages/mnt/jkernel-qfx-ex-x86-32-21.4R3-S2.4
/dev/md2 138M 138M 0B 100% /packages/mnt/jpfe-qfx-ex-x86-32-21.4R3-S2.4
/dev/md3 13M 13M 0B 100% /packages/mnt/jdocs-dc-re-21.4R3-S2.4
/dev/md4 290M 290M 0B 100% /packages/mnt/jroute-qfx-ex-x86-32-21.4R3-S2.4
/dev/md5 38M 38M 0B 100% /packages/mnt/jcrypto-dc-re-21.4R3-S2.4
/dev/md6 28M 28M 0B 100% /packages/mnt/jcrypto-qfx-5-21.4R3-S2.4
/dev/md7 12M 12M 0B 100% /packages/mnt/jphone-home-21.4R3-S2.4
/dev/md8 11M 11M 0B 100% /packages/mnt/jsd-i386-21.4R3-S2.4-jet-1
/dev/md9 17M 17M 0B 100% /packages/mnt/jsdn-i386-21.4R3-S2.4
/dev/md10 71M 71M 0B 100% /packages/mnt/jswitch-qfx-ex-x86-32-21.4R3-S2.4
/dev/md11 32M 32M 0B 100% /packages/mnt/jweb-ex-21.4R3-S2.4
/dev/md12 19M 19M 0B 100% /packages/mnt/py-base-i386-21.4R3-S2.4
/dev/md13 14M 14M 0B 100% /packages/mnt/py-extensions-i386-21.4R3-S2.4
/dev/md14 1.1G 74K 999M 0% /tmp
/dev/ad2s1e 504M 244K 463M 0% /config
procfs 4.0K 4.0K 0B 100% /proc
/dev/ad1s1f 484M 2.1M 443M 0% /var
/dev/ad1s1d 1.9G 598M 1.2G 33% /var/tmp
/dev/ad1s1a 242M 20M 202M 9% /var/log
/dev/ad1s1e 242M 10.0K 222M 0% /altconfig
/dev/ad3s1 1006M 4.0K 926M 0% /user
/dev/md15 98M 164K 90M 0% /var/mfs/var/run
/var/jails/rest-api 484M 2.1M 443M 0% /packages/mnt/jroute-qfx-ex-x86-32-21.4R3-S2.4/web-api/var
devfs 1.0K 1.0K 0B 100% /packages/mnt/jroute-qfx-ex-x86-32-21.4R3-S2.4/web-api/dev
/var/jail/etc 484M 2.1M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/etc
/var/jail/run 484M 2.1M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/run
/var/jail/tmp 484M 2.1M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/tmp
/var/tmp 1.9G 598M 1.2G 33% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/tmp/uploads
/var/jail/sess 484M 2.1M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/sess
/var/jail/log 484M 2.1M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/log
devfs 1.0K 1.0K 0B 100% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/dev
/var/jail/jweb-app 484M 2.1M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/jweb-app
/dev/md16 18M 18M 0B 100% /packages/mnt/junos-openconfig-21.4R3-S2.4
/dev/md17 504M 95M 369M 20% /var/rundb
fpc1:
--------------------------------------------------------------------------
Filesystem Size Used Avail Capacity Mounted on
/dev/ad0s1a 1.6G 1.4G 140M 91% /
devfs 1.0K 1.0K 0B 100% /dev
/dev/md0 121M 121M 0B 100% /packages/mnt/jbase
/dev/md1 712M 712M 0B 100% /packages/mnt/jkernel-qfx-ex-x86-32-21.4R3-S2.4
/dev/md2 138M 138M 0B 100% /packages/mnt/jpfe-qfx-ex-x86-32-21.4R3-S2.4
/dev/md3 13M 13M 0B 100% /packages/mnt/jdocs-dc-re-21.4R3-S2.4
/dev/md4 290M 290M 0B 100% /packages/mnt/jroute-qfx-ex-x86-32-21.4R3-S2.4
/dev/md5 38M 38M 0B 100% /packages/mnt/jcrypto-dc-re-21.4R3-S2.4
/dev/md6 28M 28M 0B 100% /packages/mnt/jcrypto-qfx-5-21.4R3-S2.4
/dev/md7 12M 12M 0B 100% /packages/mnt/jphone-home-21.4R3-S2.4
/dev/md8 11M 11M 0B 100% /packages/mnt/jsd-i386-21.4R3-S2.4-jet-1
/dev/md9 17M 17M 0B 100% /packages/mnt/jsdn-i386-21.4R3-S2.4
/dev/md10 71M 71M 0B 100% /packages/mnt/jswitch-qfx-ex-x86-32-21.4R3-S2.4
/dev/md11 32M 32M 0B 100% /packages/mnt/jweb-ex-21.4R3-S2.4
/dev/md12 19M 19M 0B 100% /packages/mnt/py-base-i386-21.4R3-S2.4
/dev/md13 14M 14M 0B 100% /packages/mnt/py-extensions-i386-21.4R3-S2.4
/dev/md14 1.1G 74K 999M 0% /tmp
/dev/ad2s1e 504M 236K 463M 0% /config
procfs 4.0K 4.0K 0B 100% /proc
/dev/ad1s1f 484M 2.0M 443M 0% /var
/dev/ad1s1d 1.9G 26K 1.8G 0% /var/tmp
/dev/ad1s1a 242M 11M 211M 5% /var/log
/dev/ad1s1e 242M 10.0K 222M 0% /altconfig
/dev/ad3s1 1006M 4.0K 926M 0% /user
/dev/md15 98M 154K 90M 0% /var/mfs/var/run
/var/jails/rest-api 484M 2.0M 443M 0% /packages/mnt/jroute-qfx-ex-x86-32-21.4R3-S2.4/web-api/var
devfs 1.0K 1.0K 0B 100% /packages/mnt/jroute-qfx-ex-x86-32-21.4R3-S2.4/web-api/dev
/var/jail/etc 484M 2.0M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/etc
/var/jail/run 484M 2.0M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/run
/var/jail/tmp 484M 2.0M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/tmp
/var/tmp 1.9G 26K 1.8G 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/tmp/uploads
/var/jail/sess 484M 2.0M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/sess
/var/jail/log 484M 2.0M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/log
devfs 1.0K 1.0K 0B 100% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/dev
/var/jail/jweb-app 484M 2.0M 443M 0% /packages/mnt/jweb-ex-21.4R3-S2.4/jail/var/jweb-app
/dev/md16 18M 18M 0B 100% /packages/mnt/junos-openconfig-21.4R3-S2.4
/dev/md17 504M 95M 369M 20% /var/rundb
> request system software add /var/tmp/jinstall-host-ex-4600-20.4R3-S6.3-signed.tgz force-host reboot no-validate force
Checking pending install on fpc1
Checking pending install on fpc0
Pushing bundle /var/tmp/jinstall-host-ex-4600-20.4R3-S6.3-signed.tgz to fpc1
error: rcp for member 1, failed
Can anybody help me with resolving this big issue?
On a side note, I wonder why now, Juniper recommends newest software for devices which are mostly put in core of networks (having multiple 10Gs interfaces).