Hi there,
we have a virtual chassis with 6 EX4300 members.
The firmware version is: 18.2R1.9
We have high CPU usage on the primary routing engine.
This load is caused by our SNMP monitoring. We query normal values. For example, temperature, CPU usage or just the traffic of the individual ports.
It is precisely this cancellation of the traffic (values "IF-MIB :: ifHCInOctets" "IF-MIB :: ifHCOutOctets") that causes the load.
Here is a top excerpt:
last pid: 11414; load averages: 1.63, 1.42, 1.30 up 418+05:36:18 16:13:06
66 processes: 3 running, 63 sleeping
CPU states: 38.0% user, 0.0% nice, 41.5% system, 0.5% interrupt, 20.1% idle
Mem: 986M Active, 81M Inact, 152M Wired, 560M Cache, 112M Buf, 81M Free
Swap:
PID USERNAME THR PRI NICE SIZE RES STATE TIME WCPU COMMAND
1793 root 1 76 0 59772K 38904K RUN 3246.6 30.08% mib2d
1646 root 2 -52 -52 564M 208M select 2637.2 17.14% pfex_junos
1792 root 1 76 0 35796K 25132K select 1214.3 12.45% snmpd
1798 root 1 51 0 41172K 23352K select 518.2H 3.42% pfed
1633 root 1 49 0 64888K 29128K select 334.9H 2.59% chassisd
11407 root 1 42 0 11836K 5572K select 0:00 0.22% sshd
We collect the values every 10 seconds. We also do this on our older ex4200 switches. We have absolutely no problems with the ex4200.
If I set that we only collect the values for the graphs every 20 seconds, then the load decreases, but we have peaks in the graph every 20 seconds.
What could be the problem here? I think we are not the only ones requesting SNMP values, so I hope that one or the other has a tip.
Best regards 🙂