Realtime Load - Explanation

RangerZ · October 27, 2017, 9:48pm

I was streaming video last night and got curious how much bandwidth I was using.

I looked at the Realtime Load screen and the left side values do not have any unit associated with them. While streaming I am seeing numbers in the 1.1 range.

Can someone please help me understand what this is telling me? I am not finding a reference to this window's content.

FWIW, the stream is generating peaks of about 5 Mbits every 4-5 seconds on the Traffic screen.

lleachii · October 27, 2017, 10:00pm

"System Load" is not network traffic; it is best described as the current queue of processes being handled by the CPU. On most embedded CPUs, the processor can handle a load of 1 to 4, so be mindful of that when comparing to desktop CPU loads.

See: https://en.wikipedia.org/wiki/Load_(computing)

You should check Realtime Traffic or Realtime Connections to get the information you were looking for.

To get a running total since boot (or since the interface came up), go to the Interfaces page and see how much the WAN interface has used. If you don't have LuCI installed, type ifconfig; it will also show the traffic used on all interfaces.

mk24 · October 27, 2017, 10:36pm

On a single-core CPU, a load of 1.0 is considered fully loaded. Above that, processes will start to slow down due to lack of CPU capacity.

The load number is exactly the same as would be seen on a desktop or server Linux, so there are lots of write-ups about it.
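
To make the "1.0 on a single core" rule of thumb concrete, here is a minimal sketch that reads the same numbers top shows from /proc/loadavg and divides them by the core count. It assumes Python 3 is available on a Linux box (it is not installed on OpenWrt/LEDE by default); the helper names are made up for illustration.

```python
import os

def parse_loadavg(text):
    """Return the 1-, 5- and 15-minute load averages from /proc/loadavg text."""
    one, five, fifteen = text.split()[:3]
    return float(one), float(five), float(fifteen)

def load_per_core(load, cores):
    """Scale a load average by the number of CPU cores (hypothetical helper)."""
    return load / max(cores, 1)

if __name__ == "__main__" and os.path.exists("/proc/loadavg"):
    with open("/proc/loadavg") as f:
        one, five, fifteen = parse_loadavg(f.read())
    cores = os.cpu_count() or 1
    print(f"load averages (1/5/15 min): {one} {five} {fifteen}")
    print(f"1-minute load per core ({cores} cores): {load_per_core(one, cores):.2f}")
```

A per-core value near or above 1.0 is the "fully loaded" case described above; on a four-core router the same raw load of 1.0 would only be about a quarter of capacity.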

lleachii · October 27, 2017, 11:00pm

I concur, slightly. The semantics are in how load and utilization percentage are calculated.

There's a difference between full processor utilization (percentage of time system is not in Idle) and full load (how many processes are running at x sample times). Nonetheless, I agree - a load of 1.0 on a single core means the system is full-load (though, the process may not be utilizing the CPU 100%).
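
For comparison with the load number, a rough sketch of how a utilization percentage can be derived from two samples of the aggregate "cpu" line in /proc/stat. This is an illustration under assumptions, not how LuCI or top necessarily compute it: the function names are invented, and iowait is counted as idle time here.

```python
import os
import time

def parse_cpu_line(text):
    """Return (idle_ticks, total_ticks) from the aggregate 'cpu' line of /proc/stat."""
    for line in text.splitlines():
        fields = line.split()
        if fields and fields[0] == "cpu":
            # user nice system idle iowait irq softirq steal (guest fields are
            # already included in user/nice, so they are left out of the total)
            ticks = [int(v) for v in fields[1:9]]
            idle = ticks[3] + (ticks[4] if len(ticks) > 4 else 0)
            return idle, sum(ticks)
    raise ValueError("no aggregate cpu line found")

def utilization(before, after):
    """Percentage of non-idle time between two (idle, total) samples."""
    idle_delta = after[0] - before[0]
    total_delta = after[1] - before[1]
    if total_delta <= 0:
        return 0.0
    return 100.0 * (1.0 - idle_delta / total_delta)

def sample(path="/proc/stat"):
    with open(path) as f:
        return parse_cpu_line(f.read())

if __name__ == "__main__" and os.path.exists("/proc/stat"):
    first = sample()
    time.sleep(1)
    print(f"CPU utilization over 1 s: {utilization(first, sample()):.1f}%")
```

Utilization measures the share of time the CPU was busy, while load counts how many tasks were running or waiting, so the two can diverge as described above.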

RangerZ · October 27, 2017, 11:22pm

With a single video stream running I see a load of 0.88-1.1; with a second stream I see loads that go as high as 1.30 on what I believe is a single-core 500 MHz AMD Geode LX-800. What inferences can I make from this for my device?

On the Traffic screen, for the WAN interface, I see peaks around 5 Mbit/s for a single stream and 8.2 Mbit/s for two streams. The graph shows spikes of data about every 4-5 seconds. The Average is all over the place, but around 500 Kbit/s - 3.5 Mbit/s. It is not clear what the numbers next to the Inbound and Outbound text strings are.

I think it's the average inbound number that I am looking for, but please continue....

lleachii · October 27, 2017, 11:29pm

When I run LuCI, it causes my CPU load to jump to ~1.2...when logged out, I go back to ~0.04...Also, when LuCI is running (and Auto Refreshing), the CPU utilization is at ~30%, when logged out - ~1.25%.

I also use softflowd and snmpd to gather statistics from the router, so I don't always have to run LuCI to get this data.

I think it's just LuCI drawing graphics and pulling the usage statistics real-time.

RangerZ · October 28, 2017, 1:11am

So when I run top, the CPU (usr) utilization with LuCI and 2 graphs running is about 10%, and the load is now about 0.30-0.40.

Without LuCI the CPU is at 0% (99% idle) and the load average about 10%, so I see what you mean about the GUI.

I do not generally run LuCI.

So can I monitor the average load on my WAN port from the command line? ifconfig appears to be total traffic, from boot I assume.

How can I kill top without killing PuTTY?
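
On the WAN question: since ifconfig only reports totals, one way to get an average rate from the command line is to read the byte counters twice and divide the difference by the elapsed time. A minimal sketch, assuming Python 3 has been installed on the router (it is not there by default); the script name, the 5-second interval, and the interface name you pass in are all placeholders.

```python
import os
import sys
import time

INTERVAL_S = 5.0  # sampling window; an arbitrary choice for this sketch

def parse_net_dev(text):
    """Map interface name -> (rx_bytes, tx_bytes) from /proc/net/dev text."""
    counters = {}
    for line in text.splitlines():
        if ":" not in line:
            continue  # the two header lines contain no colon
        name, data = line.split(":", 1)
        fields = data.split()
        counters[name.strip()] = (int(fields[0]), int(fields[8]))
    return counters

def rate_mbit(bytes_before, bytes_after, seconds):
    """Average rate in Mbit/s between two byte-counter readings."""
    return (bytes_after - bytes_before) * 8 / seconds / 1e6

def read_counters(path="/proc/net/dev"):
    with open(path) as f:
        return parse_net_dev(f.read())

def main(argv):
    if len(argv) < 2 or not os.path.exists("/proc/net/dev"):
        print("usage: wanrate.py IFACE   (Linux only)")
        return
    iface = argv[1]
    before = read_counters()
    if iface not in before:
        print(f"interface {iface!r} not found")
        return
    time.sleep(INTERVAL_S)
    after = read_counters()
    rx = rate_mbit(before[iface][0], after[iface][0], INTERVAL_S)
    tx = rate_mbit(before[iface][1], after[iface][1], INTERVAL_S)
    print(f"{iface}: in {rx:.2f} Mbit/s, out {tx:.2f} Mbit/s")

if __name__ == "__main__":
    main(sys.argv)
```

Run it with the name of your WAN interface as the only argument. With bursty streaming traffic, a longer interval gives a steadier average.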

lleachii · October 28, 2017, 1:30am

Press CTRL+C.

I tested what the manual said ("Q") on LEDE, it didn't work, but CTRL+C generally closes (or "kills process") a running Command Line program.

anomeome · October 28, 2017, 2:27am

To get expected htop TTY behaviour:

CONFIG_BUSYBOX_CONFIG_FEATURE_TELNETD_STANDALONE=y
CONFIG_BUSYBOX_CONFIG_FEATURE_TELNET_TTYPE=y

lleachii · October 28, 2017, 5:46pm

Can you tell us where these commands or settings are placed?

...the .config file? ...the command line?

This allows "Q" to be pressed in top?

What happens to other programs?

Do they detach now?

mrbene · October 30, 2017, 8:43pm

@RangerZ - I've been monitoring LEDE performance for an unrelated issue. On Android, I found "JuiceSSH" with the "Performance Monitor" plug-in to be a handy way of putting live LEDE performance numbers at my fingertips.

anomeome · October 31, 2017, 8:25pm

I pulled those from an old configdiff and may have pulled incomplete or wrong defines; it has been a while since I made things work. Here are all the busybox TTY/top configs from a current configdiff:

CONFIG_BUSYBOX_CONFIG_FEATURE_TELNETD_STANDALONE=y
CONFIG_BUSYBOX_CONFIG_FEATURE_TELNET_TTYPE=y
CONFIG_BUSYBOX_CONFIG_FEATURE_TOPMEM=y
CONFIG_BUSYBOX_CONFIG_FEATURE_TOP_INTERACTIVE=y
CONFIG_BUSYBOX_CONFIG_FEATURE_TOP_SMP_CPU=y
CONFIG_BUSYBOX_CONFIG_FEATURE_TOP_SMP_PROCESS=y
CONFIG_BUSYBOX_CONFIG_STTY=y
CONFIG_BUSYBOX_CONFIG_TELNET=y
CONFIG_BUSYBOX_CONFIG_TELNETD=y

So this allows top in an image to respond to commands such as '1' to display all cores on multi core device, 'q' to quit...

RangerZ · October 31, 2017, 11:08pm

Thanks, that does it.

Unfortunately I am on an iPhone.

@anomeome, I'm the dumb Windows user in the room. Not a clue what you're saying, but thanks.

Conno · December 5, 2018, 9:08am

I am just trying to understand the load behaviour on my otherwise rock-solid Linksys WRT1900ACS running OpenWrt 18.06.1.
Once every 50 minutes I see a strong increase in the load:
[Image: 2018-12-05_10-00-32 Load]
While the CPU stays fairly low at the same time:
[Image: 2018-12-05_10-01-34 CPU]
This behaviour seems to repeat itself 24/7:
[Image: 2018-12-05_10-02-29 Load 24h]
Any clues?

lleachii · December 5, 2018, 8:37pm

Sure...what are you running on your machine or network every 50 minutes?

(Also, this thread is over a year old, please consider making a new thread in the future.)

Conno · December 6, 2018, 10:14am

Excuse me for not starting a new topic.
I've no idea what is running on my machine. I am running OpenWrt 18.06.1 with SQM QoS (piece_of_cake) and the statistics package.
But the good news is that since this morning the spikes are gone:
[Image: 2018-12-05_10-02-29 Load 1wk]
It seems that leaving the LuCI browser window open with auto refresh on caused these spikes in load. Is this normal behaviour?

hnyman · December 6, 2018, 10:36am

There are some strange aspects:

"Every 50 minutes" does not sound like any normal refresh cycle.
You had system load spikes, but no spikes in CPU usage. Is there some I/O-related bottleneck at some download/upload item, a USB stick write task, or something like that? Do you copy something to a slow memory device every 50 minutes?
The WRT1900AC is a rather high-powered device, so having a rather steady 7-9% CPU load requires that you have some rather heavy tasks ongoing all the time. A high-speed torrent upload/download, or something like that?
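
One way to check the I/O theory: on Linux the load average counts tasks in uninterruptible sleep (state D, typically waiting on I/O) as well as runnable ones (state R), which is how load can spike while CPU usage stays low. A minimal sketch, assuming Python 3 on the device; the function names are invented for illustration.

```python
import glob

def parse_state(stat_text):
    """Return the one-letter process state from the text of /proc/<pid>/stat."""
    # The command name sits in parentheses and may contain spaces or ')',
    # so split after the last ')' before reading the state field.
    return stat_text.rsplit(")", 1)[1].split()[0]

def count_states(stat_texts):
    """Count how many processes are in each state."""
    counts = {}
    for text in stat_texts:
        state = parse_state(text)
        counts[state] = counts.get(state, 0) + 1
    return counts

def read_all_stats():
    """Read every /proc/<pid>/stat, skipping processes that exit mid-scan."""
    texts = []
    for path in glob.glob("/proc/[0-9]*/stat"):
        try:
            with open(path) as f:
                texts.append(f.read())
        except OSError:
            continue
    return texts

if __name__ == "__main__":
    counts = count_states(read_all_stats())
    print(f"runnable (R): {counts.get('R', 0)}, uninterruptible (D): {counts.get('D', 0)}")
```

Running this during a spike and seeing a non-zero D count would point toward something blocked on storage or another device rather than CPU work.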

PS. Cake has recently been shown to use rather high amounts of CPU with high-speed traffic, so it may not be quite optimal for high speeds (as it may be calculating too much). But even with that, there is not really anything to explain the 50-minute interval. (You might test the old simple.qos with fq_codel instead of cake to see if you still get the spikes.)

moeller0 · December 6, 2018, 11:57am

In that case, please try to also test the current master version of sqm-scripts from https://github.com/tohojo/sqm-scripts (see README.md for pointers how to do that). And have a look into /usr/lib/sqm/defaults.sh and try to put larger values into SHAPER_QUANTUM_DUR_US; this variable is used to size HTB/TBF's burst-buffer so that emptying it at the configured shaper-rate will take SHAPER_QUANTUM_DUR_US number of microseconds (assuming the buffer is full). Please note that the higher this value the burstier SQM is going to behave with noticeable effects on latency-under-load (aka bufferbloat). But please try and report back any observations like success or failure as a new issue at https://github.com/tohojo/sqm-scripts/issues
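
To get a feel for what a larger SHAPER_QUANTUM_DUR_US means, here is the arithmetic from the description above as a small sketch: the burst buffer holds as many bytes as the shaper drains in that many microseconds. This only mirrors the relationship stated in the post; the real script may apply rounding or minimum sizes that are ignored here, and the rate and duration values are made-up examples.

```python
def burst_bytes(rate_kbit, quantum_dur_us):
    """Bytes that drain in quantum_dur_us microseconds at rate_kbit kbit/s."""
    bytes_per_second = rate_kbit * 1000 / 8
    return bytes_per_second * quantum_dur_us / 1_000_000

if __name__ == "__main__":
    # Hypothetical shaper rates and durations, for illustration only.
    for rate in (8000, 100000):
        for dur in (1000, 4000):
            print(f"{rate} kbit/s, {dur} us -> {burst_bytes(rate, dur):.0f} bytes")
```

Doubling the duration doubles the burst size at a given rate, which is where the extra burstiness and the latency-under-load cost mentioned above come from.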