Linksys EA7300 getting slower after one day

CrAsH0v3r · March 9, 2022, 1:32am

Impact of reactivating SQM QOS with params you gave (before/after):

Also note that I had 34mbps 5 hours ago, now its less than 28!

Waveform test with SQM :

and without :

So there is improvement on bufferbloat but overall speed is slower (due to processing?).

But in both cases that does not solve my issue of speed slowing down until reboot..

Thanks again!

anon89577378 · March 9, 2022, 2:02am

How many client devices are currently connected?

Try changing cake and piece.of.cake to fq_codel and simplest_tbf.

Make sure you are saving and applying settings in LuCI, or doing uci commit if using the CLI.

Rerun the tests.

Do a cat /etc/config/sqm again, and post the results.

Your ISP may be using CGNAT.

CrAsH0v3r · March 9, 2022, 2:32am

With these new params :

Slower..

root@OpenWrt:~# cat /etc/config/sqm

config queue 'eth1'
        option debug_logging '0'
        option verbosity '5'
        option interface 'wan'
        option download '59000'
        option upload '9000'
        option linklayer 'ethernet'
        option overhead '18'
        option qdisc 'fq_codel'
        option script 'simplest_tbf.qos'
        option enabled '1'

I have about 30 devices, but more than half are IOT devices with very low traffic (temperature, garage door, motion sensor) or smart plugs. The "big" devices, are 2 cellphone, 2 tablets, my desktop and laptop, then 2 work laptops. One Google Home. I can see with my grafana graphs based on prometheus that network is petty quiet except when I run the tests.

And finally, I rebooted the router, here is the result right after .
With SQM :

Without :

Thanks!

anon89577378 · March 9, 2022, 2:37am

So try fq_codel, simplest_tbf, zero for download shaping, and 10000 for upload.

If no improvement, or worse, turn SQM off again.

Since your device subtarget is mt7621, you can take advantage of hardware NAT, which could help improve your speed.

LuCI > Network > Firewall > General Settings

In the Routing/NAT Offloading section, check Software flow offloading, which will display Hardware flow offloading.

Check that, save and apply, reboot, and test.

bill888 · March 9, 2022, 3:53am

fwiw, have you checked LuCI > Status > Realtime graphs > Traffic to confirm no other devices are using the internet when you perform speed test?

CrAsH0v3r · March 9, 2022, 1:15pm

Thanks for replies.
I'll test that at lunch time or tonight to avoid troubles with my wife conf calls:D

To Bill : Yes, it was very calm on the network between the tests I did, as you can see.

Also I did the bufferbloat test again when I woke up this morning and it was lower than last night :

CrAsH0v3r · March 9, 2022, 2:43pm

With SQM 0 Download and 10000 upload :

Without :

Download speed is similar now, but upload speed is bad.

So I won't activate SQM for now as it's not my issue.

Will try software offloading later.

And continue to monitor as speed may go down by itself later today.

CrAsH0v3r · March 9, 2022, 4:51pm

Me again. Network was calm before test.
Tests without SQM Before Offloading :

And with hardware offloading :

Almost the same.

Now lets see if it drops in a couple of hours!

CrAsH0v3r · March 10, 2022, 5:07pm

For now, with more than 24h of uptime, I'm not experiencing any speed issue.. Lets hope it will last!

CrAsH0v3r · April 7, 2022, 3:37pm

@anon89577378
Hi again, a couple weeks later.

After a certain time using Hardware offloading, I started experiencing some issues with streaming, facebook etc.
Examples :

Facebook conversations do not load. When I click on "see more", it just turns and turns indefinitely.
Youtube videos are buffering more than before
No issue with speed tests, I hit the max. ..

I went back to initial settings (no offloading), it fixed issues with facebook and streaming, but after one day or 2, internet speed is slowing down drastically (60mb -> 30 -> now it's around 17mb after 2 days!) A reboot will fix that...
Observation :
The process ksoftirqd seems to be the one using most CPU.

I tried to check only Software offloading, and it seemed to fix both issues, but this morning, my RSA Token (work related) was telling "no internet" but there was no issues with internet.. And right now, I had issue loading pictures on Facebook, everything was black.

Unchecked Software offloading and no more issue loading pics.. but internet speed is back to 17mb/sec until next reboot.

Really don't know what to do. I'm about to go back to my ISP router and sell the Linksys on marketplace..

Any idea somebody ?

anon89577378 · April 7, 2022, 7:00pm

Let's see what your CPU and RAM look like during the "slow" period.

Install the package sysstat.

SSH in to the router.

To trace CPU usage -

pidstat -T TASK 2 (which snapshots every 2 seconds)

To log the results (name the file whatever you want) -

pidstat -T TASK 2 | tee -a CPUtrace.txt

CTRL-C to end the trace.

Log file will be in the root directory.

To trace RAM usage -

pidstat -r -p ALL 2 (which snapshots every 2 seconds)

To log the results (name the file whatever you want) -

pidstat -r -p ALL 2 | tee -a RAMtrace.txt

CTRL-C to end the trace.

Log file will be in the root directory.

CrAsH0v3r · April 8, 2022, 12:13pm

Hi,

It took less than 24h to drop from 67mbps to 40.
Here a the 2 traces while I was doing the speedtests.

Very clear that ksoftirqd is taking all the place.
You will also see Prometheus. It's to feed by Grafana monitoring graphs. First thing I'll test this morning is to deactivate this and reboot to see if it's the cause .

Do you see something else wrong ?

CPUTrace :

Linux 5.4.179 (OpenWrt) 	04/08/22 	_mips_	(4 CPU)

07:10:32      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:34        0         5    0.00    0.49    0.00    0.00    0.49     0  kworker/0:0-events
07:10:34        0         9    0.00   39.81    0.00    0.00   39.81     0  ksoftirqd/0
07:10:34        0        10    0.00    0.49    0.00    0.00    0.49     2  rcu_sched
07:10:34        0      3149   67.96   31.07    0.00    0.00   99.03     3  prometheus-node
07:10:34      453      7684    0.97    0.97    0.00    0.00    1.94     0  dnsmasq
07:10:34        0     22219    0.97    0.97    0.00    0.00    1.94     1  pidstat

07:10:34      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:36        0         9    0.00   56.22    0.00    0.00   56.22     0  ksoftirqd/0
07:10:36        0        10    0.00    0.50    0.00    0.00    0.50     3  rcu_sched
07:10:36        0      3149    0.50    0.00    0.00    0.00    0.50     3  prometheus-node
07:10:36        0     22219    1.00    1.00    0.00    0.00    1.99     1  pidstat

07:10:36      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:38        0         9    0.00    3.00    0.00    0.00    3.00     0  ksoftirqd/0
07:10:38      453      7684    0.50    0.00    0.00    0.00    0.50     1  dnsmasq
07:10:38        0     22219    0.50    1.00    0.00    0.00    1.50     1  pidstat
07:10:38        0     22220    0.00    0.50    0.00    0.00    0.50     0  tee

07:10:38      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:40        0         9    0.00   20.50    0.00    0.00   20.50     0  ksoftirqd/0
07:10:40        0       776    0.00    0.50    0.00    0.00    0.50     0  urngd
07:10:40        0      1928    0.00    0.50    0.00    0.00    0.50     0  odhcpd
07:10:40      453      7684    1.00    2.00    0.00    0.00    3.00     1  dnsmasq
07:10:40        0     22219    1.50    0.50    0.00    0.00    2.00     1  pidstat

07:10:40      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:42        0         9    0.00    8.46    0.00    0.00    8.46     0  ksoftirqd/0
07:10:42      453      7684    0.00    0.50    0.00    0.00    0.50     1  dnsmasq
07:10:42        0     22210    0.00    0.50    0.00    0.00    0.50     2  dropbear
07:10:42        0     22219    0.50    1.49    0.00    0.00    1.99     1  pidstat

07:10:42      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:44        0         9    0.00    4.00    0.00    0.00    4.00     0  ksoftirqd/0
07:10:44      453      7684    0.50    0.00    0.00    0.00    0.50     1  dnsmasq
07:10:44        0     22219    1.00    0.00    0.00    0.00    1.00     1  pidstat

07:10:44      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:46        0         5    0.00    0.50    0.00    0.00    0.50     0  kworker/0:0-mm_percpu_wq
07:10:46        0         9    0.00   59.70    0.00    0.00   59.70     0  ksoftirqd/0
07:10:46        0      1928    0.50    0.00    0.00    0.00    0.50     1  odhcpd
07:10:46      453      7684    0.00    0.50    0.00    0.00    0.50     1  dnsmasq
07:10:46        0     22219    0.50    1.49    0.00    0.00    1.99     1  pidstat

07:10:46      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:48        0         9    0.00    4.00    0.00    0.00    4.00     0  ksoftirqd/0
07:10:48        0        10    0.00    0.50    0.00    0.00    0.50     0  rcu_sched
07:10:48        0      3149   43.50   12.50    0.00    0.00   56.00     3  prometheus-node
07:10:48      453      7684    0.00    0.50    0.00    0.00    0.50     1  dnsmasq
07:10:48        0     22219    1.00    1.00    0.00    0.00    2.00     1  pidstat
07:10:48        0     22220    0.00    0.50    0.00    0.00    0.50     0  tee

07:10:48      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:50        0         9    0.00   38.50    0.00    0.00   38.50     0  ksoftirqd/0
07:10:50        0        10    0.00    0.50    0.00    0.00    0.50     1  rcu_sched
07:10:50        0       514    0.00    0.50    0.00    0.00    0.50     2  kworker/2:2-events_power_efficient
07:10:50       81       741    0.50    0.00    0.00    0.00    0.50     2  ubusd
07:10:50        0      3149   48.00   29.00    0.00    0.00   77.00     3  prometheus-node
07:10:50      453      7684    0.50    0.50    0.00    0.00    1.00     1  dnsmasq
07:10:50        0     22219    1.00    1.00    0.00    0.00    2.00     1  pidstat

07:10:50      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:52        0         9    0.00  100.00    0.00    0.00  100.00     0  ksoftirqd/0
07:10:52        0        10    0.00    0.50    0.00    0.00    0.50     3  rcu_sched
07:10:52      453      7684    0.50    3.48    0.00    0.00    3.98     1  dnsmasq
07:10:52        0     22210    0.00    0.50    0.00    0.00    0.50     2  dropbear
07:10:52        0     22219    1.49    1.00    0.00    0.00    2.49     2  pidstat

07:10:52      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:54        0         5    0.00    0.50    0.00    0.00    0.50     0  kworker/0:0-mm_percpu_wq
07:10:54        0         9    0.00   99.50    0.00    0.00   99.50     0  ksoftirqd/0
07:10:54        0        26    0.00    0.50    0.00    0.00    0.50     3  kworker/3:0-events_power_efficient
07:10:54      453      7684    0.50    1.00    0.00    0.00    1.50     1  dnsmasq
07:10:54        0     22219    0.50    0.50    0.00    0.00    1.00     2  pidstat

07:10:54      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:56        0         9    0.00   99.50    0.00    0.00   99.50     0  ksoftirqd/0
07:10:56      453      7684    0.50    1.00    0.00    0.00    1.49     1  dnsmasq
07:10:56        0     22219    1.00    1.00    0.00    0.00    1.99     2  pidstat

07:10:56      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:58        0         9    0.00  100.00    0.00    0.00  100.00     0  ksoftirqd/0
07:10:58        0     22219    0.50    1.00    0.00    0.00    1.50     2  pidstat

07:10:58      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:11:00        0         5    0.00    0.50    0.00    0.00    0.50     0  kworker/0:0-mm_percpu_wq
07:11:00        0         9    0.00  100.00    0.00    0.00  100.00     0  ksoftirqd/0
07:11:00        0      1772    0.00    0.50    0.00    0.00    0.50     3  netifd
07:11:00        0     22219    0.50    1.00    0.00    0.00    1.50     2  pidstat

07:11:00      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:11:02        0         9    0.00  100.00    0.00    0.00  100.00     0  ksoftirqd/0
07:11:02        0        10    0.00    0.50    0.00    0.00    0.50     1  rcu_sched
07:11:02        0      1772    0.50    0.00    0.00    0.00    0.50     2  netifd
07:11:02        0      3149   29.85    6.47    0.00    0.00   36.32     3  prometheus-node
07:11:02      453      7684    0.00    0.50    0.00    0.00    0.50     1  dnsmasq
07:11:02        0     22219    0.50    1.99    0.00    0.00    2.49     2  pidstat

07:11:02      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:11:04        0         5    0.00    0.50    0.00    0.00    0.50     0  kworker/0:0-mm_percpu_wq
07:11:04        0         9    0.00   99.50    0.00    0.00   99.50     0  ksoftirqd/0
07:11:04        0        10    0.00    0.50    0.00    0.00    0.50     2  rcu_sched
07:11:04        0        29    0.00    0.50    0.00    0.00    0.50     3  kworker/u8:1-flush-ubifs_0_1
07:11:04        0      3149   36.00   12.50    0.00    0.00   48.50     3  prometheus-node
07:11:04      453      7684    0.50    1.00    0.00    0.00    1.50     1  dnsmasq
07:11:04        0     22210    0.50    0.50    0.00    0.00    1.00     1  dropbear
07:11:04        0     22219    0.00    1.50    0.00    0.00    1.50     2  pidstat
07:11:04        0     22220    0.00    0.50    0.00    0.00    0.50     2  tee

07:11:04      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:11:06        0         9    0.00   80.60    0.00    0.00   80.60     0  ksoftirqd/0
07:11:06      453      7684    0.00    0.50    0.00    0.00    0.50     1  dnsmasq
07:11:06        0     22219    0.50    1.00    0.00    0.00    1.49     2  pidstat
07:11:06        0     22220    0.50    0.00    0.00    0.00    0.50     3  tee

07:11:06      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:11:08        0         9    0.00   31.00    0.00    0.00   31.00     0  ksoftirqd/0
07:11:08        0        10    0.00    0.50    0.00    0.00    0.50     3  rcu_sched
07:11:08        0       776    0.00    0.50    0.00    0.00    0.50     1  urngd
07:11:08      453      7684    0.50    1.50    0.00    0.00    2.00     1  dnsmasq
07:11:08        0     22219    0.50    1.50    0.00    0.00    2.00     2  pidstat

07:11:08      UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:11:10        0         9    0.00   15.50    0.00    0.00   15.50     0  ksoftirqd/0
07:11:10      453      7684    0.00    1.50    0.00    0.00    1.50     1  dnsmasq
07:11:10        0     22219    0.50    1.00    0.00    0.00    1.50     2  pidstat


Thanks!

CrAsH0v3r · April 8, 2022, 12:16pm

RAM Trace is too big.
Both files are available here
https://drive.google.com/drive/folders/104qMM94waBAEwGI8MwnFybkWeUNfd8M-?usp=sharing

Just uploaded 2 "long" traces without any speedtests. I don't see anything wrong. I shut Prometheus down.
But the speed is still slow.

anon89577378 · April 8, 2022, 9:08pm

Indicates that the device is under heavy interrupt load.

Caused by IRQs coming in faster than the system can handle them.

They are queued for later processing by ksoftirqd.

Run...

cat /proc/interrupts

... and post the results.

Your device target is ramips/mt7621, which is a DSA device.

Even though the EA7300 has 2 CPUs, only one is being used for DSA devices in 21.02.

I suspect the best option is to roll back to 19.07.n, until the issue is fixed.

CrAsH0v3r · April 8, 2022, 11:49pm

During a speedtest, this command gives :

           CPU0       CPU1       CPU2       CPU3
  8:   28589101   28589132   28589122   28589117  MIPS GIC Local   1  timer
  9:    1157445          0          0          0  MIPS GIC  63  IPI call
 10:          0      41179          0          0  MIPS GIC  64  IPI call
 11:          0          0     988399          0  MIPS GIC  65  IPI call
 12:          0          0          0      41810  MIPS GIC  66  IPI call
 13:     153062          0          0          0  MIPS GIC  67  IPI resched
 14:          0     633781          0          0  MIPS GIC  68  IPI resched
 15:          0          0     618748          0  MIPS GIC  69  IPI resched
 16:          0          0          0     564527  MIPS GIC  70  IPI resched
 17:          0          0          0          0  MIPS GIC  19  1e000600.gpio-bank0, 1e000600.gpio-bank1, 1e000600.gpio-bank2
 19:         12          0          0          0  MIPS GIC  33  ttyS0
 20:          0          0          0          0  MIPS GIC  29  xhci-hcd:usb1
 21:   33780702          0          0          0  MIPS GIC  10  1e100000.ethernet
 23:         23          0          0          0  MIPS GIC  11  mt7603e
 24:         18          0          0          0  MIPS GIC  31  mt7615e
 26:          0          0          0          0  1e000600.gpio  17  keys
 27:          0          0          0          0  1e000600.gpio  12  keys
ERR:          1

CrAsH0v3r · April 8, 2022, 11:51pm

And after fresh reboot

           CPU0       CPU1       CPU2       CPU3
  8:      10998      11028      11038      11011  MIPS GIC Local   1  timer
  9:      19175          0          0          0  MIPS GIC  63  IPI call
 10:          0       6531          0          0  MIPS GIC  64  IPI call
 11:          0          0      18069          0  MIPS GIC  65  IPI call
 12:          0          0          0       6388  MIPS GIC  66  IPI call
 13:       3946          0          0          0  MIPS GIC  67  IPI resched
 14:          0       3963          0          0  MIPS GIC  68  IPI resched
 15:          0          0       4043          0  MIPS GIC  69  IPI resched
 16:          0          0          0       4662  MIPS GIC  70  IPI resched
 17:          0          0          0          0  MIPS GIC  19  1e000600.gpio-bank0, 1e000600.gpio-bank1, 1e000600.gpio-bank2
 19:         12          0          0          0  MIPS GIC  33  ttyS0
 20:          0          0          0          0  MIPS GIC  29  xhci-hcd:usb1
 21:      10788          0          0          0  MIPS GIC  10  1e100000.ethernet
 23:         23          0          0          0  MIPS GIC  11  mt7603e
 24:         18          0          0          0  MIPS GIC  31  mt7615e
 26:          0          0          0          0  1e000600.gpio  17  keys
 27:          0          0          0          0  1e000600.gpio  12  keys
ERR:          1

Warlock · April 9, 2022, 3:11am

Since your router is one of those making use of MediaTek MT7621A - MT7615N combination,
Provided you're comfortable installing LuCi manually,
You could give Development Snapshot builds a try and see if that improves the situation.

Due to bug(s),
I stopped testing/using the stable releases on DIR-878 and DIR-882 (MediaTek MT7621A - MT7615N) routers.

CrAsH0v3r · April 9, 2022, 10:40am

I can do that. But when I bought the router, I installed by mistake a dev snapshot instead of the stable one and configured LUCI myself. But I don't remember which one, but I think that there were a couple other packages that were not comptatible.

I may take a look at this option after Easter because I'll be out of the country a couple a days and I dont want to make the situation worse before leaving.

For now I'll schedule a reboot every night as a temporary patch..

CrAsH0v3r · May 31, 2022, 2:14pm

Hi all,

Just a little follow up on this one.. I may have found the (or one of the) guilty package.

Few weeks ago, I updated my router to the latest version, and the issue diseapered.

I did not reinstall any of my previous packages(except VPN) to keep the setup clean.

After a few weeks running without issue, last week I wanted to limit bandwidth for a specific device, so after reading, I installed NFT-QOS (which was installed before but unused).

I was unabled to get it to work properly.. So I unchecked the "active" box in Lucy and I left it like that.
Two days later, my Internet speed was veeeeery slow, on all devices. As I knew that the only difference was the NFT-QOS "inactive" package, I decided to completely remove it.
I'm now on the sixth day without reboot with no issue at all.

So lesson learned, I'll go one package at a time and wait a couple of days to see if there are issues!

Thanks all for help !

system · June 10, 2022, 2:14pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.