Archer C7 : Atheros QCA9880 802.11nac random crashes since kernel 4.14.94

I self-build all my images, so my versions are whatever happened to be on master that day.

I migrate this evening my install to 4.19 and i will tell you if it crash, it happen after 2 days uptime today

HW Offloading is broken with 4.19 kernel, i will reinstall a 4.14 kernel, my gateway is overloaded all the time with my gigabit connection

The NAT Passthrouth is at 390 Mbit/s with 4.19, 890 Mbit/s with 4.14

Currently HW offload is only available on MT7621 device, so you should only be enabling SW offload on C7V2.

I don't know why NAT passthrouth are so bad with OpenWrt SNAPSHOT r10307-629e6538a1 / LuCI Master (f138fc93)

[  6]   0.00-1.00   sec  39.4 MBytes   331 Mbits/sec                  
[  6]   1.00-2.00   sec  34.2 MBytes   287 Mbits/sec                  
[  6]   2.00-3.00   sec  29.4 MBytes   246 Mbits/sec    

With 19.07-SNAPSHOT with 4.14 kernel sw_offload 1

$ iperf3 -c 192.168.1.17
Connecting to host 192.168.1.17, port 5201
[ 5] local 192.168.10.103 port 55753 connected to 192.168.1.17 port 5201
[ ID] Interval Transfer Bitrate
[ 5] 0.00-1.00 sec 110 MBytes 923 Mbits/sec
[ 5] 1.00-2.00 sec 108 MBytes 904 Mbits/sec
[ 5] 2.00-3.00 sec 113 MBytes 948 Mbits/sec
[ 5] 3.00-4.00 sec 110 MBytes 921 Mbits/sec
[ 5] 4.00-5.00 sec 110 MBytes 923 Mbits/sec
[ 5] 5.00-6.00 sec 112 MBytes 941 Mbits/sec
[ 5] 6.00-7.00 sec 110 MBytes 926 Mbits/sec
[ 5] 7.00-8.00 sec 112 MBytes 941 Mbits/sec
[ 5] 8.00-9.00 sec 111 MBytes 928 Mbits/sec
[ 5] 9.00-10.00 sec 110 MBytes 922 Mbits/sec 

With master-SNAPSHOT with 4.19 kernel, sw_offload 0 :

$ iperf3 -c 192.168.1.17
Connecting to host 192.168.1.17, port 5201
[  5] local 192.168.10.103 port 53897 connected to 192.168.1.17 port 5201
[ ID] Interval           Transfer     Bitrate
[  5]   0.00-1.00   sec  77.2 MBytes   648 Mbits/sec                  
[  5]   1.00-2.00   sec  74.8 MBytes   628 Mbits/sec                  
[  5]   2.00-3.00   sec  75.0 MBytes   629 Mbits/sec                  
[  5]   3.00-4.00   sec  75.6 MBytes   634 Mbits/sec                  
[  5]   4.00-5.00   sec  77.0 MBytes   646 Mbits/sec                  
[  5]   5.00-6.00   sec  77.7 MBytes   652 Mbits/sec                  
[  5]   6.00-7.00   sec  77.5 MBytes   650 Mbits/sec                  
[  5]   7.00-8.00   sec  76.6 MBytes   642 Mbits/sec                  
[  5]   8.00-9.00   sec  77.6 MBytes   651 Mbits/sec                  
[  5]   9.00-10.00  sec  77.8 MBytes   652 Mbits/sec                  
- - - - - - - - - - - - - - - - - - - - - - - - -
[ ID] Interval           Transfer     Bitrate
[  5]   0.00-10.00  sec   767 MBytes   643 Mbits/sec                  sender
[  5]   0.00-10.00  sec   767 MBytes   643 Mbits/sec                  receiver

With master-SNAPSHOT with 4.19 kernel, sw_offload 1 :

$ iperf3 -c 192.168.1.17
Connecting to host 192.168.1.17, port 5201
[  5] local 192.168.10.103 port 53930 connected to 192.168.1.17 port 5201
[ ID] Interval           Transfer     Bitrate
[  5]   0.00-1.00   sec  63.5 MBytes   533 Mbits/sec                  
[  5]   1.00-2.00   sec  44.9 MBytes   377 Mbits/sec                  
[  5]   2.00-3.00   sec  44.2 MBytes   370 Mbits/sec                  
[  5]   3.00-4.00   sec  46.0 MBytes   387 Mbits/sec                  
[  5]   4.00-5.00   sec  46.1 MBytes   386 Mbits/sec                  
[  5]   5.00-6.00   sec  47.0 MBytes   394 Mbits/sec                  
[  5]   6.00-7.00   sec  47.4 MBytes   397 Mbits/sec                  
[  5]   7.00-8.00   sec  45.0 MBytes   378 Mbits/sec                  
[  5]   8.00-9.00   sec  47.7 MBytes   400 Mbits/sec                  
[  5]   9.00-10.00  sec  47.2 MBytes   396 Mbits/sec                  
- - - - - - - - - - - - - - - - - - - - - - - - -
[ ID] Interval           Transfer     Bitrate
[  5]   0.00-10.00  sec   479 MBytes   402 Mbits/sec                  sender
[  5]   0.00-10.00  sec   479 MBytes   402 Mbits/sec                  receive

So the master image has a performance problem with sw offloading as iperf show us here

I will rollback to a working image (except 5Ghz) of 19.07

Hello

I had do this modif on my repo of openwrt master

diff --git a/target/linux/ath79/Makefile b/target/linux/ath79/Makefile
index cfdf8c87e5..82f6317ac6 100644
--- a/target/linux/ath79/Makefile
+++ b/target/linux/ath79/Makefile
@@ -8,7 +8,7 @@ SUBTARGETS:=generic nand tiny
 
 FEATURES:=ramdisk
 
-KERNEL_PATCHVER:=4.19
+KERNEL_PATCHVER:=4.14
 
 include $(INCLUDE_DIR)/target.mk

and iperf3 is so good, so there is an real issue with soft offload and 4.19 kernel

$ iperf3 -c 192.168.1.17
Connecting to host 192.168.1.17, port 5201
[  5] local 192.168.10.103 port 55868 connected to 192.168.1.17 port 5201
[ ID] Interval           Transfer     Bitrate
[  5]   0.00-1.00   sec   112 MBytes   942 Mbits/sec                  
[  5]   1.00-2.00   sec  97.9 MBytes   821 Mbits/sec                  
[  5]   2.00-3.00   sec   113 MBytes   948 Mbits/sec                  
[  5]   3.00-4.00   sec   113 MBytes   946 Mbits/sec                  
[  5]   4.00-5.00   sec   112 MBytes   941 Mbits/sec                  
[  5]   5.00-6.00   sec   113 MBytes   947 Mbits/sec                  
[  5]   6.00-7.00   sec   113 MBytes   944 Mbits/sec                  
[  5]   7.00-8.00   sec   113 MBytes   948 Mbits/sec                  
[  5]   8.00-9.00   sec   113 MBytes   948 Mbits/sec                  
[  5]   9.00-10.00  sec   113 MBytes   948 Mbits/sec                  
- - - - - - - - - - - - - - - - - - - - - - - - -
[ ID] Interval           Transfer     Bitrate
[  5]   0.00-10.00  sec  1.09 GBytes   933 Mbits/sec                  sender
[  5]   0.00-10.00  sec  1.09 GBytes   933 Mbits/sec                  receiver

So now i'm running master repo with 4.14 kernel and it's ok, so i will check in two day if 5Ghz crash or not

Just logged into console on an EA8300 on my bench and saw

[83109.818200] ath10k_pci 0000:01:00.0: SWBA overrun on vdev 0, skipped old beacon
[83109.818319] ath10k_pci 0000:01:00.0: SWBA overrun on vdev 0, skipped old beacon
[83109.824546] ath10k_pci 0000:01:00.0: SWBA overrun on vdev 0, skipped old beacon
[83109.831847] ath10k_pci 0000:01:00.0: SWBA overrun on vdev 0, skipped old beacon
[83109.839069] ath10k_pci 0000:01:00.0: SWBA overrun on vdev 0, skipped old beacon
[83109.847883] ath10k_ahb a800000.wifi: SWBA overrun on vdev 0, skipped old beacon
[83109.853640] ath10k_ahb a800000.wifi: SWBA overrun on vdev 0, skipped old beacon

so on both the IPQ4019 and the QCA9888

root@test:/# opkg list-installed | fgrep ath10
ath10k-firmware-qca4019-ct - 2018-10-10-d366b80d-1
ath10k-firmware-qca9888-ct - 2018-10-10-d366b80d-1
kmod-ath10k-ct - 4.19.53+2019-06-13-a045b1ce-1

root@test:/# uname -a
Linux [...] 4.19.53 #0 SMP Fri Jun 21 18:15:51 2019 armv7l GNU/Linux

Seeming more and more like a firmware/driver issue.

Hi

I'm deployed master firmware on my c7 since my last post: no 5ghz crashs since 2 days now, I hope it will continue.

The second archer c7 with previous 19.07 snapshot has crashed at 24h uptime precise, each day.

But I run with 4.14 Kernel because 4.19 won't offload with my archers c7

FWIW, I was one of the two linked instances, and Ive been having the problem on 4.14 but not ath79 bearing snapshots

The ath10k-ct works with latest version but has a memory leak, so I rollback to non ct and it work fine 300 Mbps real instead of 400 but stability of link and of the radio

To add to my prior mentioned 5ghz drops on pre 18.06.04 release snapshots, Ive now had my 2.4ghz radio go non responsive a few days after I loaded 18.06.4, but no 5ghz issues yet. Nothing at all in the logs.

Target ar71xx ?

If you migrated to ath79 from ar71xx the file path of wireless devices is not the same, you need to delete wireless conf in /etc/config for generate a new one by a reboot of router

Actually, I havent migrated. Had been usi g 18.x.x snapshots and 18.06.x only.