FWIW, I've been running NSS builds from around 10/1 on 2x MX4300 dumb APs w/ almost no problems.

Non NSS has other issues unfortunately. The whole QCA platform is just flawed at least driver wise and stock is completely outdated.

hi @sppmaster and @anom3, would you please check on your 301w, the result of cat /proc/cpuinfo ?

on my build(which is from qosmio's repo), the result does not have "model name"

below is the screenshot of my build

and below is the screenshot of a vanilla openwrt installed on the same 301w

Thanks in advance!

1 Like

Here it is on my NSS build.

root@QNAP:~# cat /proc/cpuinfo
processor       : 0
BogoMIPS        : 38.40
Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid
CPU implementer : 0x41
CPU architecture: 8
CPU variant     : 0x0
CPU part        : 0xd03
CPU revision    : 4

processor       : 1
BogoMIPS        : 38.40
Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid
CPU implementer : 0x41
CPU architecture: 8
CPU variant     : 0x0
CPU part        : 0xd03
CPU revision    : 4

processor       : 2
BogoMIPS        : 38.40
Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid
CPU implementer : 0x41
CPU architecture: 8
CPU variant     : 0x0
CPU part        : 0xd03
CPU revision    : 4

processor       : 3
BogoMIPS        : 38.40
Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid
CPU implementer : 0x41
CPU architecture: 8
CPU variant     : 0x0
CPU part        : 0xd03
CPU revision    : 4


Thanks, then its confirmed that the NSS build does not show "model name"...

@qosmio would you mind take a look at this, and figure out why...

do you use vlan/mDNS reflector ? do you have one ssid where your streaming devices are connected(VLAN1) and another SSID where your phone is connected(VLAN2) and can get it to reliably work?

1 Like

I don't use mdns. I use multiple VLANs (with different SSID) and 802.11r. Wired backhaul.

my setup is similar with mx4300(just with additiional mDNS reflector which causes issues when one vlan has streaming device and another vlan is trying to access airplay etc. i have a unifi AP too in the mix and it does not exhibit this issue). have noticed with the setup though, if you have management Ip's on your vlans(for the ap) defined, then AP does not like it when you try to access Luci using a different vlan management ip than the current client device vlan. its very slow (firewall allowing). similar for iperf tests

For me STA/client mode is working on my ax3600 running an old AgustinLorenzo prebuilt (NSS-WiFi) dated 2024-08-26-2004
A few releases later the 2024-09-14-1545 build already showed the problem, i've not tested the inbetween releases.

It might be related to the version of nss firmware.
12.x does not like mesh etc. for that you need to revert to the 11.4

@qosmio can you add that to your repo? Maybe this finally solves the broadcast misery

https://lore.kernel.org/linux-wireless/Z2Q9POuV-6MIdzRf@pilgrim/T/#t

2 Likes

Hi guys. I wish you happy holidays.
I want to report an unexpected return of an year old issue.

I haven't had any major issue for a year.

The above was true during the year and now suddenly when a torrent is run (on a PC) and is saved to SSD (attached to USB3 router port using ksmbd for SMB sharing), Luci web interface and SSH completely hang.
During the time the router has hanged, no DNS requests can be made (so there is no Internet), already established connections and traffic continue to work but no new connections are possible. No OOM or other crashes but it seems like all services do not respond.
Once I stop the torrent both Luci and SSH recover immediately. DNS and other services recover too and I can access the internet like nothing wrong has ever happened.

These errors can be seen in the log. Obviously a consequence of the services stall.

daemon.notice hostapd: nl80211: nl80211_recv_beacons->nl_recvmsgs failed: -5
daemon.warn collectd[4112]: Sleeping only 2s because the next interval is 112.108 seconds in the past!
daemon.notice hostapd: nl80211: wpa_driver_nl80211_event_receive->nl_recvmsgs failed: -5
daemon.warn collectd[4112]: plugin_read_thread: read-function of the `cpufreq' plugin took 518.148 seconds, which is above its read interval (30.000 seconds). You might want to adjust the `Interval' or `ReadThreads' settings.
daemon.warn collectd[4112]: plugin_read_thread: read-function of the `thermal' plugin took 516.716 seconds, which is above its read interval (30.000 seconds). You might want to adjust the `Interval' or `ReadThreads' settings.

If I check the statistics all the graphics have empty intervals when the hang occurred.

root@QNAP:~# nss_diag
     MODEL: QNAP 301w
   OPENWRT: r28489-408dbcb419
IPQ BRANCH: main-nss
IPQ COMMIT: 408dbcb419
  IPQ DATE: 2024-12-16
    NSS FW: NSS.FW.12.2-161-HK.R
  MAC80211: v6.11.2-0-g7aa21fec187b
 ATH11K FW: WLAN.HK.2.9.0.1-02146-QCAHKSWPL_SILICONZ-1
 INTERFACE: br-lan     tx-checksumming: on  rx-gro-list: off
            10g-1      tx-checksumming: on  rx-gro-list: off
            10g-2      tx-checksumming: on  rx-gro-list: off
            lan1       tx-checksumming: on  rx-gro-list: off
            lan2       tx-checksumming: on  rx-gro-list: off
            lan3       tx-checksumming: on  rx-gro-list: off
            lan4       tx-checksumming: on  rx-gro-list: off
            phy0-ap0   tx-checksumming: on  rx-gro-list: off
            phy1-ap0   tx-checksumming: on  rx-gro-list: off

  NSS PKGS: kmod-qca-mcs-6.6.65.12.5.2024.02.27~26d6424-r1 aarch64_cortex-a53 {feeds/nss_packages/qca-mcs} () [installed]
            kmod-qca-nss-dp-6.6.65.2024.04.16~5bf8b91e-r1 aarch64_cortex-a53 {feeds/base/kernel/qca-nss-dp} () [installed]
            kmod-qca-nss-drv-6.6.65.12.5.2024.04.06~53a0dc1-r15 aarch64_cortex-a53 {feeds/nss_packages/qca-nss-drv} () [installed]
            kmod-qca-nss-drv-bridge-mgr-6.6.65.12.5.2024.06.12~1bcef16-r7 aarch64_cortex-a53 {feeds/nss_packages/qca-nss-clients} () [installed]
            kmod-qca-nss-drv-igs-6.6.65.12.5.2024.06.12~1bcef16-r7 aarch64_cortex-a53 {feeds/nss_packages/qca-nss-clients} () [installed]
            kmod-qca-nss-drv-qdisc-6.6.65.12.5.2024.06.12~1bcef16-r7 aarch64_cortex-a53 {feeds/nss_packages/qca-nss-clients} () [installed]
            kmod-qca-nss-drv-vlan-mgr-6.6.65.12.5.2024.06.12~1bcef16-r7 aarch64_cortex-a53 {feeds/nss_packages/qca-nss-clients} () [installed]
            kmod-qca-nss-ecm-6.6.65.12.5.5.2024.09.02~bd5057b-r3 aarch64_cortex-a53 {feeds/nss_packages/qca-nss-ecm} () [installed]
            kmod-qca-ssdk-6.6.65.2024.06.13~c451136b-r3 aarch64_cortex-a53 {feeds/base/kernel/qca-ssdk} () [installed]
            nss-firmware-default-2024.08.04~794fe373-r1 aarch64_cortex-a53 {feeds/nss_packages/firmware/nss-firmware} () [installed]
            nss-firmware-ipq8074-2024.08.04~794fe373-r1 aarch64_cortex-a53 {feeds/nss_packages/firmware/nss-firmware} () [installed]

This happens only when the torrent is saved to a ksmbd share on a SSD connected to the router USB3 port.
If I save the torrent to a local PC disk drive this hang never occurs.
I cannot tell for sure if ksmbd causes the hang but watching htop (till the SSH hangs) I can see that several services like dnsmasq, https-dns-proxy cause high CPU usage and then the router stops responding.

Hi @sppmaster,

Regarding the latter, OpenWRT updated the mac80211 backports to version 6.12.6, so we need a refresh of the NSS patches to be able to reapply them correctly.

In the meantime this is the last commit we can go to without fail: https://github.com/openwrt/openwrt/commit/de0c143742517d401c4730137f092be8fb7e882a

FYI @qosmio

Regards, Agustin

1 Like

@qosmio Hello, are you interested in adapting the 6.12 kernel? I've already completed most of the work, and the system is running stably, but there are errors related to wireless functionality. If you're interested, you can check out https://github.com/LiBwrt/openwrt-6.x.git The related bug link is: https://github.com/breeze303/openwrt-ci/issues/35

5 Likes

never mind, I managed to find the root cause...

in order to "let /proc/cpuinfo show model name", a patch is needed.

Ref: https://lore.kernel.org/lkml/1472461345-28219-1-git-send-email-sumitg@nvidia.com/

1 Like

You can take this patch

If you want to go beyond mac80211 6.12 backport

2 Likes

Here's what it looks like when it's on the machine

1 Like

Hi @asvio,

Perfect, thank you

I didn't see that you had already done this task.

I am going to prepare a new build with the new backports + this patch (https://git.codelinaro.org/clo/qsdk/oss/system/feeds/wlan-open/-/blob/win.wlan_host_opensource.3.0.r24/patches/ath11k/350-ath11k-Revert-clear-the-keys-properly-when-DISABLE_K.patch) to try to fix the broadcast failure that @Crect mentions

@asvio can you try that patch? https://github.com/AgustinLorenzo/openwrt/commit/e09b9b1457e338be20e3a7a1356d50f9c19bfe01 (I have good feedback on the devices I currently compile.) @qosmio can you integrate?

Regards, Agustin

I'm actually running on my nbg7815 this patch to see how It woks.

I will do after i see there is no problem with current patches.

There seems to be a memory leak.
image