Xiaomi AX3600: ath11k firmware crash - qcom-q6v5-wcss-pil cd00000.q6v5_wcss: fatal error received:

So you did get 17 on 1 radio. Hmm... when the other one got 128.

Looking at the code I can't figure out how this is supposed to work.
This is when 128 is fixed and then we have a patch to check if 128 is present and make it random between 1 and 63, right?

the patch

But earlier it was a simpler code from robi

@nbd Could you please explain how today's code is working and setting the he_bss_color? I always get 128 in my hostapd. Am I doing something wrong in my config, do I have to enable an option to make it work?
You are refering to - add option to disable bss color / spatial reuse. But I can't figure out how...

Should he_bss_color:128 \ be something else? Robi used he_bss_color only.
Sorry if I sound like a dumb person and just don't get it :slight_smile: But I'm just curious why I always end up with 128 and have to manually set the color number.

Isn't the 17 because of my config file (see above)?

Ah, yes it is. Sorry... didn't see that. :+1:

I have lately also had longer-than-earlier uptimes with DL-WRX36, even with multiple SSIDs enabled.

I wonder if some hostapd / mac80211 / nl80211 / whatever change has magically removed the error. E.g. mac80211: ath11k: sync with ath-next

... and still crashed after 5 days.
I hope a new firmware brings a solution.

1 Like

Mine crashes in the first 24 hours with recent snapshots. I'm again back to r23845. I can get the crashes in shorter intervals by letting steam download large games. Interestingly, the "steam" computer is connected via LAN cable and the Openwrt device acts as a gateway.

I use the following cron script (runs every 5 minutes) which reboots when the firmware error occurs:

#!/bin/sh
/sbin/logread -l 100 | /bin/grep -n "failed to send"
if [ $? -eq 0 ]; then
    /sbin/reboot
fi

I have now installed R24328 with the IRQ settings script...

1 Like

Until now not crashing...(there was another network problem)

1 Like

Does it work more stable going back to r23845?

I had no crashes since more than 6 days.

I am freshly trying on r24403 with the irq set affinity script now.

So far all good. About 1d uptime and still 145 mb ram free where I had 40 or less in the past without running irqbalance script.

Looks still good...

Where can I get this page @uweklatt
Still fine since running irqbalance on startup, even though I scheduled a daily reboot.

@Catfriend1 The graph is part of luci-app-statistics.

1 Like

Normal LuCI statistics, if you have that installed.

Memory consumption varies according to SSIDs enabled. As I have been testing various wifi 802.11r configs, there is lots of variation in the last few weeks in my own DL-WRX36...

Ps. LuCI now offers the possibility to hide "free", which amplifies the actual memory consumption.

1 Like

fd90f5fc-fec5-4f69-bcb3-42d17ca2265d

My wifi, since this morning, dies too.
Same crash...

Sat Nov 25 09:58:36 2023 daemon.info hostapd: wifi.2: STA 74:c6:3b:**:**:** IEEE 802.11: authenticated
Sat Nov 25 09:58:36 2023 daemon.notice hostapd: wifi.2: AP-STA-DISCONNECTED 74:c6:3b:**:**:**
Sat Nov 25 09:58:36 2023 daemon.err collectd[2717]: Sleeping only 2s because the next interval is 163412.286 seconds in the past!
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] qcom-q6v5-wcss-pil cd00000.q6v5_wcss: fatal error received:
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] QC Image Version: QC_IMAGE_VERSION_STRING=WLAN.HK.2.9.0.1-01890-QCAHKSWPL_SILICONZ-1
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] Image Variant : IMAGE_VARIANT_STRING=8074.wlanfw.eval_v2Q
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843]
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] wal_peer_control.c:2904 Assertion is_graceful_to_handle failedparam0 :zero, param1 :zero, param2 :zero.
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] Thread ID      : 0x00000060  Thread name    : WLAN RT1  Process ID     : 0
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] Register:
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] SP : 0x4bfd5a48
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] FP : 0x4bfd5a50
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] PC : 0x4b1080c4
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] SSR : 0x00000008
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] BADVA : 0x00020000
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] LR : 0x4b107860
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843]
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] Stack Dump
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] from : 0x4bfd5a48
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843] to   : 0x4bfd62a8
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.554843]
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.603967] remoteproc remoteproc0: crash detected in cd00000.q6v5_wcss: type fatal error
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.626166] remoteproc remoteproc0: handling crash #1 in cd00000.q6v5_wcss
Sat Nov 25 09:58:36 2023 kern.err kernel: [   46.634308] remoteproc remoteproc0: recovering cd00000.q6v5_wcss
Sat Nov 25 09:58:36 2023 kern.info kernel: [   46.666976] remoteproc remoteproc0: stopped remote processor cd00000.q6v5_wcss
Sat Nov 25 09:58:36 2023 kern.warn kernel: [   46.680552] ath11k c000000.wifi: failed to transmit frame -108
Sat Nov 25 09:58:36 2023 kern.warn kernel: [   46.681334] ath11k c000000.wifi: failed to transmit frame -108
Sat Nov 25 09:58:37 2023 kern.warn kernel: [   46.972704] ath11k c000000.wifi: failed to find peer 74:c6:3b:**:**:** on vdev 3 after creation

Do you use irqbalance on startup?

Uptime with irqbalance script is now more than 19 days.

1 Like