Belkin RT3200/Linksys E8450 WiFi AX discussion

PeterPan · October 1, 2021, 7:01pm

Hi Lynx,
I have it at 80Mhz, did only following to the howto.
Its stable as rock (Uptime 15 days), and its fast enough toward my needs.

Fasttransition works in my environment, measured with winfi app my windows laptop switched smooth with no ping loss to the mesh ap with the strongest signal.
I'm using wireguard, but my gateway is my internet router, so I do not need port forwarding.

alleyu2 · October 2, 2021, 5:02am

Hi. I am on snapshot build 17598 and somehow I am not able to upgrade via attendedsysupgrade and auc. Attendedsysupgrade is not downloading the new build SNAPSHOT (r17648-16e83a7491) and also auc is displaying ssl error. Screenshot attached.

drikus · October 2, 2021, 6:43am

Both online image builders @
https://asu.aparcar.org/
https://chef.libremesh.org/
uses a Let's Encrypt certificate issued with a root CA in the chain with CN = DST Root CA X3 and got expired on 30th sept. older openssl systems seems to continue due to another kind of chain validation implementation but with wolfssl systems seem to have more issues. So could be depending on the implementation or the chain offered by the server. You might also try to remove the expired cert in /etc/ssl/certs/ca-certificates.crt if this cert is used to build the chain.

Lynx · October 2, 2021, 9:21am

OK devs, I have three RT3200's but one of them gets into a bad state about 1 in 3 boots in which the 5Ghz radio driver initialization fails and I see kernel errors.

hexdump mtdblock2:

00000000  22 76 02 00 e8 9f 80 d5  d5 20 00 00 00 00 00 00  |"v....... ......|
00000010  83 55 4b 00 00 00 00 00  00 00 00 00 00 00 00 00  |.UK.............|
00000020  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000030  00 00 00 00 44 00 00 20  00 00 00 00 10 00 20 00  |....D.. ...... .|
00000040  00 00 44 00 04 00 00 00  00 00 00 00 00 00 00 00  |..D.............|
00000050  20 00 00 bb 40 b0 c3 c3  26 c0 80 80 41 c4 26 c1  | ...@...&...A.&.|
00000060  c0 c0 41 c4 26 c1 c0 c0  c0 c5 26 c0 80 80 00 00  |..A.&.....&.....|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000b0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 c6 c6  |................|
000000c0  c3 c3 c2 c1 00 c3 00 c3  00 83 83 83 85 c1 c1 82  |................|
000000d0  85 85 85 82 00 00 00 00  00 00 00 00 00 00 00 00  |................|
000000e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
000000f0  86 00 2e 00 a1 00 91 87  88 00 00 00 00 00 00 00  |................|
00000100  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000120  0b 00 00 09 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000130  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000150  00 00 00 00 00 00 00 00  00 00 00 00 00 00 77 07  |..............w.|
00000160  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000400  ff ff ff ff ff ff ff ff  ff ff ff ff ff ff ff ff  |................|
*
00005000  15 79 00 00 e8 9f 80 d5  d5 21 00 0c 43 26 59 97  |.y.......!..C&Y.|
00005010  15 79 c3 14 00 80 02 00  15 79 c3 14 ec 98 01 00  |.y.......y......|
00005020  16 79 c3 14 00 80 02 00  16 79 c3 14 00 00 01 00  |.y.......y......|
00005030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005050  01 00 94 00 00 00 00 00  00 00 00 82 55 4b 00 00  |............UK..|
00005060  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00005070  30 34 00 00 00 00 07 ad  84 00 00 00 00 00 00 00  |04..............|
00005080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000050a0  00 00 00 00 00 00 00 00  b1 a9 5d ef b8 7a c9 33  |..........]..z.3|
000050b0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005190  24 52 06 00 28 00 00 15  00 00 00 00 00 00 00 00  |$R..(...........|
000051a0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000051d0  01 00 fe fd fd fd f8 f9  01 00 fe fd fd fd f8 f9  |................|
000051e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005230  00 00 00 00 00 00 00 00  00 00 e4 e4 ee ee fa fa  |................|
00005240  06 06 16 16 df df e7 e7  f3 f3 fb fb 03 03 0b 0b  |................|
00005250  25 25 c6 c6 c2 c2 c2 00  00 c2 00 c2 00 00 81 82  |%%..............|
00005260  85 85 c2 c2 00 81 81 82  85 85 00 c2 c2 00 00 81  |................|
00005270  82 85 85 87 87 c2 c2 00  81 81 82 85 85 87 87 c2  |................|
00005280  c2 00 00 81 82 85 85 87  87 c2 c2 00 00 81 82 85  |................|
00005290  85 87 87 c2 c2 00 00 81  82 85 85 87 87 c4 c2 c2  |................|
000052a0  c1 00 c3 00 c3 c3 c1 00  81 83 83 c3 c3 c3 c1 00  |................|
000052b0  81 83 83 00 c2 c2 c2 c0  81 82 84 84 00 00 00 c3  |................|
000052c0  c3 c3 c1 00 81 83 83 86  86 c3 c3 c3 c1 00 81 83  |................|
000052d0  83 86 86 c2 c2 c2 c0 81  82 84 84 86 86 c3 c3 c3  |................|
000052e0  c1 00 81 83 83 86 86 c3  c3 c3 c1 00 81 83 83 86  |................|
000052f0  86 c3 c3 c3 c1 00 81 83  83 86 86 02 28 00 02 28  |............(..(|
00005300  00 02 28 00 02 28 00 00  00 00 00 00 00 00 00 00  |..(..(..........|
00005310  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00005320  00 00 00 00 00 00 00 00  00 00 00 00 00 4c 58 65  |.............LXe|
00005330  71 7d 00 4d 59 65 72 7e  00 4c 58 64 70 7c 00 4c  |q}.MYer~.LXdp|.L|
00005340  58 64 70 7c 88 88 88 88  03 03 03 26 26 26 26 26  |Xdp|.......&&&&&|
00005350  26 26 26 00 03 03 03 26  26 26 26 26 26 26 26 00  |&&&....&&&&&&&&.|
00005360  03 03 03 26 26 26 26 26  26 26 26 00 03 03 03 26  |...&&&&&&&&....&|
00005370  26 26 26 26 26 26 26 00  00 00 00 00 00 00 00 00  |&&&&&&&.........|
00005380  00 00 00 00 00 00 aa aa  aa aa aa aa aa aa aa 99  |................|
00005390  99 99 00 00 00 00 00 00  99 99 99 99 99 99 99 99  |................|
000053a0  99 88 88 88 99 99 99 99  99 99 99 99 99 99 99 99  |................|
000053b0  00 00 00 00 00 00 cc cc  cc dd dd dd cc cc cc bb  |................|
000053c0  bb bb 00 00 00 00 00 00  88 88 88 88 88 88 88 88  |................|
000053d0  88 88 88 88 88 88 88 88  88 88 99 99 99 99 99 99  |................|
000053e0  00 00 00 00 00 00 bb bb  bb cc cc cc dd dd dd cc  |................|
000053f0  cc cc 00 00 00 00 00 00  cc cc cc bb bb bb bb bb  |................|
00005400  bb bb bb bb bb bb bb bb  bb bb aa aa aa aa aa aa  |................|
00005410  00 00 00 00 00 00 bb bb  bb bb bb bb bb bb bb bb  |................|
00005420  bb bb 00 00 00 00 00 00  aa aa aa aa aa aa bb bb  |................|
00005430  bb bb bb bb aa aa aa bb  bb bb 99 99 99 aa aa aa  |................|
00005440  08 42 51 51 63 74 08 42  51 51 64 74 08 42 51 51  |.BQQct.BQQdt.BQQ|
00005450  64 74 08 42 51 51 64 74  08 42 51 51 64 74 08 42  |dt.BQQdt.BQQdt.B|
00005460  51 51 64 74 08 43 54 54  66 74 08 42 56 56 68 74  |QQdt.CTTft.BVVht|
00005470  08 42 51 51 63 72 08 42  51 51 63 74 08 42 50 50  |.BQQcr.BQQct.BPP|
00005480  63 74 08 42 51 51 63 75  08 42 51 51 63 74 08 42  |ct.BQQcu.BQQct.B|
00005490  51 51 63 74 08 42 53 53  65 76 08 42 55 55 66 79  |QQct.BSSev.BUUfy|
000054a0  08 42 51 51 64 74 08 42  51 51 64 75 08 42 51 51  |.BQQdt.BQQdu.BQQ|
000054b0  64 75 08 42 51 51 65 75  08 43 51 51 64 75 08 44  |du.BQQeu.CQQdu.D|
000054c0  52 52 65 76 08 44 54 54  67 78 08 43 56 56 68 7a  |RRev.DTTgx.CVVhz|
000054d0  08 42 51 51 63 74 08 43  52 52 64 76 08 42 51 51  |.BQQct.CRRdv.BQQ|
000054e0  64 74 08 42 51 51 64 74  08 44 53 53 66 77 08 44  |dt.BQQdt.DSSfw.D|
000054f0  53 53 66 77 08 44 55 55  68 79 08 43 57 57 69 7b  |SSfw.DUUhy.CWWi{|
00005500  88 88 88 88 88 88 88 88  88 88 88 88 88 88 88 88  |................|
*
00005520  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000057b0  00 00 00 00 00 00 00 00  00 00 00 00 00 26 25 02  |.............&%.|
000057c0  00 00 00 00 00 00 00 00  00 00 00 00 00 90 00 90  |................|
000057d0  00 90 00 90 00 00 00 00  00 00 00 00 00 00 00 00  |................|
000057e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005800  00 00 db a8 65 43 21 00  97 00 db a8 65 43 33 00  |....eC!.....eC3.|
00005810  87 00 ec b9 76 54 32 00  87 00 ca 97 54 32 21 00  |....vT2.....T2!.|
00005820  87 00 0c 0b 0c 0c 0f 09  0f 0e 00 00 00 00 00 03  |................|
00005830  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000058c0  82 a1 81 84 00 00 00 00  00 00 00 00 00 00 00 00  |................|
000058d0  00 e8 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
000058e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005900  43 c0 c3 c0 43 c1 43 c1  c0 cd 40 cc 40 cb 42 ca  |C...C.C...@.@.B.|
00005910  3e cc 3e cc 3d cc 40 ca  be cb be ca 3e cb c0 ca  |>.>.=.@.....>...|
00005920  3e cc 3e cc 3e cc be ca  c0 cd c0 cb 40 cb 42 ca  |>.>.>.......@.B.|
00005930  be cb 3e cb 3e cb 3e cc  40 ce c0 ca 40 ca c0 c9  |..>.>.>.@...@...|
00005940  40 c9 40 c9 40 c9 40 c9  00 00 00 00 00 00 00 00  |@.@.@.@.........|
00005950  41 cd 41 cd 41 cd 41 cd  00 00 00 00 00 00 00 00  |A.A.A.A.........|
00005960  c0 cf c0 cf c0 cf c0 cf  c0 cf c0 cf c0 cf c0 cf  |................|
00005970  c0 ce c0 ce c0 ce c0 ce  c0 ce c0 ce c0 ce c0 ce  |................|
00005980  c0 d3 c0 d1 40 d1 c0 d0  c0 d1 c0 d1 c0 d1 40 d3  |....@.........@.|
00005990  40 d4 40 d1 c0 d0 40 d0  c0 d0 c0 d0 c0 d0 40 d0  |@.@...@.......@.|
000059a0  02 ad 00 00 00 01 c0 00  c0 83 00 00 00 00 00 00  |................|
000059b0  65 ae 90 14 93 f0 88 87  18 18 18 18 18 18 18 18  |e...............|
000059c0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005a00  00 4c 58 64 70 7b 00 4c  58 64 71 7c 00 4b 58 63  |.LXdp{.LXdq|.KXc|
00005a10  70 7a 00 4b 58 63 70 7a  00 00 00 00 08 48 48 5a  |pz.KXcpz.....HHZ|
00005a20  69 7a 08 48 48 5a 69 7a  08 48 48 5a 69 7a 08 48  |iz.HHZiz.HHZiz.H|
00005a30  48 5a 69 7a 08 48 48 5a  69 7a 08 48 48 5a 69 7a  |HZiz.HHZiz.HHZiz|
00005a40  08 48 48 5a 69 7a 08 48  48 5a 69 7a 08 48 48 5a  |.HHZiz.HHZiz.HHZ|
00005a50  6c 75 08 48 48 5a 6c 75  08 48 48 5a 6c 75 08 48  |lu.HHZlu.HHZlu.H|
00005a60  48 5a 6c 75 08 48 48 5a  6c 75 08 48 48 5a 6c 75  |HZlu.HHZlu.HHZlu|
00005a70  08 48 48 5a 6c 75 08 48  48 5a 6c 75 08 47 47 5a  |.HHZlu.HHZlu.GGZ|
00005a80  69 7a 08 49 49 59 6c 79  08 48 48 59 6b 79 08 48  |iz.IIYly.HHYky.H|
00005a90  48 5a 6c 78 08 49 49 58  69 78 08 49 49 58 69 78  |HZlx.IIXix.IIXix|
00005aa0  08 48 48 58 6b 78 08 4a  4a 5b 6d 79 08 46 46 58  |.HHXkx.JJ[my.FFX|
00005ab0  69 79 08 48 48 59 6b 79  08 48 48 59 6b 78 08 48  |iy.HHYky.HHYkx.H|
00005ac0  48 59 6c 78 08 49 49 58  68 78 08 49 49 58 68 78  |HYlx.IIXhx.IIXhx|
00005ad0  08 49 49 57 6b 78 08 4a  4a 5b 6c 79 00 00 00 00  |.IIWkx.JJ[ly....|
00005ae0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00005af0  00 00 00 00 00 00 00 00  00 00 00 00 00 01 ff fe  |................|
00005b00  fd fb 00 ff ff fe fd fa  00 ff ff fe fd fa 00 ff  |................|
00005b10  ff fe fd fa 00 ff ff fe  fd fa 00 00 fe fc fb fa  |................|
00005b20  00 ff fe fc fc f9 00 ff  fe fc fb f8 00 ff fe fc  |................|
00005b30  fa fa 00 ff fe fc fa fa  00 01 ff fd fa fa 00 fe  |................|
00005b40  00 fd fc fc 00 fe ff fd  fb fb 00 fe 00 fd fc fb  |................|
00005b50  00 fe 00 fd fb fc 00 02  ff ff ff fe 00 fd ff fe  |................|
00005b60  00 fe 00 fd ff ff 00 fe  00 fd ff fe 00 fe 00 fe  |................|
00005b70  ff ff 00 fe 13 00 f0 d6  c6 af a0 86 13 00 f0 d6  |................|
00005b80  c6 af a0 86 13 00 f0 d8  c6 af 9e 86 13 00 f0 d8  |................|
00005b90  c6 af 9e 86 13 00 f3 d8  c8 b1 a0 87 13 00 f3 d8  |................|
00005ba0  c8 b1 a0 87 10 00 ed d5  c5 ac 9b 84 10 00 ed d5  |................|
00005bb0  c5 ac 9b 84 10 00 f0 d5  c5 ad 9e 86 10 00 f0 d5  |................|
00005bc0  c5 ad 9e 86 10 00 f0 d8  c5 af 9e 88 10 00 f0 d8  |................|
00005bd0  c5 af 9e 88 0e 00 ee d6  c3 ad 9d 83 0e 00 ee d6  |................|
00005be0  c3 ad 9d 83 10 00 f0 da  c7 ac 9e 84 10 00 f0 da  |................|
00005bf0  c7 ac 9e 84 0f 00 ef d5  c6 ae 9f 86 0f 00 ef d5  |................|
00005c00  c6 ae 9f 86 10 00 ef da  c7 ae 9d 85 10 00 ef da  |................|
00005c10  c7 ae 9d 85 0f 00 ef d5  c7 af 9f 86 0f 00 ef d5  |................|
00005c20  c7 af 9f 86 10 00 f0 d8  c6 ae 9d 87 10 00 f0 d8  |................|
00005c30  c6 ae 9d 87 12 00 ef d7  c8 af 9f 89 12 00 ef d7  |................|
00005c40  c8 af 9f 89 10 00 ef d7  c7 ae a0 88 10 00 ef d7  |................|
00005c50  c7 ae a0 88 10 00 f1 d9  ca b0 a1 89 10 00 f1 d9  |................|
00005c60  ca b0 a1 89 10 00 ee d8  c9 b0 a0 86 10 00 ee d8  |................|
00005c70  c9 b0 a0 86 ff 00 fe 00  04 06 02 01 ff 00 fe 00  |................|
00005c80  04 06 02 01 03 00 fc fa  fa fb f5 f6 03 00 fc fa  |................|
00005c90  fa fb f5 f6 02 00 fc f9  f9 fa f3 f3 02 00 fc f9  |................|
00005ca0  f9 fa f3 f3 01 00 fc f9  f8 f9 f2 f2 01 00 fc f9  |................|
00005cb0  f8 f9 f2 f2 00 00 fd fa  fa fa f5 f4 00 00 fd fa  |................|
00005cc0  fa fa f5 f4 00 00 fe fc  fd fd f8 fa 00 00 fe fc  |................|
00005cd0  fd fd f8 fa ff 00 fe fc  fd fe f7 f9 ff 00 fe fc  |................|
00005ce0  fd fe f7 f9 fe 00 fe fc  ff fe f9 f7 fe 00 fe fc  |................|
00005cf0  ff fe f9 f7 ff 00 ff 00  05 07 03 04 ff 00 ff 00  |................|
00005d00  05 07 03 04 01 00 fc fa  fa fb f5 f6 01 00 fc fa  |................|
00005d10  fa fb f5 f6 01 00 fd fb  fb fc f7 f7 01 00 fd fb  |................|
00005d20  fb fc f7 f7 00 00 fd fb  fc fd f8 f9 00 00 fd fb  |................|
00005d30  fc fd f8 f9 ff 00 fc fc  fc fe f7 fa ff 00 fc fc  |................|
00005d40  fc fe f7 fa 00 00 fd fd  fe 00 fd fe 00 00 fd fd  |................|
00005d50  fe 00 fd fe ff 00 fd fd  fe 01 fc fb ff 00 fd fd  |................|
00005d60  fe 01 fc fb ff 00 fd fe  00 01 fe 04 ff 00 fd fe  |................|
00005d70  00 01 fe 04 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00005d80  de e8 f4 00 0c d9 e1 ed  f5 01 09 11 00 00 00 00  |................|
00005d90  01 00 fc f9 f9 f8 f2 f3  01 00 fc f9 f9 f8 f2 f3  |................|
00005da0  02 00 fd fa fa f8 f2 f3  02 00 fd fa fa f8 f2 f3  |................|
00005db0  00 00 fd fc fc fc f7 f5  00 00 fd fc fc fc f7 f5  |................|
00005dc0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005e00  ff ff ff ff ff ff ff ff  ff ff ff ff ff ff ff ff  |................|
*
00020000

errors like:

kern.err kernel: [ 29.325310] mt7915e 0000:01:00.0: Message 000021ed (seq 2) timeout

Fri Oct  1 22:03:55 2021 kern.err kernel: [   29.325310] mt7915e 0000:01:00.0: Message 000021ed (seq 2) timeout

Fri Oct  1 22:04:35 2021 daemon.err hostapd: nl80211 driver initialization failed.

Portions of logread:

Fri Oct  1 23:45:48 2021 kern.warn kernel: [ 6142.469947] netlink: 'iw': attribute type 302 has an invalid length.
Fri Oct  1 23:45:48 2021 daemon.notice hostapd: Configuration file: /var/run/hostapd-phy1.conf (phy wlan1-1) --> new PHY
Fri Oct  1 23:46:08 2021 daemon.err hostapd: Could not set interface wlan1-1 flags (UP): Operation timed out
Fri Oct  1 23:46:08 2021 daemon.err hostapd: nl80211: Could not set interface 'wlan1-1' UP
Fri Oct  1 23:46:08 2021 daemon.notice hostapd: nl80211: deinit ifname=wlan1-1 disabled_11b_rates=0
Fri Oct  1 23:46:08 2021 daemon.err hostapd: nl80211 driver initialization failed.
Fri Oct  1 23:46:08 2021 daemon.notice hostapd: wlan1-1: CTRL-EVENT-TERMINATING
Fri Oct  1 23:46:08 2021 daemon.err hostapd: hostapd_free_hapd_data: Interface wlan1-1 wasn't started
Fri Oct  1 23:46:08 2021 daemon.notice netifd: radio1 (7150): Command failed: Invalid argument
Fri Oct  1 23:46:08 2021 kern.err kernel: [ 6163.085313] mt7915e 0000:01:00.0: Message 000007ed (seq 9) timeout
Fri Oct  1 23:46:08 2021 daemon.notice netifd: radio1 (7150): Device setup failed: HOSTAPD_START_FAILED
Fri Oct  1 23:46:08 2021 daemon.notice netifd: Wireless device 'radio1' set retry=0
Fri Oct  1 23:46:08 2021 daemon.crit netifd: Wireless device 'radio1' setup failed, retry=0
Fri Oct  1 23:46:08 2021 daemon.notice netifd: Wireless device 'radio1' is now down

Full logread from boot here:

In this state I have to power off and then power on and sometimes it works and sometimes it does not.

I am stumped - what should I try?

Faulty hardware? Back to Amazon and order replacement?

@daniel could it be this issue you mention above:

The main risk here is that you may have gotten a device where one or more erase-blocks in the very beginning of the SPI-NAND flash are broken. I did my best to make the installer also handle these cases properly (ie. relocate factory data, which has to be kept at known offset, in order to reverse/mitigate the effects of MediaTek's BMT which is used bu non-UBI OpenWrt as well as the stock firmware). I couldn't yet get hold of device having on of the first blocks broken, hence I had not chance to test this myself.
The worst-case here is a device which comes up without a valid MAC address and missing WiFi calibration. In that case you can either try to resolve things manually using the backup of the flash or use that to revert to the stock firmware.

Any thoughts or ideas @hnyman?

See:

static int
mt7915_mcu_parse_response(struct mt76_dev *mdev, int cmd,
			  struct sk_buff *skb, int seq)
{
	struct mt7915_mcu_rxd *rxd;
	int ret = 0;

	if (!skb) {
		dev_err(mdev->dev, "Message %08x (seq %d) timeout\n",
			cmd, seq);
		return -ETIMEDOUT;
	}

	rxd = (struct mt7915_mcu_rxd *)skb->data;
	if (seq != rxd->seq)
		return -EAGAIN;

	if (cmd == MCU_CMD(PATCH_SEM_CONTROL)) {
		skb_pull(skb, sizeof(*rxd) - 4);
		ret = *skb->data;
	} else if (cmd == MCU_EXT_CMD(THERMAL_CTRL)) {
		skb_pull(skb, sizeof(*rxd) + 4);
		ret = le32_to_cpu(*(__le32 *)skb->data);
	} else {
		skb_pull(skb, sizeof(struct mt7915_mcu_rxd));
	}

	return ret;
}

github.com

openwrt/mt76/blob/master/mt7915/mcu.c

// SPDX-License-Identifier: ISC
/* Copyright (C) 2020 MediaTek Inc. */

#include <linux/fs.h>
#include "mt7915.h"
#include "mcu.h"
#include "mac.h"
#include "eeprom.h"

#define fw_name(_dev, name, ...)	({			\
	char *_fw;						\
	switch (mt76_chip(&(_dev)->mt76)) {			\
	case 0x7915:						\
		_fw = MT7915_##name;				\
		break;						\
	case 0x7981:						\
		_fw = MT7981_##name;				\
		break;						\
	case 0x7986:						\
		_fw = MT7986_##name##__VA_ARGS__;		\

This file has been truncated. show original

This is with: r17648-16e83a7491

daniel · October 3, 2021, 8:53am

Lynx:

@daniel could it be this issue you mention above:

The main risk here is that you may have gotten a device where one or more erase-blocks in the very beginning of the SPI-NAND flash are broken. I did my best to make the installer also handle these cases properly (ie. relocate factory data, which has to be kept at known offset, in order to reverse/mitigate the effects of MediaTek's BMT which is used bu non-UBI OpenWrt as well as the stock firmware). I couldn't yet get hold of device having on of the first blocks broken, hence I had not chance to test this myself.
The worst-case here is a device which comes up without a valid MAC address and missing WiFi calibration. In that case you can either try to resolve things manually using the backup of the flash or use that to revert to the stock firmware.

Doesn't look like this is the case here. The hexdump of the factory partition looks alright (offsets match expectations) and if it was a problem related to calibration it would happen each and every time you boot.

As you only observe the problem on some but not every time you boot the device it has to be something else. I've heard MT7915E sometimes isn't reset durin soft-reboot and then stuck after boot, which looks more like what you are observing as well.
So first thing would be to try cold (ie. disconnect from power or using physical power switch) reset vs. warm (ie. using reboot command) reset and see if chances for Wifi to come up are any different.

Lynx · October 3, 2021, 12:01pm

Thanks. So presumably still a hardware fault? Is there something that could be done in software to help recover from such a reboot? For devices like this one with fault? Think I will just RMA this. My other two have been fine.

daniel · October 3, 2021, 12:54pm

Does the device come up with all WiFi interface reliable if cold-started (ie. connect power after it was disconnected)?

Lynx · October 3, 2021, 1:05pm

Yes it does. Seems to work perfectly and in mesh (with circa 300-400Mbit/s between it and the other two RT3200's, using 80Mhz on channel 36, albeit I could never get 160Mhz mesh to work).

The cold/warm boot issue you explain above seems to match with my experience. I wonder if this is a hardware issue with my specific device, or whether there could be a certain software state that can nudge any RT3200 into this state. Do you suppose the former? Could it start happening with time as the devices age?

In any case, could the chip be powered down and then back up in software in order to deal with this if it happens to other users to help improve reliability?

hnyman · October 3, 2021, 1:45pm

As you are running the UBI build, are you sure that you are not in the recovery mode when it starts wrongly, with the 5GHz WiFi disabled? Have you checked the /sys/fs/pstore for crash dumps? (Note that those survive warm reboot, and only cold boot clears them from RAM. Or you can manually delete them.)

Lynx · October 3, 2021, 1:48pm

So each time I checked for the existence of the /sys/fs/pstore crash dumps and they were not present. Is there another way to discern recovery mode? I think from recollection in recovery mode LuCi would show a different snapshot, and I did not see that.

I have three of these RT3200 devices in a mesh. Two are close together and this problematic one is further away but in range of both. In any case, I have consistently had problems with this one. Namely I would reboot all three and then find I could not access the web page for this one (but could access the other two fine). I would fix this by manually switching the switch at the back off, waiting a bit, and then switching it back on. And it was only when I hooked up a LAN cable to connect with my laptop that I realised the 5Ghz wifi was not coming up and saw these timeout errors.

In this state when I issued 'reboot' in ssh the issue seemed to remain. But if I power off and power on again then it seems to bring it back.

benciphered · October 3, 2021, 5:15pm

I'm encountering a problem that results in the same error posted here:

Oct  3 12:08:59 ROUTER1A kernel: [12276.321915] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000053

It happens approximately 4-10 hours. It's an issue with the SNAPSHOT r17652-21c7a8593d and also happened on r17443 from the UBI installer repo.

This morning I attempted to get pstore data but I was unable to contact the router using either it's assigned IP address, 10.10.1.1. I also didn't have any luck pinging 192.168.1.1. I will be attempting to access it through either SSH or web on default IPs/ports when it next fails and hopefully I have more data to provide.

One thing that I do note is that the router continued to access the Internet and route traffic through Ethernet. It's the WiFi (both 2.4 and 5.0 radios) that wasn't functional. The fact that it routed traffic as a gateway makes me think that it was still operating as 10.10.1.1, though I couldn't get a ping response and didn't note any traffic from that IP with Wireshark.

I've been following this thread for the past couple days as I tried to diagnose the issue. If I am able to get pstore logs/data, I'll post them here. This router is not part of a mesh or anything like that. Normal gateway router/AP only.

themeanfarmer · October 3, 2021, 6:46pm

Hello. What channel does everyone use on 5GHz? I live in an apartment with about 50 networks.

Lynx · October 3, 2021, 7:14pm

I think in that case I would consider DFS. You ought to be able to scan and monitor and see which channel segment is the least occupied.

BTW for anyone wanting more SQM / VPN performace just enable irqbalance. I noticed significantly decreased loadavg with that enabled. And I saw on another thread this router can manage 1Gbit SQM with it enabled. Actually is there any reason not to have this enabled?

Finally has anyone got 160Mhz mesh or WDS working? Or 160Mhz working well in general? As in greater throughput than 80Mhz.

benciphered · October 4, 2021, 1:35am

I haven't really tried 160MHz. Originally tried to get it working back in June/July before LuCI had full ax support and couldn't get a stable connection. So I just set it to 80MHz and kept it there until earlier today experimenting a little bit. From my quick tests it did connect at 160MHz and report as such on the status page / station list. Have no idea about performance / stability, this was just a quick test before I decided I should probably keep the router on 80MHz until this crash problem is sorted so I don't introduce any additional variables in the troubleshooting process.

Is there any documentation on channel selection in this router, especially at 160MHz? I know with some routers only certain channels work with wider bandwidth, and it doesn't seem that all routers refer to the block of channels in a standard way. I seem to recall that when specifying the channel number, some routers want you to select the low numbered channel, some want the high numbered channel, and some want you to choose the center channel. Not sure how this works with this model, going to see what I can find out about this.

benciphered · October 4, 2021, 1:37am

My router just had the oops occur again and I was finally able to access it.

The router is running r17652-21c7a8593d, and after the oops it booted into recovery SNAPSHOT r17443-90e167abaa. Accessing the router at 192.168.1.1 and default ports was successful.

I was able to obtain dmesg-ramoops-0 and dmesg-ramoops-1 which appear to have identical data about the trace, as follows:

<1>[17357.656699] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000053
<1>[17357.665498] Mem abort info:
<1>[17357.668285]   ESR = 0x96000005
<1>[17357.671331]   EC = 0x25: DABT (current EL), IL = 32 bits
<1>[17357.676646]   SET = 0, FnV = 0
<1>[17357.679690]   EA = 0, S1PTW = 0
<1>[17357.682820] Data abort info:
<1>[17357.685695]   ISV = 0, ISS = 0x00000005
<1>[17357.689521]   CM = 0, WnR = 0
<1>[17357.692480] user pgtable: 4k pages, 39-bit VAs, pgdp=0000000042bf2000
<1>[17357.698915] [0000000000000053] pgd=0000000000000000, p4d=0000000000000000, pud=0000000000000000
<0>[17357.707615] Internal error: Oops: 96000005 [#1] SMP
<7>[17357.712483] Modules linked in: xt_connlimit pppoe ppp_async nf_conncount iptable_nat xt_state xt_nat xt_helper xt_conntrack xt_connmark xt_connbytes xt_REDIRECT xt_MASQUERADE xt_FLOWOFFLOAD xt_CT pppox ppp_generic nf_nat nf_flow_table nf_conntrack_netlink nf_conntrack mt7915e mt7615e mt7615_common mt76_connac_lib mt76 mac80211 ipt_REJECT cfg80211 xt_time xt_tcpudp xt_tcpmss xt_statistic xt_recent xt_multiport xt_mark xt_mac xt_limit xt_length xt_hl xt_ecn xt_dscp xt_comment xt_TCPMSS xt_LOG xt_HL xt_DSCP xt_CLASSIFY slhc sch_cake nf_reject_ipv4 nf_log_ipv4 nf_defrag_ipv6 nf_defrag_ipv4 iptable_raw iptable_mangle iptable_filter ipt_ECN ip_tables hwmon crc_ccitt compat sch_tbf sch_ingress sch_htb sch_hfsc em_u32 cls_u32 cls_tcindex cls_route cls_matchall cls_fw cls_flow cls_basic act_skbedit act_mirred act_gact xt_set ip_set_list_set ip_set_hash_netportnet ip_set_hash_netport ip_set_hash_netnet ip_set_hash_netiface ip_set_hash_net ip_set_hash_mac ip_set_hash_ipportnet ip_set_hash_ipportip
<7>[17357.712647]  ip_set_hash_ipport ip_set_hash_ipmark ip_set_hash_ip ip_set_bitmap_port ip_set_bitmap_ipmac ip_set_bitmap_ip ip_set nfnetlink nf_log_ipv6 nf_log_common ip6table_mangle ip6table_filter ip6_tables ip6t_REJECT x_tables nf_reject_ipv6 ifb vfat fat autofs4 nls_utf8 nls_iso8859_1 nls_cp437 seqiv uas usb_storage leds_gpio xhci_plat_hcd gpio_button_hotplug
<7>[17357.831984] CPU: 1 PID: 1197 Comm: napi/phy1-9 Tainted: G S                5.10.70 #0
<7>[17357.839802] Hardware name: Linksys E8450 (UBI) (DT)
<7>[17357.844671] pstate: 60000005 (nZCv daif -PAN -UAO -TCO BTYPE=--)
<7>[17357.850677] pc : mt76_tx_status_skb_done+0x0/0x80 [mt76]
<7>[17357.855984] lr : mt7915_queue_rx_skb+0xa04/0x154c [mt7915e]
<7>[17357.861545] sp : ffffffc010ea3c20
<7>[17357.864850] x29: ffffffc010ea3c20 x28: ffffff8002375e00 
<7>[17357.870154] x27: 0000000040000000 x26: ffffff80023738d0 
<7>[17357.875458] x25: ffffff8002375ec0 x24: 0000000000000000 
<7>[17357.880762] x23: 0000000000000052 x22: ffffff800248f848 
<7>[17357.886066] x21: ffffff8000aacbb8 x20: ffffff800248f828 
<7>[17357.891370] x19: ffffff8002372800 x18: 0000000000000000 
<7>[17357.896674] x17: 0000000000000000 x16: 0000000000000000 
<7>[17357.901978] x15: 0000000000000000 x14: 0000000000000004 
<7>[17357.907282] x13: 0000000000000000 x12: 0000000000000006 
<7>[17357.912586] x11: ffffff8002374800 x10: ffffffc010a96000 
<7>[17357.917890] x9 : 0000000000000064 x8 : ffffff8002372800 
<7>[17357.923194] x7 : ffffff8002374950 x6 : 00000001001a0728 
<7>[17357.928498] x5 : fffffffeffe5f939 x4 : ffffff8002bff300 
<7>[17357.933801] x3 : ffffffc010ea3d18 x2 : ffffffc010ea3d18 
<7>[17357.939105] x1 : 0000000000000000 x0 : ffffff8002372800 
<7>[17357.944409] Call trace:
<7>[17357.946850]  mt76_tx_status_skb_done+0x0/0x80 [mt76]
<7>[17357.951808]  mt76_dma_rx_poll+0x284/0x764 [mt76]
<7>[17357.956420]  __napi_poll+0x34/0x140
<7>[17357.959900]  napi_threaded_poll+0x84/0xf0
<7>[17357.963903]  kthread+0x120/0x124
<7>[17357.967123]  ret_from_fork+0x10/0x18
<0>[17357.970694] Code: 54fffa41 d2800000 d65f03c0 d503201f (39414c24) 
<4>[17357.976778] ---[ end trace 0d2d5ed54117714c ]---

I hope this is a help. If there's anything else I could gather or try I'd be glad to.

drikus · October 4, 2021, 4:59am

These nullpointers at virtual address 0000000000000053 are seen and reported in this thread since end of July. Had multiple occurrences as well then revered to version prior end of July with OpenWrt SNAPSHOT r17114-349e2b7e65.

It is reported here
and supposedly fixed 10 days ago.
Not sure when it ends up on openwrt snapshot builds though, someone ?

hnyman · October 4, 2021, 5:09am

It pretty much depends on when @nbd updates the mt76 driver in OpenWrt to reflect the newest stuff. He has latest bumped the version here in July.

https://git.openwrt.org/?p=openwrt/openwrt.git;a=history;f=package/kernel/mt76;hb=HEAD

(Note that he committed that fix in in mt76, so he knows about it.)

benciphered · October 4, 2021, 2:10pm

Thank you for the clarification.

I saw this reported earlier in the thread and the request for pstore logs. I also followed the link to the issue on GitHub, saw the commit, and thought that meant it'd be in snapshot builds from then on. Thought it must not have been a full fix since it seemed to be the same issue. Didn't realize until you pointed it out that commit doesn't get incorporated into subsequent OpenWrt snapshots without further steps. My mistake.

I think I'll revert later today to r17114 on my main router.

bobbythomas · October 4, 2021, 2:48pm

I am also waiting for this fix to be incorporated in the nightly build as my router goes into recovery mode every 2-3 days. Thinking of writing a script to clear the pstore content and to reboot the router whenever it goes into recovery for the time being.

neheb · October 4, 2021, 8:13pm

anyone have any luck enabling 160mhz for the 5ghz interface?