Reboot loop problem at startup on gl-inet E750

Hi all,

I'm currently playing with a gl-inet E750, which was recently added to OpenWrt mainline.

I would like to report the following problem: with the default image installed, everything works fine. But once I install a few packages, especially kernel modules, the device starts rebooting in a loop at boot, until once in a while it manages to boot (and then it is stable until the next reboot). While it is looping, I can ssh into the device for a few seconds (<10) before it reboots.

My guess is that it is a problem with the hardware watchdog: installing new packages increases the boot time, so the watchdog is not serviced in time (at least most of the time; once in a while the boot succeeds).

Any idea how to solve this problem?

Thanks a lot!

Here is the dmesg output I get before I lose control of the router:

```
[  +0.003599] random: 1 urandom warning(s) missed due to ratelimiting
[Oct13 13:10] ath10k_pci 0000:00:00.0: qca9887 hw1.0 target 0x4100016d chip_id 0x004000ff sub 0000:0000
[  +0.009611] ath10k_pci 0000:00:00.0: kconfig debug 0 debugfs 1 tracing 0 dfs 1 testmode 0
[  +0.013207] ath10k_pci 0000:00:00.0: firmware ver 10.1-ct-87-__fW-022-ecad3248 api 2 features wmi-10.x,has-wmi-mgmt-tx,mfp,txstatus-noack,wmi-10.x-CT,ratemask-CT,txrate-CT,get-temp-CT,tx-rc-CT,cust-stats-CT,retry-gt2-CT,txrate2-CT,beacon-cb-CT,wmi-block-ack-CT crc32 61a92df5
[  +0.293248] ath10k_pci 0000:00:00.0: board_file api 1 bmi_id N/A crc32 546cca0d
[  +2.428989] ath10k_pci 0000:00:00.0: 10.1 wmi init: vdevs: 16  peers: 127  tid: 256
[  +0.015229] ath10k_pci 0000:00:00.0: wmi print 'P 128 V 8 T 410'
[  +0.006527] ath10k_pci 0000:00:00.0: wmi print 'msdu-desc: 1424  sw-crypt: 0 ct-sta: 0'
[  +0.008309] ath10k_pci 0000:00:00.0: wmi print 'alloc rem: 25560 iram: 24780'
[  +0.040491] ath10k_pci 0000:00:00.0: htt-ver 2.1 wmi-op 2 htt-op 2 cal eeprom max-sta 128 raw 0 hwcrypto 1
[  +0.010428] ath10k_pci 0000:00:00.0: NOTE:  Firmware DBGLOG output disabled in debug_mask: 0x10000000
[  +0.110242] ath: EEPROM regdomain sanitized
[  +0.000012] ath: EEPROM regdomain: 0x64
[  +0.000003] ath: EEPROM indicates we should expect a direct regpair map
[  +0.000021] ath: Country alpha2 being used: 00
[  +0.000003] ath: Regpair used: 0x64
[  +0.122683] ath: EEPROM regdomain sanitized
[  +0.000012] ath: EEPROM regdomain: 0x64
[  +0.000003] ath: EEPROM indicates we should expect a direct regpair map
[  +0.000023] ath: Country alpha2 being used: 00
[  +0.000003] ath: Regpair used: 0x64
[  +0.012380] ieee80211 phy1: Selected rate control algorithm 'minstrel_ht'
[  +0.002390] ieee80211 phy1: Atheros AR9531 Rev:2 mem=0xb8100000, irq=13
[  +0.034955] kmodloader: done loading kernel modules from /etc/modules.d/*
[ +12.516702] br-lan2: port 1(eth0.10) entered blocking state
[  +0.005845] br-lan2: port 1(eth0.10) entered disabled state
[  +0.006153] device eth0.10 entered promiscuous mode
[  +0.005095] device eth0 entered promiscuous mode
[  +0.055105] br-lan: port 1(eth0) entered blocking state
[  +0.005479] br-lan: port 1(eth0) entered disabled state
[  +1.939532] eth0: link up (100Mbps/Half duplex)
[  +0.021721] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
[  +0.006736] br-lan: port 1(eth0) entered blocking state
[  +0.005458] br-lan: port 1(eth0) entered forwarding state
[  +0.094525] IPv6: ADDRCONF(NETDEV_CHANGE): br-lan: link becomes ready
[  +0.007034] IPv6: ADDRCONF(NETDEV_CHANGE): eth0.2: link becomes ready
[  +0.007025] br-lan2: port 1(eth0.10) entered blocking state
[  +0.005817] br-lan2: port 1(eth0.10) entered forwarding state
[  +0.185470] IPv6: ADDRCONF(NETDEV_CHANGE): br-lan2: link becomes ready
[  +2.369502] br-lan2: port 2(wlan1) entered blocking state
[  +0.005710] br-lan2: port 2(wlan1) entered disabled state
[  +0.006012] device wlan1 entered promiscuous mode
[  +0.079414] IPv6: ADDRCONF(NETDEV_CHANGE): wlan1: link becomes ready
[  +0.006849] br-lan2: port 2(wlan1) entered blocking state
[  +0.005636] br-lan2: port 2(wlan1) entered forwarding state
[  +4.247692] ath10k_pci 0000:00:00.0: 10.1 wmi init: vdevs: 16  peers: 127  tid: 256
[  +0.015239] ath10k_pci 0000:00:00.0: wmi print 'P 128 V 8 T 410'
[  +0.006887] ath10k_pci 0000:00:00.0: wmi print 'msdu-desc: 1424  sw-crypt: 0 ct-sta: 0'
[  +0.008331] ath10k_pci 0000:00:00.0: wmi print 'alloc rem: 25560 iram: 24780'
[  +0.069077] ath10k_pci 0000:00:00.0: pdev param 0 not supported by firmware
[  +0.013063] ath10k_pci 0000:00:00.0: rts threshold -1
[  +0.063277] br-lan: port 2(wlan0) entered blocking state
[  +0.005598] br-lan: port 2(wlan0) entered disabled state
[  +0.005897] device wlan0 entered promiscuous mode
[  +4.115718] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[  +0.006830] br-lan: port 2(wlan0) entered blocking state
[  +0.005561] br-lan: port 2(wlan0) entered forwarding state
[Oct13 13:11] br-lan: port 2(wlan0) entered disabled state
[  +0.005685] br-lan: port 1(eth0) entered disabled state
[  +0.101890] br-lan: port 1(eth0) entered disabled state
[  +0.023796] device wlan0 left promiscuous mode
[  +0.004818] br-lan: port 2(wlan0) entered disabled state
[  +0.746361] ath10k_pci 0000:00:00.0: mac flush null vif, drop 0 queues 0xffff
[  +0.163185] br-lan2: port 2(wlan1) entered disabled state
[  +0.005724] br-lan2: port 1(eth0.10) entered disabled state
[  +0.024866] device wlan1 left promiscuous mode
[  +0.004931] br-lan2: port 2(wlan1) entered disabled state
[  +0.010672] device eth0.10 left promiscuous mode
[  +0.004908] device eth0 left promiscuous mode
[  +0.004721] br-lan2: port 1(eth0.10) entered disabled state
[  +0.083229] eth0: link down
[  +6.578869] br-lan2: port 1(eth0.10) entered blocking state
[  +0.005844] br-lan2: port 1(eth0.10) entered disabled state
[  +0.006163] device eth0.10 entered promiscuous mode
[  +0.005095] device eth0 entered promiscuous mode
[  +0.463057] br-lan: port 1(eth0) entered blocking state
[  +0.005476] br-lan: port 1(eth0) entered disabled state
[  +1.548250] eth0: link up (100Mbps/Half duplex)
[  +0.039462] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
[  +0.006775] br-lan: port 1(eth0) entered blocking state
[  +0.005464] br-lan: port 1(eth0) entered forwarding state
[  +0.106009] IPv6: ADDRCONF(NETDEV_CHANGE): br-lan: link becomes ready
[  +0.007003] IPv6: ADDRCONF(NETDEV_CHANGE): eth0.2: link becomes ready
[  +0.007113] br-lan2: port 1(eth0.10) entered blocking state
[  +0.005820] br-lan2: port 1(eth0.10) entered forwarding state
[  +0.212078] IPv6: ADDRCONF(NETDEV_CHANGE): br-lan2: link becomes ready
[  +2.997284] br-lan2: port 2(wlan1) entered blocking state
[  +0.005649] br-lan2: port 2(wlan1) entered disabled state
[  +0.006046] device wlan1 entered promiscuous mode
[  +0.127551] br-lan2: port 2(wlan1) entered disabled state
[  +6.987429] ath10k_pci 0000:00:00.0: unable to read from the device (-145)
[  +0.007135] ath10k_pci 0000:00:00.0: could not read board ext data addr (-145)
[  +0.007489] ath10k_pci 0000:00:00.0: could not push board ext data (-145)
[  +0.007028] ath10k_pci 0000:00:00.0: failed to download calibration data from EEPROM: -145
```
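To make the interesting parts of a log like this easier to spot, here is a minimal sketch that scans relative-timestamp dmesg output for long pauses and for messages ending in a negative error code. It assumes lines in the `[  +s.uuuuuu] message` format shown above; the function name `scan_dmesg` and the 5-second threshold are made up for illustration. As far as I know, errno 145 on MIPS corresponds to ETIMEDOUT, which would mean the read from the card timed out, but treat that as an assumption to verify.

```python
import re

# Matches relative-timestamp lines such as "[  +0.007135] message".
LINE_RE = re.compile(r"^\s*\[\s*\+(\d+\.\d+)\]\s+(.*)$")
# Matches a negative errno at the end of a message: "(-145)" or ": -145".
ERRNO_RE = re.compile(r"(?:\(|:\s)(-\d+)\)?\s*$")


def scan_dmesg(text, gap_threshold=5.0):
    """Return (gaps, errors) found in relative-timestamp dmesg output.

    gaps   -- list of (delta_seconds, message) with delta >= gap_threshold
    errors -- list of (errno, message) for messages ending in a negative errno
    """
    gaps, errors = [], []
    for line in text.splitlines():
        m = LINE_RE.match(line)
        if not m:
            continue  # absolute markers like "[Oct13 13:10]" carry no delta
        delta, msg = float(m.group(1)), m.group(2)
        if delta >= gap_threshold:
            gaps.append((delta, msg))
        e = ERRNO_RE.search(msg)
        if e:
            errors.append((int(e.group(1)), msg))
    return gaps, errors


# Last lines of the log above, used as sample input.
sample = """\
[  +0.127551] br-lan2: port 2(wlan1) entered disabled state
[  +6.987429] ath10k_pci 0000:00:00.0: unable to read from the device (-145)
[  +0.007135] ath10k_pci 0000:00:00.0: could not read board ext data addr (-145)
[  +0.007028] ath10k_pci 0000:00:00.0: failed to download calibration data from EEPROM: -145
"""

gaps, errors = scan_dmesg(sample)
for delta, msg in gaps:
    print(f"gap {delta:.3f}s before: {msg}")
for code, msg in errors:
    print(f"errno {code}: {msg}")
```

On the sample lines this flags the roughly 7-second pause right before the first ath10k read failure, which is where the second boot-time bring-up of the card appears to go wrong.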

Looks like the radio calibration (ART) partition is gone?

If I disable wifi, the problem no longer appears.
It also works if I disable only the "pci0000:00/0000:00:00.0" wireless card in /etc/config/wireless.
So it clearly looks like a wifi-related problem.
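
For anyone wanting to try the same workaround, this is roughly what disabling only that card might look like in /etc/config/wireless. The section name `radio0` and the `type` line are assumptions; only the path comes from this thread, so match it against the `option path` lines in your own file.

```
# Hypothetical section name; find the wifi-device whose path matches.
config wifi-device 'radio0'
	option type 'mac80211'
	option path 'pci0000:00/0000:00:00.0'
	# Keeps this radio from being brought up at boot.
	option disabled '1'
```

The other radio's wifi-device section stays untouched, so its wifi keeps working.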