Hello everyone. I have a question about an issue I came across with the Xiaomi 4A GE. I think there is a problem with the internal switch or the br-lan interface. I really do not know how to debug or resolve it, so I would like some ideas, tips and directions to find out what is going on.
Here is my situation (at my brother's house): I set up a simple 5 GHz mesh with three 4A-GE routers. The master node is connected to the ISP and provides WiFi on 2.4 GHz. A PC and a TV are connected to the other two nodes. I used 22.03.4/5 for a very long time and it ran great. Last week we upgraded to 23.05.2 (there were issues, which I wrote about at: Internet keeps stopping in Xiaomi 4a(gigabit edi) router with openwrt - #5 by ilija.culap). We switched to SNAPSHOT and then to 23.05.3. Every one of them had issues where one of the nodes could not connect to the internet.
Issue:
- After 12 - 24 hours
- Mesh link is okay
- No OOM
- No errors in logread or dmesg
- An already connected Ethernet client (with an IP address) can still reach the router (web and SSH)
- Cannot ping the gateway
- No new clients can connect and existing ones cannot reconnect, because DHCP runs on the master node
- ifconfig br-lan down ; ifconfig br-lan up does not help
- Resetting wifi does not help
- Reboot helps
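To capture more state the next time this happens, something like the following could run on an affected node. It is only a sketch under assumptions: the mesh interface name `phy1-mesh0` and the log path `/tmp/brlan-debug.log` are placeholders that I have not verified (check `iw dev` for the real name), and the function names are made up for illustration. It pings the default gateway once a minute and, when that fails, appends the bridge members, interface counters, mesh peers, mesh paths and ARP table to the log.

```shell
#!/bin/sh
# Sketch: log bridge and mesh state when the gateway stops answering.
# Assumptions: MESH_IF and LOG are placeholders, adjust them to the node.

MESH_IF="${MESH_IF:-phy1-mesh0}"
LOG="${LOG:-/tmp/brlan-debug.log}"

# Print the default gateway from `ip route` style text on stdin.
gateway_from_routes() {
    awk '$1 == "default" && $2 == "via" { print $3; exit }'
}

# Dump state that may show where forwarding breaks.
collect_state() {
    echo "=== $(date) ==="
    ls /sys/class/net/br-lan/brif     # current bridge members
    cat /proc/net/dev                 # per-interface packet counters
    iw dev "$MESH_IF" station dump    # mesh peers and link state
    iw dev "$MESH_IF" mpath dump      # mesh paths (if this iw build supports it)
    cat /proc/net/arp                 # ARP table
}

# Check the gateway once a minute; log state when it is missing or unreachable.
watch_gateway() {
    while :; do
        gw="$(ip route | gateway_from_routes)"
        if [ -z "$gw" ] || ! ping -c 3 -W 2 "$gw" >/dev/null 2>&1; then
            collect_state >>"$LOG" 2>&1
        fi
        sleep 60
    done
}

# Start the loop only when called as: sh brlan-watch.sh run
if [ "${1:-}" = "run" ]; then
    watch_gateway
fi
```

Started in the background with `sh brlan-watch.sh run &` (the file name is arbitrary), the log in /tmp survives until the reboot that is needed to recover, so it has to be copied off the node before rebooting.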
Here is some information about the network:
- 3 Nodes, 4A-GE, 5GHz
- Master node with DHCP and firewall. This node causes no problems
- On nodes 2 and 3: firewall, dnsmasq and odhcpd are disabled
- On nodes 2 and 3: wan port is added to br-lan
- On all nodes IPv6 is disabled for br-lan
Some files from node 2 (same as 3):
Info
{
	"kernel": "5.15.150",
	"hostname": "Culap-X2",
	"system": "MediaTek MT7621 ver:1 eco:3",
	"model": "Xiaomi Mi Router 4A Gigabit Edition",
	"board_name": "xiaomi,mi-router-4a-gigabit",
	"rootfs_type": "squashfs",
	"release": {
		"distribution": "OpenWrt",
		"version": "23.05.3",
		"revision": "r23809-234f1a2efa",
		"target": "ramips/mt7621",
		"description": "OpenWrt 23.05.3 r23809-234f1a2efa"
	}
}
Network
config interface 'loopback'
	option device 'lo'
	option proto 'static'
	option ipaddr '127.0.0.1'
	option netmask '255.0.0.0'

config globals 'globals'
	option ula_prefix 'xxxx:xxxx:xxxx::/48'
	option packet_steering '1'

config device
	option name 'br-lan'
	option type 'bridge'
	option ipv6 '0'
	list ports 'lan1'
	list ports 'lan2'
	list ports 'wan'

config interface 'lan'
	option device 'br-lan'
	option proto 'dhcp'
Wireless
config wifi-device 'radio0'
	option type 'mac80211'
	option path '1e140000.pcie/pci0000:00/0000:00:01.0/0000:02:00.0'
	option channel '1'
	option band '2g'
	option htmode 'HT20'
	option disabled '1'

config wifi-device 'radio1'
	option type 'mac80211'
	option path '1e140000.pcie/pci0000:00/0000:00:00.0/0000:01:00.0'
	option channel '40'
	option band '5g'
	option htmode 'VHT80'
	option cell_density '0'

config wifi-iface 'wifinet0'
	option device 'radio1'
	option mode 'mesh'
	option encryption 'sae'
	option mesh_id 'MX-32468034'
	option mesh_fwding '1'
	option mesh_rssi_threshold '-80'
	option mesh_gate_announcements '1'
	option mesh_hwmp_rootmode '3'
	option key 'xxxxxxxxxxxxx'
	option network 'lan'
What I tried:
- I have one spare 4A-GE that I added to my own mesh (2 x Xiaomi AX3200) so I can test and investigate. It crashed with 23.05.2 (at night), but I could not test anything because I could not add any new client (DHCP). This router is now connected to my PC.
- I thought there was an issue with DHCP renewal (12h lease time), so I reduced the lease time to 2m on my mesh, but this did not trigger the issue. I ran it for 1 hour.
- I tried to download and upload some files through the router (around 120 GB across all 3 ports), but this did not trigger it either.
My next steps are to remove the wan port from br-lan, enable "Bring up empty bridge" and let all services run on the router. The problem is that I have to wait 24h to see whether it works, so I hope someone can tell me what else I could do and how to trigger this behavior. Here is something that seems very similar: Br-lan client can not connect problem (It seems that wireless devices (wifi-iface) disconnect themselves from br-lan.)
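For reference, the bridge part of that plan could be applied over SSH roughly like this. It is a sketch, assuming br-lan is the first `config device` section (as in the network file posted above) and that the LuCI checkbox "Bring up empty bridge" corresponds to the `bridge_empty` device option. The function name `apply_bridge_test` is made up for illustration.

```shell
#!/bin/sh
# Sketch of the planned test change on node 2 or 3.
# Assumption: br-lan is the first 'config device' section in /etc/config/network.

apply_bridge_test() {
    # Take the wan port out of the bridge again.
    uci del_list 'network.@device[0].ports=wan' &&
    # Keep br-lan up even when it has no member ports.
    uci set 'network.@device[0].bridge_empty=1' &&
    # Write the staged changes to /etc/config/network.
    uci commit network
}

# Apply and restart networking only when called as: sh bridge-test.sh apply
if [ "${1:-}" = "apply" ]; then
    apply_bridge_test && /etc/init.d/network restart
fi
```

Running `uci changes network` before the commit shows what would be written, and restarting the network over SSH will drop the session for a moment.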
Thank you all in advance
Best regards