Hi all strange behavior here:
==== Configuration =====
I have a linksys EA7500 v1 (running OpenWRT 23.05.0) connected to my main router (Tomato, running the DHCP server for two network segments) which in turn is connected to my ISP router on a bridged port.
The EA7500 is a WIFI access point (WDS) for 2.4 and 5.1 GHz.
I have 4 other routers (all running OpenWRT 23.05.0) which are connected as client (WDS) router to the EA7500, two to the 5.1 and two to the 2.4 GHz Wifis.
All 4 clients routers get a proper IP according to the configuration of the Tomato router.
The client routers are configured to have a lan bridge. The wifi as well as all NICs are in the lan bridge and the lan bridge is in the LAN firewall zone (there is no other firewall zone). I am aware that the wifi devices cannot be added to the lan bridge in the device section of the interface, but in the wifi section in luci.
DHCP server is switched off on all clients and the EA7500 as well. Firewall-SW and DHCP-server are still on the clients and not blocked, just switched off resp. not used.
==== Expected behavior ====
If I connect any network station (Laptop. PC, .....) to any of the sockets at any client router, It is expected to enter the lan zone, a DHCP request will be forwarded to the DHCP Server, and an answer with a DHCP Configuration will be received as reply. The network station configures its NIC according to the reply and is then connected to the Network. Traffic goes to other stations in the network segment or the gateway and into the internet.
In total I have 5 network stations connected to the four client routers (ok 4 wifi client routers and 5 network stations is a bit strange, but they are not all connected. 2 client routers are just spare for tests and if problems occur on the ones where network stations are connected to the sockets. The client routers have detachable antennas and I can sent/receive on longer distance/thicker walls than using just the build-in wifi device of the connected network stations)
==== Observed behaiour ======
Strange enough, it is working as expected only on three of the four client routers. On just one of the client routers, connected network stations do not receive the DHCP configuration from the DHCP server.
I installed tcpdump on the malfunctioning client router and see:
root@Linksys-E5400-CaiSec:/var# tcpdump | grep -v STP | grep -v ARP
tcpdump: verbose output suppressed, use -v[v]... for full protocol decode
listening on eth0, link-type EN10MB (Ethernet), snapshot length 262144 bytes
13:16:34.453420 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:34.453537 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:34.453561 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:34.476398 IP 172.16.6.1.67 > 172.16.6.11.68: BOOTP/DHCP, Reply, length 324
13:16:36.453026 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:36.453141 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:36.453167 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:36.457274 IP 172.16.6.1.67 > 172.16.6.11.68: BOOTP/DHCP, Reply, length 315
13:16:39.186370 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:39.186492 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:39.186518 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 54:ee:75:d2:f6:37 (oui Unknown), length 291
13:16:39.189113 IP 172.16.6.1.67 > 172.16.6.11.68: BOOTP/DHCP, Reply, length 315
Note:
- I skipped ARP and STP lines of the dump using grep.
- The malfunctioning client router is a Linksys-E5400.
- The device of the network station which is requesting the DHCP config has the MAC 54:ee:75:d2:f6:37
- The DHCP Server has the fixed IP 172.16.6.11 linked to the MAC 54:ee:75:d2:f6:37
- As from the protocol above, the reply with the correct IP is sent from the DHCP Server into the lan bridge on the malfunctioning client router, but not forwarded to the network station.
Question: Why does the reply from the DHCP Server not reach the network station that requested it????
What I tried:
a) other device/MAC requesting DHCP config - same result
b) other cable - same result
c) switched off 2 of the client routers - same result
I got the expected result with other client routers - tested: Linksys EA7500 as client router also (I have two of these), Linksys EA6500, TP-Link WR1043ND.
I checked the configs of the working client routers config in luci against the malfunctioning for diffs - none found (one of the working client routers has no switch - but the other have and everything is working perfect)
Has anyone come over such problems earlier or is it an issue of the client (E5400)?
I also tried a bit of switching the Wifi between 2.4 and 5.1 and found, that I should take care to avoid loops/backpaths, because if I have those, i.e. two connections between stations in the network, the STP goes indefinitely deep and everything else is blocked. So second question, less important: Where to switch off the STP?
Looking for some advice.....