Hi,
I am reaching for help from the community as it's been years now that I can not fix a problem with I think a point to point link with WDS.
My setup :
in zone A :
A main router connected to internet
wired to a router configured as WDS AP
in a distant zone B :
router as WDS client wired to a network with other equipments
To all of those routers are connected other routers, AP, equipments
I would prefer keeping everything on the same L2 layer for convenience.
All routers are atheros based with openwrt latest stable version.
This setup works pretty well, although from time to times every other day, or sometimes several times per day, the network is lagging and impossible to ping from A to B. This can last a couple of minutes or hours or never recover.
Machines in zone A can ping each other, Machine in zone B can ping each other.
With iwinfo i still have a link with expected throughput: 45.7 MBit/s
Load is low on all routers. No errors but dropped packets on ifconfig but can not tell why or if this normal or not.
Weird thing is that I have cam from B that I can still see streams from A (bandwitdh is about 10Mb/s less than wifi bandwidth, problem occurs also with cams off )
with tcpdump I can see packets going out from AP WDS but I do not see them coming on Client WDS.
If I restart wifi on AP WDS, everything works fine.
Any idea, suggestion or way to troubleshoot this ?
I could not find what could trigger this problem, maybe with the use of spotify and sonos like device ( multicast packets ? )
I used to have wifi hanging with wpa group key renewal, I disabled group key and this problem seems to have disappeared.
Thanks for your help !
What is the distance between the two zones?
What are the obstacles between them?
Are you in a residential zone, or some remote location?
Have you taken in consideration interference from other WLANs?
Are you using a fixed channel or auto?
What are the signal and noise levels on both sides?
There is 125m between wireless endpoints. no permanent obstacles.
I am in a remote location with really no other wifi nearby.
The point to point link is on channel 6 HT, my other AP are on channel 1 or 11 to prevent overlap.
I tried the CF-E110N which has a directive antenna of claimed 11dbi but roughly got the same signal level as with a classical TP-LINK WR1040 or WR841N.
output of iwinfo wlan0 assoclist that do not really change if network is lagging or not :
From my experience the signal is quite weak for just 125m distance. But this shouldn't be such a problem. And the directive antenna should give way better signal than the stock omnidirectional antenna.
What I suggest is to check the connections and the cables (if any) from the router to the antenna. Also make sure the antenna aiming is optimal.
Then keep an traffic monitor program running, like iptraf to continuously monitor the traffic on the links to see if it is connected to the outages that you experience.
One more thing, on which interfaces do you see the dropped packets that you mentioned earlier?
This is an integrated antenna so this is factory made. In the past I tried to use an external yagi (cheap one) but with no improvement. I have to tell that one router is in a building cover with wood so this is not totally free space connection.
Dropped packets are on the bridge br-lan and all interfaces are on this bridge (eth0,eth1,wlan0).
There is no dropped packets on the individual interfaces.
I used to look at traffic with iftop but did not see any difference with or without the lagging
I usually configure that +10% to be on the safe side.
@zoulou
The 11dBi antennas for p2p links are not worth spending money on. Always go for the best you can get, say 20something dBi directive antenna, minus connectors and cable loss, and card transmit at minimum (usually 1mW). Still legal transmission power AND high gain on the receiver.
By the way, post the whole wireless config to have a look.
Ok, How can I tell if this is a wireless reception issue, wouldn't the rate and signal should drop in iwinfo.
I do not see any difference with and without the problem, and when its working (most of the times) I am happy with the bandwith.
The wireless config is pretty simple:
The noise might as well raise. In server side you already have noise at -86, which means that there is already something near. On the contrary in client side the band is more clear, noise at -95 is practically no noise. Also the omnidirectional antennas collect noise from all around. In contrast a good directional antenna would isolate or attenuate the noise from the side and the rear.
Regarding the configs, fix the channel on client, and add the distance option (140m)
Fingers crossed!
I think the noise comes from the router itself(CF-E110n) , I have anoher unit with same noise reading and if I put any other router at the same place I get -95dBm. Someone stated the same problem with that device. CPE COMFAST CF-E110N won't work after downgrade
I don't know if it is true reading or not, if so that kind of defeat the directionnal antenna benefit
I changed the distance option to 150m on both ends and fix channel on client side, I will check if that made any improvement.
Well same thing with the new settings.
Do you know of any means to check what can be the cause of the lagging. I have the actual bandwith 3 times less than the expected throughput stated by iwinfo.
Can a queue get full and how to check ? thx
I wouldn't trust the throughput estimation from iwinfo, as it is based only on the SNR ratio. Better run an iperf speed test between the wireless endpoints with different packetsizes and protocols. Maybe the speed is not as much as you thought.
I've ran an betterspeedtest.sh benchmark between devices on both sides of the wireless link and got 37.5Mbps donwn and 49.6Mbs up which is not so far that what iwinfo states
After a lot of tests I think also WDS has still some bugs altough it is working most of the time.
I set up the link as a mesh but it turned back to HT20 and not HT40. Is there any known limitation for that because the bandwith went down to 30Mbps.
I'll report if it is more stable
I was also planning to try relayd or GRE-tap, any advice on that.