I remember that there was also a thread specifically for R7800 related to the KONG firmeware, where also discussion in regards to SQM were found. Some reasons were CPU as bottleneck. However, that was on the old firmware and there are several users with the same router getting much better results than me. And somehow I cannot find this thread anymore.
Some general comments:
Your asymmetric 129/6 connection is one basic problem. 21 to 1 ratio means that with the full download speed, much of your upload bandwidth goes likely just to protocol traffic.
In that speedy test, you bufferbloat was huge, and latency during download averaged 550-600 ms, which makes internet really sluggish.
Using SQM to restrict upload, even that small amount, likely causes part of that drop. You might test going a bit up from 95% to 97% or so.
Also test the normal "simple.qos" instead of the "simplest.qos".
And try also cake.qos with piece_of_cake qdisc.
Ps. is that "per packet overhead" right (and needed) for you?
thanks very much for your support. I tried your suggestions. The most difficult thing was to restart the speedtest so many times, until three server locations were sucessfully selected so that 16 streams were used. Please find below my results:
in sqm, for cake (only) there are a few parameters not well supported by the gui. if you are natting, add nat. adding ack-filter makes sense given a >15x1 up/down ratio. I don't use the overhead parameter if it is cable, just the docsis keyword.
You results are pretty identical. You reach something 115/5.3 with A+ on all.
Good, pretty much the maximum that you can achieve.
Just use the qdisc that feels best (and gives lowest CPU load). I guess that cake might be a bit more CPU intensive, but with your pretty low upload speed it should be manageable.
I've had similar issues on the R7800 with SQM cake enabled. Since I don't seem to have much bufferbloat on the download side, just on the upload (DOCIS 3.0), I disabled shaping on the download by setting bandwidth to 0. Now I get full speed on the speed test for download test, and very little bloat on the upload test. I was not aware of the ack-filter. I'll try disabling overhead and enabling the docsis options.
What modem do you use? Accidentally one with the notorious intel puma6/7 bug? I ask because I see weird >100 ms spikes in the idle and in the sqm shaped results (and these can be caused, by measuring over wifi, or by the puma6 bug)...
Thanks for the advise. Even I read the first blog article, I don't really understand what it does How would a measure the effects this setting does? Should I give it a try, although the CPU is much more used by cake (see below)?
So I measured the %idle value via "top -d l"
no SQM: ~80% down / ~95% up
fq_codel/simplest: ~80% down / ~93% up
fq_codel/simple: ~80% down / ~93% up
cake/piece_of_cake.qos: ~40% down / ~93% up
Would you then recommend the 2nd or 3rd option?
I'm using a rather old "Cisco EPC3208 EuroDocsis 3.0 2-PORT EMTA" with firmware e3200-E10-5-v302r125562-130611c_upc.
I don't see it listed on that bug page.
I would. I have noticed earlier that cake is CPU intensive, so I highlight the CPU issue. High CPU load from qdisc might slow down the other functions of the router.
But in the end it depends on your use case, the download practices and the other load for the router. Just try both for some time and pick the one that feels best...
Personally I use simple/fq_codel, but I have something like 190/19 Mbit, so my SQM needs are rather modest (with normal browsing & office apps).
In your last result, I'm pretty sure the bufferbloat test component of the dslreports test has been failing for a lot of people, lately, and I'm not sure why. It could be the link getting slammed so hard when you are not also inbound shaping. I'd be pretty interested in cakes' behavior with ack-filter on on the uplink and no download shaping....
Analysis: Middle of the afternoon download speed is a little below normal due to usual increased network load. Cake is grouchy about software offloading and/or 0 for download.
if someone would toss at least $4k/month into https://www.patreon.com/dtaht I'd get back more F/T in fixing bloat issues. Living in yurt got old multiple years back. Terribly chilly in the winters.
Another useful feature all these offload engines could would be a programmable completion interrupt for tx. You'd set that to what you would shape to, and completely eliminate the cpu cost of outbound shaping by treating it (with backpressure) as, (for example) a 35Mbit/sec interface.
There are a few ethernet devices that can do this, including one from intel, but I don't rmeember which one.
The use case for a symmetrical line is to work with the cloud (storage in particular): I have to upload/download a lot of data often. With fq_codel/simplest_tbf this router cannot reliably shape more than 200Mbps on the uplink on a good day (download is not shaped), so I am loosing a lot of bulk upload bandwidth in order to be able to have several video calls going through the router at the same time.
Having said that, this router might be the wrong tool for the job to begin with.
What does dsl speedtest look like with 250000 / 250000 with aggressive CPU settings (or performance governor) with zero overhead and no advanced SQM settings?