@ACwifidude - related to the NSS core scaling that @Ansuel asked, yes, I have enabled it rebuilding your branch.
Below, my observations:
- it was rebooting during torrenting a large file (I chose a Fedora ISO from https://torrent.fedoraproject.org/ - the 4.5Gb one).
- it was also always rebooting during a large rsync (I use rsync to copy data form laptop - via WiFi - to a wired server).
However, out of the 7-8 times I got it to crash - only 3 times it did produce oops logs (two are above) but - more worryingly, it never did during the rsync reboots.
Which led me more and more to @Ansuel idea that it may be voltage related - too random the behavior. Unfortunately, there were not many discussions about the voltages - and it seems a bit of a black magic (meaning, NSS voltages seemed to have been extracted from the stock firmware is doing - and also - questions about how is the router upping the frequencies of the NSS cores and the respective voltage?).
For example, I found this from Kernel 4.9, https://github.com/dissent1/r7800/commit/a5624ef53e9aea7a3e86748fa3e967735436d984.patch and also IPQ806x NSS Drivers - #642 by robimarko and few more places like in the Netgear R7800 exploration (IPQ8065, QCA9984) - #2473 by Ansuel, etc.
However, using your build and trying to piece together various bits of infos from the discussions above, I discovered that cat /sys/kernel/debug/regulator/regulator_summary
never shows the 1150 value for s1b on my router even of the NSS core was running at 800MHz, it was only showing 1100. But this again, made me wonder, is it because the dummy regulator doesn't properly output the voltage and maybe the cores are actually getting 1150000 ? There was a discussion between @Ansuel and robimarko where @robimarko was also not seeing the 1150000 after some line was removed from the code: IPQ806x NSS Drivers - #646 by robimarko. And what is the difference between L2 cache scaling w/voltage and NSS core voltage ? Too many unknowns for me ...
However, the file 083-ipq8064-dtsi-additions.patch
does add this line:
+qcom,l2-volt = <1100000 1100000 1150000>;
To me it felt a little wrong, why also 1100000 for the intermediate frequency ? - the intermediate frequency is closer to the maximum, than to the minimum.
Thus, I decided to bump the voltage level for the intermediate frequency - I changed the patch to:
+qcom,l2-volt = <1100000 1150000 1150000>;
Rebuilt all, router rebooted only once just after image was flashed (rebooted while the 5GHz radio had to come on) - but booted fine after. However, it did not experience any reboots for the Fedora torrent re-download, and it finished rsync-ing it few times to the server! Temperatures still look right on the router. However, I still did not see the 1150 output in the regulator_summary
...
I don't dare to hope, maybe is placebo patch, I will come back in few days after few more rsyncs, etc
LE: to put more stress on the voltage regulator, I am also running ondemand
for the governor, instead of performance
- so the stability for the torrent download / and large rsyncs (copying the Fedora .iso to the server and just renaming it to copy it again, etc) was both with ondemand
as well as NSS core frequency scaling enabled.