SQM and HTB hardware offload

Since kernel 5.12, the HTB qdisc has had an API for hardware offload. Mellanox already uses this feature.

Would it make sense to try to implement this for some of our targets? Using hardware HTB in combination with software fq_codel might allow us to do SQM at higher rates.
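
To make the idea concrete, here is a rough sketch of what such a setup might look like from userspace. It only builds and prints the commands and does not run them. The device name `eth0`, the rate `900mbit`, the class id `1:10` and the helper name `offload_sqm_commands` are placeholders I made up for illustration. Whether a given driver accepts HTB in offload mode, and whether it lets fq_codel be attached as the leaf qdisc, is an assumption that would need testing on each target.

```python
import shlex
from typing import List


def offload_sqm_commands(dev: str, rate: str) -> List[List[str]]:
    """Build (but do not run) the commands for the proposed setup.

    Hypothetical helper: 'dev', 'rate' and the class id 1:10 are
    placeholders, and driver support for each step is an assumption.
    """
    return [
        # Ask the NIC driver to enable TC hardware offload.
        ["ethtool", "-K", dev, "hw-tc-offload", "on"],
        # Root HTB qdisc in offload mode; unclassified traffic goes to 1:10.
        ["tc", "qdisc", "replace", "dev", dev, "root", "handle", "1:",
         "htb", "default", "10", "offload"],
        # One shaping class at the target rate (the part done in hardware).
        ["tc", "class", "add", "dev", dev, "parent", "1:", "classid", "1:10",
         "htb", "rate", rate, "ceil", rate],
        # Software fq_codel as the leaf qdisc under the shaped class.
        ["tc", "qdisc", "add", "dev", dev, "parent", "1:10", "fq_codel"],
    ]


if __name__ == "__main__":
    for cmd in offload_sqm_commands("eth0", "900mbit"):
        print(shlex.join(cmd))
```

The intent is that the rate limiting, which is the expensive part of SQM on slow CPUs, moves to the NIC while fq_codel keeps handling flow queueing and AQM in software.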