OpenWrt supports SQM for mitigating bufferbloat, the undesirable latency that arises when your router buffers too much data.
Bufferbloat is most evident when a connection is heavily loaded with downloads or uploads. It causes increased latency (ping), resulting in poor performance for realtime apps like VoIP, video chat, lag in online gaming, and generally makes the internet less responsive. This can be mitigated using SQM with a fast enough CPU and small reduction to peak throughput.
SQM is an integrated system that performs per-packet/per-flow network scheduling, active queue management (AQM), traffic shaping, rate limiting, and QoS prioritization. In comparison, “classic” AQM only manages queue length and “classic” QoS only does prioritization.
SQM is heavily CPU dependent. Slower devices may be unable to keep up with your connection speed.
SQM is incompatible with hardware flow offloading which bypasses part of the kernel as per this thread. Be sure that is disabled in LuCI → Network → Firewall to use SQM.
Before you can optimize your network, you need to know its current state.
Install luci-app-sqm
(or sqm-scripts
if you don't use LuCI) and read below.
In LuCI go to Network → SQM QoS:
Done! You can confirm results by re-running the speedtest. Any increased ping during download/uploads will now be minimal.
As an example, the user below is running OpenWrt 23.05 on a WRT32X router. The internet connection is a DOCSIS cable modem with 500/35 Mbit service. Note this ISP includes over-provisioning. SQM Cake was selected with 90% dl/ul limits on baseline speedtest values. Latency increase under load dropped to zero, lower ping with no packet loss is observed during VoIP and online gaming during heavy internet usage. The user's speedtest results with SQM and summary of tests below:
Speedtest Results | |||||||
---|---|---|---|---|---|---|---|
QoS | Download | Upload | Unloaded Ping | DL Latency | UL Latency | Quality grade | Bufferbloat grade |
None | 532 Mbits | 37 Mbits | 12 ms | +18 ms | +38 ms | B | B |
SQM | 495 Mbits | 28 Mbits | 12 ms | +0 ms | +0 ms | A+ | A + |
1. The steps above will handle latency well but you can improve this further with these steps:
2. Cake is the preferred algorithm as it is excellent at mitigating bufferbloat. However, Fq_codel is often a faster, albeit less comprehensive option. One user found fq_codel gave about 15% higher throughput when CPU limited and this email thread showed similar results.
3. To set your link mpu read SQM Details and SQM setting question. Setting mpu will ensure rate shaping is correct for small packets.
4. See SQM configuration for advanced settings.
5. Consider Cake parameters: https://man7.org/linux/man-pages/man8/tc-cake.8.html