SQM (Smart Queue Management)
OpenWrt supports SQM for mitigating bufferbloat, the undesirable latency that arises when your router buffers too much data.
Overview
Bufferbloat is most evident when a connection is heavily loaded with downloads or uploads. It causes increased latency (ping), resulting in poor performance for realtime apps like VoIP, video chat, lag in online gaming, and generally makes the internet less responsive. This can be mitigated using SQM with a fast enough CPU and small reduction to peak throughput.
SQM is an integrated system that performs per-packet/per-flow network scheduling, active queue management (AQM), traffic shaping, rate limiting, and QoS prioritization. In comparison, “classic” AQM only manages queue length and “classic” QoS only does prioritization.
SQM is heavily CPU dependent. Slower devices may be unable to keep up with your connection speed.
SQM is incompatible with hardware flow offloading which bypasses part of the kernel as per this thread. Be sure that is disabled in LuCI → Network → Firewall to use SQM.
Preparation: Measure Your Current Speed and Latency
Before you can optimize your network, you need to know its current state.
- Note if you are using this OpenWrt device as an Extender, Repeater, or Bridge, test your upstream router (OpenWrt or otherwise) and determine if an issue is present there first.
- Note if you are on a wireless AP, test your upstream router separately. If your wireless AP wifi driver supports AQL limits (e.g. mt76 does) adjust/reduce those to improve wifi latency.
Installation
Install luci-app-sqm
(or sqm-scripts
if you don't use LuCI) and read below.
In LuCI go to Network → SQM QoS:
- In the Basic Settings tab:
- Check the Enable box
- Set the Interface to your internet (WAN) link in the dropdown. Check Network → Interfaces if you need to determine your WAN port.
- Set the Download and Upload speeds to 90% of what you measured in Preparation
- In the Queue Discipline tab:
- In the Link Layer Adaptation tab, select your link type (setting mpu is optional see note 3):
- VDSL - choose Ethernet, and set overhead 34 (or 26 if you're not using PPPoE) (mpu 68). If the link uses 100 Mbps Ethernet set overhead 42 (mpu 84).
- DSL of any other type - choose ATM, and set overhead 44 (mpu 96).
- DOCSIS Cable - choose Ethernet, and for rates < 760 Mbps set overhead 22 (mpu 64), for rates >= 760 Mbps set overhead 42 (mpu 84).
- Fiber - choose Ethernet, and set overhead 44 (mpu 84).
- Ethernet - choose Ethernet, and set overhead 44 (mpu 84).
- If unsure - it's better to overestimate, choose Ethernet, and set overhead 44 (mpu 96).
- Click Save & Apply.
Done! You can confirm results by re-running the speedtest. Any increased ping during download/uploads will now be minimal.
Results
As an example, the user below is running OpenWrt 23.05 on a WRT32X router. The internet connection is a DOCSIS cable modem with 500/35 Mbit service. Note this ISP includes over-provisioning. SQM Cake was selected with 90% dl/ul limits on baseline speedtest values. Latency increase under load dropped to zero, lower ping with no packet loss is observed during VoIP and online gaming during heavy internet usage. The user's speedtest results with SQM and summary of tests below:
Speedtest Results | |||||||
---|---|---|---|---|---|---|---|
QoS | Download | Upload | Unloaded Ping | DL Latency | UL Latency | Quality grade | Bufferbloat grade |
None | 532 Mbits | 37 Mbits | 12 ms | +18 ms | +38 ms | B | B |
SQM | 495 Mbits | 28 Mbits | 12 ms | +0 ms | +0 ms | A+ | A + |
A Little More Tuning
1. The steps above will handle latency well but you can improve this further with these steps:
- Increase the Download speed limit setting and retest until bufferbloat latency begins to increase, then go back to a slightly lower value.
- Repeat the same for the Upload speed.
- Test this using Waveform speedtest to achieve A+ quality and A+ bufferbloat grades when optimal settings are found.
- For DSL, the experiments above may produce download/upload values that are actually higher than the original speed test. This is ok, the ATM framing bytes of a DSL link add an average of 9% overhead, and these settings tell SQM how to make up for that overhead.
- For DOCSIS cable, some providers trick speed tests by adding 10% over-provisioning for the first 10 seconds (so speed tests look better!).
2. Cake is the preferred algorithm as it is excellent at mitigating bufferbloat. However, Fq_codel is often a faster, albeit less comprehensive option. One user found fq_codel gave about 15% higher throughput when CPU limited and this email thread showed similar results.
3. To set your link mpu read SQM Details and SQM setting question. Setting mpu will ensure rate shaping is correct for small packets.
4. See SQM configuration for advanced settings.
5. Consider Cake parameters: https://man7.org/linux/man-pages/man8/tc-cake.8.html