From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail.toke.dk (mail.toke.dk [45.145.95.4]) (using TLSv1.2 with cipher ADH-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by lists.bufferbloat.net (Postfix) with ESMTPS id 9E4563B29D; Fri, 17 Dec 2021 06:33:09 -0500 (EST) From: Toke =?utf-8?Q?H=C3=B8iland-J=C3=B8rgensen?= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=toke.dk; s=20161023; t=1639740788; bh=ViF9tkmOOqgXEGoyiCU0cZx7wncnXWlKGhGiv2jLKY0=; h=From:To:Cc:Subject:In-Reply-To:References:Date:From; b=lK87xlUu5dtFq9ZXDhfWkm2apy8qPZnpRLtvKRLojDx6ffSaSNvwQLvQVvWafJj69 S0SAwRpg8Zgh3smmTo2LSXTMBM3Qka5AHSuoe2ZefrjjLeJ+ZSDZRttnYWq8G4YmqE 0zTIEeYr2AFu9ArOzFemwcDbWvABDMk7xCzXA+12BMNtSXzZvHxIy8PFSjvHvJez3S YAdsfr4JknjQzy1oTmGNOWiNjBoy53Aw4QeObdoD7PPpRj80k4j1ezcEU3hRxILTBc bJhjx5h297HY0tCQlOIdj6ZBHlevavTxjJrCUl6IEAGcnIe62/67cN/kKkdQQwGHMK I9s5smcOPQWAA== To: Sebastian Moeller , Joel =?utf-8?Q?Wir=C4=81mu?= Pauling Cc: Cake List , cerowrt-devel , Jonathan Morton Subject: Re: [Cerowrt-devel] 10gige and 2.5gige In-Reply-To: References: <1639678615.275317887@apps.rackspace.com> <1639690165.936410589@apps.rackspace.com> <4927o62q-qq3-p947-qopq-89rppp637497@ynat.uz> <7F1B6B98-2DE2-41A3-B47F-30B46E2736FB@gmx.de> Date: Fri, 17 Dec 2021 12:33:08 +0100 X-Clacks-Overhead: GNU Terry Pratchett Message-ID: <874k77cvpn.fsf@toke.dk> MIME-Version: 1.0 Content-Type: text/plain X-List-Received-Date: Fri, 17 Dec 2021 11:33:09 -0000 Sebastian Moeller writes: > Mmmh, I guess our approach at traffic shaping does not scale well at those speeds. Maybe this could be fixed with larger batching? > > I think it might be worth trying to switch to simple.qos/fq_codel and > set a somewhat larger burst/quantum duration in defaults.sh, then > disable BQL on the NIC and configure a beefy txqueuelen on the > interface. This might help making SQM limp along to higher rates. If > that would actually work, we could try to see whether we can make cake > learn coarser batching ("bursts" and/or quantum) at high rates (but I > did not check what cake does internally, it might already do this, > @jonathan?).... I fear the problem is rather the lack of multithreading. I have a fairly beefy 8-core ARM box for my main router these days, and even that can't forward at a gigabit on a single core. There was a bug in the HW configuration so all traffic was sent to a single core, which resulted in 50+ms of bloat and traffic capping out way short of a gigabit. Now that it's fixed and traffic is mixed over all eight cores I have smooth sailing and no bloat. Thankfully I don't need to shape, so I'm just running straight mq+fq_codel on the physical interface.... -Toke