From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <moeller0@gmx.de>
Received: from mout.gmx.net (mout.gmx.net [212.227.15.15])
	(using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits))
	(Client CN "mout.gmx.net",
	Issuer "TeleSec ServerPass DE-1" (verified OK))
	by huchra.bufferbloat.net (Postfix) with ESMTPS id AA02B21F615
	for <cerowrt-devel@lists.bufferbloat.net>;
	Sat, 26 Jul 2014 04:30:14 -0700 (PDT)
Received: from hms-beagle.lan ([134.2.89.70]) by mail.gmx.com (mrgmx003) with
	ESMTPSA (Nemesis) id 0LgZRV-1Wgubb3urN-00o0ww;
	Sat, 26 Jul 2014 13:30:06 +0200
Content-Type: text/plain; charset=windows-1252
Mime-Version: 1.0 (Mac OS X Mail 7.3 \(1878.6\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <36889fad276c5cdd1cd083d1c83f2265@lang.hm>
Date: Sat, 26 Jul 2014 13:30:08 +0200
Content-Transfer-Encoding: quoted-printable
Message-Id: <2483CF77-EE7D-4D76-ACC8-5CBC75D093A7@gmx.de>
References: <CACj-SW2xRzNJa_c7CyOGzY-Yvun7UjNyp0W0aeF5DjO_Guu=ag@mail.gmail.com>
	<13144.1406313454@turing-police.cc.vt.edu>
	<36889fad276c5cdd1cd083d1c83f2265@lang.hm>
To: David Lang <david@lang.hm>
X-Mailer: Apple Mail (2.1878.6)
X-Provags-ID: V03:K0:Ua2mD6E7yd8ctPExFDJcJ+5tt8a4ImVQ8jjbg8Kei3jV6MZgFZ8
	/bxYLQScwPxy7uHVJYnT0hy3PiyYCYXht29jKAXDavoU/dY9FWxEWETFETz8wSA5yaPB4vM
	PMCYyd4po0oebcL67IWeLeD8zOajKIx8QMRKteUucsBysWtKWeXye14LFUzk9hAeH/NGFj1
	K2DTqJRNr9Z/tmgJaYUnQ==
Cc: cerowrt-devel@lists.bufferbloat.net
Subject: Re: [Cerowrt-devel] Ideas on how to simplify and popularize
	bufferbloat control for consideration.
X-BeenThere: cerowrt-devel@lists.bufferbloat.net
X-Mailman-Version: 2.1.13
Precedence: list
List-Id: Development issues regarding the cerowrt test router project
	<cerowrt-devel.lists.bufferbloat.net>
List-Unsubscribe: <https://lists.bufferbloat.net/options/cerowrt-devel>,
	<mailto:cerowrt-devel-request@lists.bufferbloat.net?subject=unsubscribe>
List-Archive: <https://lists.bufferbloat.net/pipermail/cerowrt-devel>
List-Post: <mailto:cerowrt-devel@lists.bufferbloat.net>
List-Help: <mailto:cerowrt-devel-request@lists.bufferbloat.net?subject=help>
List-Subscribe: <https://lists.bufferbloat.net/listinfo/cerowrt-devel>,
	<mailto:cerowrt-devel-request@lists.bufferbloat.net?subject=subscribe>
X-List-Received-Date: Sat, 26 Jul 2014 11:30:15 -0000

Hi David,


On Jul 25, 2014, at 23:03 , David Lang <david@lang.hm> wrote:

> On Fri, 25 Jul 2014 14:37:34 -0400, Valdis.Kletnieks@vt.edu wrote:
>> On Sat, 24 May 2014 10:02:53 -0400, "R." said:
>>=20
>>> Further, this function could be auto-scheduled or made enabled on
>>> router boot up.
>>=20
>> Yeah, if such a thing worked, it would be good.
>>=20
>> (Note in the following that a big part of my *JOB* is doing "What =
could
>> possibly go wrong?" analysis on mission-critical systems, which tends
>> to color
>> my viewpoint on projects. I still think the basic concept is good, =
just
>> difficult to do, and am listing the obvious challenges for anybody =
brave
>> enough to tackle it... :)
>>=20
>>> I must be missing something important which prevents this. What is =
it?
>>=20
>> There's a few biggies.  The first is what the linux-kernel calls =
-ENOPATCH -
>> nobody's written the code.  The second is you need an upstream target
>> someplace
>> to test against.  You need to deal with both the "server is =
unavalailable due
>> to a backhoe incident 2 time zones away" problem (which isn't *that*
>> hard, just
>> default to Something Not Obviously Bad(TM), and "server is =
slashdotted" (whci
>> is a bit harder to deal with.  Remember that there's some really odd =
corner
>> cases to worry about - for instance, if there's a power failure in a
>> town, then
>> when the electric company restores power you're going to have every
>> cerowrt box
>> hit the server within a few seconds - all over the same uplink most
>> likely.  No
>> good data can result from that... (Holy crap, it's been almost 3
>> decades since
>> I first saw a Sun 3/280 server tank because 12 Sun 3/50s all rebooted
>> over the
>> network at once when building power was restored).
>>=20
>> And if you're in Izbekistan and the closest server netwise is at 60
>> Hudson, the
>> analysis to compute the correct values becomes.... interesting.
>>=20
>> Dealing with non-obvious error conditions is also a challenge - a =
router
>> may only boot once every few months.  And if you happen to be booting =
just
>> as a BGP routing flap is causing your traffic to take a vastly =
suboptimal
>> path, you may end up encoding a vastly inaccurate setting and have it =
stuck
>> there, causing suckage for non-obvious reasons for the non-technical, =
so you
>> really don't want to enable auto-tuning unless you also have a good =
plan for
>> auto-*RE*tuning....
>=20
> have the router record it's finding, and then repeat the test =
periodically, recording it's finding as well. If the new finding is =
substantially different from the prior ones, schedule a retest 'soon' =
(or default to the prior setting if it's bad enough), otherwise, if =
there aren't many samples, schedule a test 'soon' if there are a lot of =
samples, schedule a test in a while.

	Yeah, keeping some history to =93predict=94 when to measure next =
sounds clever.

>=20
> However, I think the big question is how much the tuning is required.

I assume in most cases you need to measure the home-routers bandwidth =
rarely (say on DSL only after a re-sync with the DSLAM), but you need to =
measure the bandwidth early as only then you can properly shape the =
downlink. And we need to know the link=92s capacity to use traffic =
shaping so that BQL and fq_codel in the router have control over the =
bottleneck queue=85 An equivalent of BQL and fq_codel running in the =
DSLAM/CMTS and CPE obviously would be what we need, because then BQL and =
fq_codel on the router would be all that is required. But that does not =
seem like it is happening anytime soon, so we still need to workaround =
the limitations in the equipment fr a long time to come, I fear.=20

>=20
> If a connection with BQL and fq_codel is 90% as good as a tuned setup, =
default to untuned unless the user explicitly hits a button to measure =
(and then a second button to accept the measurement)
>=20
> If BQL and fw_codel by default are M70% as good as a tuned setup, =
there's more space to argue that all setups must be tuned, but then the =
question is how to they fare against a old, non-BQL, non-fq-codel setup? =
if they are considerably better, it may still be worthwhile.
=09

Best Regards
	Sebastian

>=20
> David Lang
> _______________________________________________
> Cerowrt-devel mailing list
> Cerowrt-devel@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/cerowrt-devel