From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <moeller0@gmx.de>
Received: from mout.gmx.net (mout.gmx.net [212.227.17.21])
 (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
 (No client certificate requested)
 by lists.bufferbloat.net (Postfix) with ESMTPS id 04AA23CB42;
 Mon,  9 Jan 2023 12:00:42 -0500 (EST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.de; s=s31663417;
 t=1673283633; bh=bd71Vu89uZAWPjgpEAY1nss3oHQjv0Dhi9cvwoqP91c=;
 h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To;
 b=b0hw4h88okEKyFNaGrNB6fIsrigaO4FbdP+2kHRmCalCgNnkXwlfJ/8b6wH/kXOha
 vOAYnf1JW+1PpLFuJO/pWE0BvJej3yFEB8m9VsIB+L50cn8nh4qArENcx/hMe2pgBy
 gGAQcVIAiISFtZ3nacng5rwkxgyv16RDodK8nC/RhwXTG2KiYWl4TlZh/GBXR0P0j8
 BItbrZksoY+Cu/pIg4zasO2wZSNwykmpXl3KmidHuw7/zzYapblGhyJpZiLY+Laf7v
 MK2MMWuJ+V2len2pg5xZgnG0/brh09A5QFoBU5sJwEuGdTNyffev+9Jc8xN0wIfxu3
 yPfDedWb2nR/g==
X-UI-Sender-Class: 724b4f7f-cbec-4199-ad4e-598c01a50d3a
Received: from smtpclient.apple ([134.76.241.253]) by mail.gmx.net (mrgmx104
 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MNKhm-1pQPz11fiM-00OodL; Mon, 09
 Jan 2023 18:00:33 +0100
Content-Type: text/plain;
	charset=us-ascii
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3696.120.41.1.1\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <CAA93jw4x5D=XsHmH7n9nj++-+Oy9XhLB9zvVeivUCw1QjD9gNg@mail.gmail.com>
Date: Mon, 9 Jan 2023 18:00:31 +0100
Cc: "Livingood, Jason" <Jason_Livingood@comcast.com>,
 Rpm <rpm@lists.bufferbloat.net>,
 "mike.reynolds@netforecast.com" <mike.reynolds@netforecast.com>,
 libreqos <libreqos@lists.bufferbloat.net>,
 "David P. Reed" <dpreed@deepplum.com>,
 "starlink@lists.bufferbloat.net" <starlink@lists.bufferbloat.net>,
 bloat <bloat@lists.bufferbloat.net>
Content-Transfer-Encoding: quoted-printable
Message-Id: <DD055769-FB4C-4EB8-8696-7FEA84BB0DE9@gmx.de>
References: <mailman.2651.1672779463.1281.starlink@lists.bufferbloat.net>
 <1672786712.106922180@apps.rackspace.com>
 <F4CA66DA-516C-438A-8D8A-5F172E5DFA75@cable.comcast.com>
 <CAA93jw4x5D=XsHmH7n9nj++-+Oy9XhLB9zvVeivUCw1QjD9gNg@mail.gmail.com>
To: =?utf-8?Q?Dave_T=C3=A4ht?= <dave.taht@gmail.com>
X-Mailer: Apple Mail (2.3696.120.41.1.1)
X-Provags-ID: V03:K1:tYolVKxC8L/DZKSuNVbCqqFV4XEqXetuFXtJlEMVUQu+0PNloH0
 2fnv+ifH4vuFzpHMyLuqliMFoUpdE/EEtsG0aKOXD4kxbElEubK9937UnEh4/phfCwTdJZ/
 3lQt43ZchVH+78gmVMaZgI+It6by38BW4o3lKt9AdnGjwaAVPRvejmeTkiOhshWrlvsRm9e
 CGvNqdTev3xKIxfe5bDMA==
X-Spam-Flag: NO
UI-OutboundReport: notjunk:1;M01:P0:wDihcPSuexA=;sVz/iXajbek9Kicvn+ZarYcq7Ez
 53Uhy+2mkO5fVMNmad+5lvTdMGNwuYhjriE1pQtDYvYIdg+CNMWC+N+oJvOdKyrJXVGkTuxJF
 XTPOCG7b5d0pH8C7j28mxso/63ZxRjWbKs3fKNM3NmSFl7vyJIgYXT+jxlsJajy6N3ddrkhrO
 5g0L5Uwxn2ZZwq7KxqYuZyxB/rPRFl/nY1ABggXe5JbyQKk4wMjDWqglYSb3QOVuobCp18ir1
 mG1grWMhxO+PZBrxxYdxV+C0TYmieOYnoX0qhngubpPfW/nnSbgn2oeF5rW7/1V4zEulEW7ED
 K4oPmNjXI0fpknegbYFIL0hPTQjciJ1wqEkG+htQHox3ImEGgjyNmHZcRX1fXi/xZy1lF0OjJ
 csvN3twPYvN75e4h4jDG/cn9W45deQlNrc3qwAZSN2mwUHLVbwTaUeKrsyh8Rxl+31AeVB+dh
 G38eAuyCqsg+OeEwNqXTbeNVbTX39FZGSkgoKKBa3MfkIWTNp71bmF5rfScCfDBs9fhlyjIsD
 YoHpSrGNNG693e8+1hiPdNVjWV4ENSRlDnn/PHggQs6Dss1PdOrYdCoJTMfDR5yUpfcoGgxlk
 /2PdQV+XQeX3dGzSK3lCvKv7UHPZjjMhCimWV1nRJ0iGtG/Tc+SsbV+LVf6NiTOK6Dfnx6oAk
 44cT+q2pXlXO8H0xnSLrnTeNfBcN5bes0r39Xifsamq8Z51WQjLfNvLEWJG1QPuZmngCv23X0
 gsa6urVSdgwz12MFM8rFGygAwnIiOV7B4IBKOjKM9dPJa1ZQyfz3VyYjNwKUc1tgRPX8NwM0j
 WgABiTxBVpQuSEgB7sB5R6T/fKtnvA3Xjrg4MEuDHhWAw33jCQNt2uhPo5bsI9xJZ835SsK4I
 8FIZ2kyBjfTOtL3x1iTRlHgz9OMkaJQ1WJ+sMnUAMwDKAjwAPt2ompbeOWoTOaVejeldxcuOa
 Z4k+tEiC9nRSGUI4vR23pWVgQVY=
Subject: Re: [Rpm] [Starlink] Researchers Seeking Probe Volunteers in USA
X-BeenThere: rpm@lists.bufferbloat.net
X-Mailman-Version: 2.1.20
Precedence: list
List-Id: revolutions per minute - a new metric for measuring responsiveness
 <rpm.lists.bufferbloat.net>
List-Unsubscribe: <https://lists.bufferbloat.net/options/rpm>,
 <mailto:rpm-request@lists.bufferbloat.net?subject=unsubscribe>
List-Archive: <https://lists.bufferbloat.net/pipermail/rpm>
List-Post: <mailto:rpm@lists.bufferbloat.net>
List-Help: <mailto:rpm-request@lists.bufferbloat.net?subject=help>
List-Subscribe: <https://lists.bufferbloat.net/listinfo/rpm>,
 <mailto:rpm-request@lists.bufferbloat.net?subject=subscribe>
X-List-Received-Date: Mon, 09 Jan 2023 17:00:43 -0000

Hi Dave,


just a data point: Apple's networkQuality on Monterey (12.6.2, x86) =
defaults to bi-directionally saturating traffic. Your argument about the =
duration still holds, though: the test is really short. While I =
understand the motivation behind that, I think it would serve the =
internet much better if all such tests randomly offered users an =
extended test duration of, say, a minute. Users would need to opt in, =
but that would at least collect some longer-duration data. Now, I have =
no idea whether Apple actually keeps results on their server side (Ookla =
sure does, but given Apple's laudable privacy stance they might not); if =
not, it would do little good to run extended tests. But for "players" =
like Ookla that do keep some logs, interspersing longer-running tests =
would offer a great way to test ISPs outside the "magic 20 seconds".


> On Jan 9, 2023, at 16:26, Dave Taht via Starlink =
<starlink@lists.bufferbloat.net> wrote:
>=20
> I have many kvetches about the new latency under load tests being
> designed and distributed over the past year. I am delighted! that they
> are happening, but most really need third party evaluation, and
> calibration, and a solid explanation of what network pathologies they
> do and don't cover. Also a RED team attitude towards them, as well as
> thinking hard about what you are not measuring (operations research).

	[SM] RED as in RED/BLUE team or as in random early detection? ;)

>=20
> I actually rather love the new cloudflare speedtest, because it tests
> a single TCP connection, rather than dozens, and at the same time folk
> are complaining that it doesn't find the actual "speed!".

	[SM] Ookla's on-line test can be toggled between multi and =
single flow mode (which is good, the default is multi) but e.g. the =
official macOS client application from Ookla does not offer this toggle =
and defaults to multi-flow (which is less good). Fast.com can be =
configured for single flow tests, but defaults to multi-flow.


> yet... the
> test itself more closely emulates a user experience than speedtest.net
> does.

	[SM] I like the separate reporting for transfer rates for =
objects of different sizes. I would argue that both single and =
multi-flow tests have merit, but I agree with you that if only one test =
is performed a single-flow test seems somewhat better.

> I am personally pretty convinced that the fewer numbers of flows
> that a web page opens improves the likelihood of a good user
> experience, but lack data on it.
>=20
> To try to tackle the evaluation and calibration part, I've reached out
> to all the new test designers in the hope that we could get together
> and produce a report of what each new test is actually doing.

	[SM] +1; and probably part of your questionnaire already: what =
measures are actually reported back to the user.


> I've
> tweeted, linked in, emailed, and spammed every measurement list I know
> of, and only to some response, please reach out to other test designer
> folks and have them join the rpm email list?
>=20
> My principal kvetches in the new tests so far are:
>=20
> 0) None of the tests last long enough.
>=20
> Ideally there should be a mode where they at least run to "time of
> first loss", or periodically, just run longer than the
> industry-stupid^H^H^H^H^H^Hstandard 20 seconds. There be dragons
> there! It's really bad science to optimize the internet for 20
> seconds. It's like optimizing a car, to handle well, for just 20
> seconds.

	[SM] ++1

> 1) Not testing up + down + ping at the same time
>=20
> None of the new tests actually test the same thing that the infamous
> rrul test does - all the others still test up, then down, and ping. It
> was/remains my hope that the simpler parts of the flent test suite -
> such as the tcp_up_squarewave tests, the rrul test, and the rtt_fair
> tests would provide calibration to the test designers.
>=20
> we've got zillions of flent results in the archive published here:
> https://blog.cerowrt.org/post/found_in_flent/
>=20
> The new tests have all added up + ping and down + ping, but not up +
> down + ping. Why??

	[SM] I think at least on Monterey Apple's networkQuality does =
bidirectional tests (I just confirmed that via packet-capture, but it is =
already visible in iftop (but hobbled by iftop's relatively high default =
hysteresis)). You actually need to manually intervene to get a =
sequential test:

laptop:~ user$ networkQuality -h
USAGE: networkQuality [-C <configuration_url>] [-c] [-h] [-I =
<interfaceName>] [-s] [-v]
    -C: override Configuration URL
    -c: Produce computer-readable output
    -h: Show help (this message)
    -I: Bind test to interface (e.g., en0, pdp_ip0,...)
    -s: Run tests sequentially instead of parallel upload/download
    -v: Verbose output

laptop:~ user $ networkQuality -v
=3D=3D=3D=3D SUMMARY =3D=3D=3D=3D                                        =
                                                =20
Upload capacity: 194.988 Mbps
Download capacity: 894.162 Mbps
Upload flows: 16
Download flows: 12
Responsiveness: High (2782 RPM)
Base RTT: 8
Start: 1/9/23, 17:45:57
End: 1/9/23, 17:46:12
OS Version: Version 12.6.2 (Build 21G320)

laptop:~ user $ networkQuality -v -s
=3D=3D=3D=3D SUMMARY =3D=3D=3D=3D                                        =
                                                =20
Upload capacity: 641.206 Mbps
Download capacity: 883.787 Mbps
Upload flows: 16
Download flows: 12
Upload Responsiveness: High (3529 RPM)
Download Responsiveness: High (1939 RPM)
Base RTT: 8
Start: 1/9/23, 17:46:17
End: 1/9/23, 17:46:41
OS Version: Version 12.6.2 (Build 21G320)

(this is alas not my home link...)
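
To put numbers on the two runs above, here is a minimal Python sketch
that parses a few of the summary lines and compares them. The summary
text is copied from the output above; the helper names parse_summary
and rpm_to_ms are made up for illustration and are not part of
networkQuality. The conversion assumes RPM means round trips per
minute, so 60000/RPM gives only a rough per-round-trip time in ms (the
tool aggregates several kinds of probes into that one number).

```python
import re

# Copied from the two runs above (parallel default, then -s sequential).
PARALLEL = """\
Upload capacity: 194.988 Mbps
Download capacity: 894.162 Mbps
Responsiveness: High (2782 RPM)
"""

SEQUENTIAL = """\
Upload capacity: 641.206 Mbps
Download capacity: 883.787 Mbps
Upload Responsiveness: High (3529 RPM)
Download Responsiveness: High (1939 RPM)
"""


def parse_summary(text):
    """Return {field name: number} for the Mbps and RPM lines."""
    values = {}
    for line in text.splitlines():
        name, _, rest = line.partition(": ")
        match = re.search(r"([\d.]+) (Mbps|RPM)", rest)
        if match:
            values[name] = float(match.group(1))
    return values


def rpm_to_ms(rpm):
    """Round trips per minute -> approximate milliseconds per round trip."""
    return 60_000.0 / rpm


par = parse_summary(PARALLEL)
seq = parse_summary(SEQUENTIAL)

# Share of the sequential upload rate that survives a concurrent download.
upload_ratio = par["Upload capacity"] / seq["Upload capacity"]
print(f"upload under bidirectional load: {upload_ratio:.0%} of sequential")
print(f"loaded round trip: ~{rpm_to_ms(par['Responsiveness']):.1f} ms")
```

On these numbers the download rate barely changes between the two
modes, while the upload rate in the parallel run drops to roughly 30%
of the sequential one, which is exactly the kind of effect an
up + down + ping test exposes.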


>=20
> The behaviors of what happens in that case are really non-intuitive, I
> know, but... it's just one more phase to add to any one of those new
> tests. I'd be deliriously happy if someone(s) new to the field
> started doing that, even optionally, and boggled at how it defeated
> their assumptions.

	[SM] Someone at Apple apparently listened ;)


>=20
> Among other things that would show...
>=20
> It's the home router industry's dirty secret that darn few "gigabit"
> home routers can actually forward in both directions at a gigabit.

	[SM] That is going to be remedied in the near future. The first =
batch of nominal Gigabit links were mostly asymmetric, e.g. often =
something like 1000/50 over DOCSIS or 1000/500 over GPON (reflecting the =
asymmetric nature of these media in the field). But with symmetric =
XGS-PON being deployed by more and more (still a low absolute number) =
ISPs, symmetric performance is going to move into the spotlight. However, =
my guess is that the first few generations of home routers for these =
speed grades will rely heavily on accelerator engines.


> I'd
> like to smash that perception thoroughly, but given our starting point
> is a gigabit router was a "gigabit switch" - and historically been
> something that couldn't even forward at 200Mbit - we have a long way
> to go there.
>=20
> Only in the past year have non-x86 home routers appeared that could
> actually do a gbit in both directions.
>=20
> 2) Few are actually testing within-stream latency
>=20
> Apple's rpm project is making a stab in that direction. It looks
> highly likely, that with a little more work, crusader and
> go-responsiveness can finally start sampling the tcp RTT, loss and
> markings, more directly. As for the rest... sampling TCP_INFO on
> windows, and Linux, at least, always appeared simple to me, but I'm
> discovering how hard it is by delving deep into the rust behind
> crusader.

	[SM] I think go-responsiveness looks at TCP_INFO already (on =
request) but will report an aggregate info block over all flows, which =
can get interesting as in my testing I often see a mix of IPv4 and IPv6 =
flows within individual tests, with noticeably different numbers for =
e.g. MSS. (Yes, MSS is not what you are asking for here, but I think =
flent does it right by diligently reporting all such measures =
flow-by-flow, although that will explode pretty quickly if, say, a test =
uses 32/32 flows by direction).


>=20
> the goresponsiveness thing is also IMHO running WAY too many streams
> at the same time, I guess motivated by an attempt to have the test
> complete quickly?

	[SM] I can only guess, but the goal is to saturate the link =
persistently (and to get to that state fast), and for that goal parallel =
flows seem to be OK, especially as that will reduce the server load for =
each of these flows a bit, no?


>=20
> B) To try and tackle the validation problem:
>=20
> In the libreqos.io project we've established a testbed where tests can
> be plunked through various ISP plan network emulations. It's here:
> https://payne.taht.net (run bandwidth test for what's currently hooked
> up)
>=20
> We could rather use an AS number and at least a ipv4/24 and ipv6/48 to
> leverage with that, so I don't have to nat the various emulations.
> (and funding, anyone got funding?) Or, as the code is GPLv2 licensed,
> to see more test designers setup a testbed like this to calibrate
> their own stuff.
>=20
> Presently we're able to test:
> flent
> netperf
> iperf2
> iperf3
> speedtest-cli
> crusader
> the broadband forum udp based test:
> https://github.com/BroadbandForum/obudpst
> trexx
>=20
> There's also a virtual machine setup that we can remotely drive a web
> browser from (but I didn't want to nat the results to the world) to
> test other web services.