From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <mike@starlink.sx>
Received: from vsmx002.dclux.xion.oxcs.net (vsmx002.dclux.xion.oxcs.net
 [185.74.65.108])
 (using TLSv1.2 with cipher ADH-AES256-GCM-SHA384 (256/256 bits))
 (No client certificate requested)
 by lists.bufferbloat.net (Postfix) with ESMTPS id 909F13B2A4
 for <starlink@lists.bufferbloat.net>; Fri, 11 Jun 2021 18:34:45 -0400 (EDT)
Received: from proxy-2.proxy.oxio.ns.xion.oxcs.net
 (proxy-2.proxy.oxio.ns.xion.oxcs.net [31.4.191.24])
 by mx-out.dclux.xion.oxcs.net (Postfix) with SMTP id 3F5F28C0D8F;
 Fri, 11 Jun 2021 22:34:42 +0000 (UTC)
Date: Sat, 12 Jun 2021 00:34:31 +0200
From: Mike Puchol <mike@starlink.sx>
To: starlink@lists.bufferbloat.net, davecb@spamcop.net
Message-ID: <a1b4d233-6c4d-47b2-b43d-e72acc84c23f@Spark>
In-Reply-To: <01a7bed2-6f49-3d7d-eb5a-209031ee8070@gmail.com>
References: <baa8ff7a-0bde-9d6e-5984-ef5fcbae5ccd@rogers.com>
 <950B8EAF-90B9-41A6-951D-91821F591D41@teklibre.net>
 <01a7bed2-6f49-3d7d-eb5a-209031ee8070@gmail.com>
X-Readdle-Message-ID: a1b4d233-6c4d-47b2-b43d-e72acc84c23f@Spark
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="60c3e500_519b500d_3067"
X-VadeSecure-Status: LEGIT
X-VADE-STATUS: LEGIT
Subject: Re: [Starlink] Fwd: Microstate Accounting and the Nyquist problem
X-BeenThere: starlink@lists.bufferbloat.net
X-Mailman-Version: 2.1.20
Precedence: list
List-Id: "Starlink has bufferbloat. Bad." <starlink.lists.bufferbloat.net>
List-Unsubscribe: <https://lists.bufferbloat.net/options/starlink>,
 <mailto:starlink-request@lists.bufferbloat.net?subject=unsubscribe>
List-Archive: <https://lists.bufferbloat.net/pipermail/starlink>
List-Post: <mailto:starlink@lists.bufferbloat.net>
List-Help: <mailto:starlink-request@lists.bufferbloat.net?subject=help>
List-Subscribe: <https://lists.bufferbloat.net/listinfo/starlink>,
 <mailto:starlink-request@lists.bufferbloat.net?subject=subscribe>
X-List-Received-Date: Fri, 11 Jun 2021 22:34:45 -0000

--60c3e500_519b500d_3067
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

We know that Starlink recalculates topology every 15 seconds (this guy, w=
ho obviously has way too much spare time, came up with an indirect observ=
ation of this interval:=C2=A0https://blog.beerriot.com/2021/02/14/starlin=
k-raster-scan/=C2=A0)

If we could align with this, we could at least know when potential change=
s in path delays happen, and try to observe other changes that happen at =
a similar cadence.

Other thoughts, try to plug more details out of the gRPC data, setup GPS-=
synced probes with a device at the exit PoP, measure differences between =
time-sync probes to an array of endpoints.

Has nobody attacked the JTAG connector on a Dishy yet=3F

Best,

Mike
On Jun 12, 2021, 00:14 +0200, David Collier-Brown <davecb.42=40gmail.com>=
, wrote:
> OK, Oh Smarter Colleagues, the challenge to you is to say if there is a=
 =22natural=22 place to capture state changes to get the data we want, an=
d if so, is it common or similar enough between drivers to be worthy of a=
ttention=3F
> --dave
> On 2021-06-09 9:15 a.m., Dave Taht wrote:
> >
> >
> > > Begin forwarded message:
> > >
> > > =46rom: David Collier-Brown <davecb.42=40gmail.com>
> > > Subject: Microstate Accounting and the Nyquist problem
> > > Date: June 9, 2021 at 4:44:14 AM PDT
> > > To: Dave Taht <davet=40teklibre.net>
> > > Cc: Dave Collier-Brown <dave.collier-brown=40indexexchange.com>
> > > Reply-To: davecb=40spamcop.net
> > >
> > > A million years ago (roughly around Solaris 9), Sun was suffering f=
rom the same problems in measuring their dispatcher as you are with =22sl=
oshing=22.
> > > A CPU would be 100% busy in one microsecond, 10% busy in the next g=
azillion, and the average CPU utilization for our sample period would be =
maybe 10.1, if the sampler happened to sample right when the spike was ha=
ppening.
> > > This was utterly useless for things like the fair-share scheduler, =
so it got fixed in Solaris 10, by having the dispatcher record the time a=
 process (well, kernel thread) had spent in a state when the state change=
d.
> > > Initially =22microstate accounting=22 could be toggled on and off, =
but the branch-around cost more time than always doing the calculation (a=
s discovered by my mad friend =46red) and the kernel folks left it on. It=
's on to this day.
> > > In Simon Sundberg's talk, the opportunity to measure occurs every 1=
,000 packets, when a suitable timestamp is provided. While the eBP=46 pro=
gram can look at every packet and do after-the-fact book-keeping in a map=
, that's only good if the phenomenon you're measuring is persistent enoug=
h that it's around for =7E2,000 packets.
> > > I'm going to suggest that the right place to record the information=
 you want is right where the event happens.=C2=A0 Preferably in c code, a=
s performance is easy to mess up, but perhaps with an eBP=46 mechanism to=
 export it.
> > > In previous Solaris work, I reliably found that exporting kstats wa=
s a darn sight harder than collecting them, and in Eric's blog post=5B1=5D=
 he notes that converting time is expensive and best done long after coll=
ecting, when someone wanted to read the data.
> > > There was an effort to do kstats in Linux=5B2=5D, but it had suppos=
edly poor performance, and actual trouble when the clock frequency change=
d.
> > > Is there, in your opinion, a =22natural=22 place to capture state c=
hanges to get the data you want, and if so, is it common or similar enoug=
h between drivers to be worthy of attention=3F
> > > --dave
> > >
> > > References:
> > >
> > > 1. Solaris: http://dtrace.org/blogs/eschrock/2004/10/13/microstate-=
accounting-in-solaris-10/
> > > 2. A failing Linux effort: https://lwn.net/Articles/127296/, https:=
//sourceforge.net/projects/microstate/
> > >
> > > --
> > > David Collier-Brown,         =7C Always do right. This will gratify=

> > > System Programmer and Author =7C some people and astonish the rest
> > > davecb=40spamcop.net           =7C                      -- Mark Twa=
in
> >
> =5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=
=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F
> Starlink mailing list
> Starlink=40lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/starlink

--60c3e500_519b500d_3067
Content-Type: text/html; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

<html xmlns=3D=22http://www.w3.org/1999/xhtml=22>
<head>
<title></title>
</head>
<body>
<div name=3D=22messageBodySection=22>
<div dir=3D=22auto=22>We know that Starlink recalculates topology every 1=
5 seconds (this guy, who obviously has way too much spare time, came up w=
ith an indirect observation of this interval:&=23160;<a href=3D=22https:/=
/blog.beerriot.com/2021/02/14/starlink-raster-scan/=22 target=3D=22=5Fbla=
nk=22>https://blog.beerriot.com/2021/02/14/starlink-raster-scan/</a>&=231=
60;)<br />
<br />
If we could align with this, we could at least know when potential change=
s in path delays happen, and try to observe other changes that happen at =
a similar cadence.<br />
<br />
Other thoughts, try to plug more details out of the gRPC data, setup GPS-=
synced probes with a device at the exit PoP, measure differences between =
time-sync probes to an array of endpoints.<br />
<br />
Has nobody attacked the JTAG connector on a Dishy yet=3F</div>
</div>
<div name=3D=22messageSignatureSection=22><br />
Best,<br />
<br />
Mike</div>
<div name=3D=22messageReplySection=22>On Jun 12, 2021, 00:14 +0200, David=
 Collier-Brown &lt;davecb.42=40gmail.com&gt;, wrote:<br />
<blockquote type=3D=22cite=22 style=3D=22border-left-color: grey; border-=
left-width: thin; border-left-style: solid; margin: 5px 5px;padding-left:=
 10px;=22>
<div>
<p>OK, <i>Oh Smarter Colleagues</i>, the challenge to you is to say if th=
ere is a =22natural=22 place to capture state changes to get the data we =
want, and if so, is it common or similar enough between drivers to be wor=
thy of attention=3F<br /></p>
<p>--dave<br /></p>
<div class=3D=22moz-cite-prefix=22>On 2021-06-09 9:15 a.m., Dave Taht wro=
te:<br /></div>
<blockquote type=3D=22cite=22 cite=3D=22mid:950B8EA=46-90B9-41A6-951D-918=
21=46591D41=40teklibre.net=22><br class=3D=22=22 />
<div><br class=3D=22=22 />
<blockquote type=3D=22cite=22 class=3D=22=22>
<div class=3D=22=22>Begin forwarded message:</div>
<br class=3D=22Apple-interchange-newline=22 />
<div style=3D=22margin-top: 0px; margin-right: 0px; margin-bottom: 0px; m=
argin-left: 0px;=22 class=3D=22=22><span style=3D=22font-family: -webkit-=
system-font, Helvetica Neue, Helvetica, sans-serif; color:rgba(0, 0, 0, 1=
.0);=22 class=3D=22=22><b class=3D=22=22>=46rom:</b></span> <span style=3D=
=22font-family: -webkit-system-font, Helvetica Neue, Helvetica, sans-seri=
f;=22 class=3D=22=22>David Collier-Brown &lt;<a href=3D=22mailto:davecb.4=
2=40gmail.com=22 class=3D=22=22 moz-do-not-send=3D=22true=22>davecb.42=40=
gmail.com</a>&gt;<br class=3D=22=22 /></span></div>
<div style=3D=22margin-top: 0px; margin-right: 0px; margin-bottom: 0px; m=
argin-left: 0px;=22 class=3D=22=22><span style=3D=22font-family: -webkit-=
system-font, Helvetica Neue, Helvetica, sans-serif; color:rgba(0, 0, 0, 1=
.0);=22 class=3D=22=22><b class=3D=22=22>Subject:</b></span> <span style=3D=
=22font-family: -webkit-system-font, Helvetica Neue, Helvetica, sans-seri=
f;=22 class=3D=22=22><b class=3D=22=22>Microstate Accounting and the Nyqu=
ist problem</b><br class=3D=22=22 /></span></div>
<div style=3D=22margin-top: 0px; margin-right: 0px; margin-bottom: 0px; m=
argin-left: 0px;=22 class=3D=22=22><span style=3D=22font-family: -webkit-=
system-font, Helvetica Neue, Helvetica, sans-serif; color:rgba(0, 0, 0, 1=
.0);=22 class=3D=22=22><b class=3D=22=22>Date:</b></span> <span style=3D=22=
font-family: -webkit-system-font, Helvetica Neue, Helvetica, sans-serif;=22=
 class=3D=22=22>June 9, 2021 at 4:44:14 AM PDT<br class=3D=22=22 /></span=
></div>
<div style=3D=22margin-top: 0px; margin-right: 0px; margin-bottom: 0px; m=
argin-left: 0px;=22 class=3D=22=22><span style=3D=22font-family: -webkit-=
system-font, Helvetica Neue, Helvetica, sans-serif; color:rgba(0, 0, 0, 1=
.0);=22 class=3D=22=22><b class=3D=22=22>To:</b></span> <span style=3D=22=
font-family: -webkit-system-font, Helvetica Neue, Helvetica, sans-serif;=22=
 class=3D=22=22>Dave Taht &lt;<a href=3D=22mailto:davet=40teklibre.net=22=
 class=3D=22=22 moz-do-not-send=3D=22true=22>davet=40teklibre.net</a>&gt;=
<br class=3D=22=22 /></span></div>
<div style=3D=22margin-top: 0px; margin-right: 0px; margin-bottom: 0px; m=
argin-left: 0px;=22 class=3D=22=22><span style=3D=22font-family: -webkit-=
system-font, Helvetica Neue, Helvetica, sans-serif; color:rgba(0, 0, 0, 1=
.0);=22 class=3D=22=22><b class=3D=22=22>Cc:</b></span> <span style=3D=22=
font-family: -webkit-system-font, Helvetica Neue, Helvetica, sans-serif;=22=
 class=3D=22=22>Dave Collier-Brown &lt;<a href=3D=22mailto:dave.collier-b=
rown=40indexexchange.com=22 class=3D=22=22 moz-do-not-send=3D=22true=22>d=
ave.collier-brown=40indexexchange.com</a>&gt;<br class=3D=22=22 /></span>=
</div>
<div style=3D=22margin-top: 0px; margin-right: 0px; margin-bottom: 0px; m=
argin-left: 0px;=22 class=3D=22=22><span style=3D=22font-family: -webkit-=
system-font, Helvetica Neue, Helvetica, sans-serif; color:rgba(0, 0, 0, 1=
.0);=22 class=3D=22=22><b class=3D=22=22>Reply-To:</b></span> <span style=
=3D=22font-family: -webkit-system-font, Helvetica Neue, Helvetica, sans-s=
erif;=22 class=3D=22=22><a href=3D=22mailto:davecb=40spamcop.net=22 class=
=3D=22=22 moz-do-not-send=3D=22true=22>davecb=40spamcop.net</a><br class=3D=
=22=22 /></span></div>
<br class=3D=22=22 />
<div class=3D=22=22>
<div class=3D=22=22>
<p class=3D=22=22>A million years ago (roughly around Solaris 9), Sun was=
 suffering from the same problems in measuring their dispatcher as you ar=
e with =22sloshing=22.</p>
<p class=3D=22=22>A CPU would be 100% busy in one microsecond, 10% busy i=
n the next gazillion, and the average CPU utilization for our sample peri=
od would be <i class=3D=22=22>maybe</i> 10.1, if the sampler happened to =
sample right when the spike was happening.</p>
<p class=3D=22=22>This was utterly useless for things like the fair-share=
 scheduler, so it got fixed in Solaris 10, by having the dispatcher recor=
d the time a process (well, kernel thread) had spent in a state when the =
state changed.<br class=3D=22=22 /></p>
<p class=3D=22=22>Initially =22microstate accounting=22 could be toggled =
on and off, but the branch-around cost more time than always doing the ca=
lculation (as discovered by my mad friend =46red) and the kernel folks le=
ft it on. It's on to this day.</p>
<p class=3D=22=22>In Simon Sundberg's talk, the opportunity to measure oc=
curs every 1,000 packets, when a suitable timestamp is provided. While th=
e eBP=46 program can look at every packet and do after-the-fact book-keep=
ing in a map, that's only good if the phenomenon you're measuring is pers=
istent enough that it's around for =7E2,000 packets.</p>
<p class=3D=22=22>I'm going to suggest that the right place to record the=
 information you want is right where the event happens.&=23160; Preferabl=
y in c code, as performance is easy to mess up, but perhaps with an eBP=46=
 mechanism to export it.</p>
<p class=3D=22=22>In previous Solaris work, I reliably found that exporti=
ng kstats was a darn sight harder than collecting them, and in Eric's blo=
g post=5B1=5D he notes that converting time is expensive and best done lo=
ng after collecting, when someone wanted to read the data.</p>
<p class=3D=22=22>There was an effort to do kstats in Linux=5B2=5D, but i=
t had supposedly poor performance, and actual trouble when the clock freq=
uency changed.<br class=3D=22=22 /></p>
<p class=3D=22=22>Is there, in your opinion, a =22natural=22 place to cap=
ture state changes to get the data you want, and if so, is it common or s=
imilar enough between drivers to be worthy of attention=3F</p>
<p class=3D=22=22>--dave<br class=3D=22=22 /></p>
<p class=3D=22=22><br class=3D=22=22 /></p>
<p class=3D=22=22>References:</p>
<ol class=3D=22=22>
<li class=3D=22=22>Solaris: <a class=3D=22moz-txt-link-freetext=22 href=3D=
=22https://can01.safelinks.protection.outlook.com/=3Furl=3Dhttp%3A%2=46%2=
=46dtrace.org%2=46blogs%2=46eschrock%2=462004%2=4610%2=4613%2=46microstat=
e-accounting-in-solaris-10%2=46&amp;data=3D04%7C01%7C%7C7f7cd5aab2ca42e2e=
7e908d92d25e27f%7Cb07c069022b843668d8d7b845d088e18%7C1%7C0%7C637590463000=
477252%7CUnknown%7CTW=46pbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJB=
TiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=3DfdZDOtRCcBk%2BO1ksiTOSU%2=46=
ltR8IMwueHyj0kQG4UkHw%3D&amp;reserved=3D0=22 originalsrc=3D=22http://dtra=
ce.org/blogs/eschrock/2004/10/13/microstate-accounting-in-solaris-10/=22 =
shash=3D=22r++8CBJzd3EiHdQD4yln/JywHEgcQZcRkADLM58pY3y4GIQM79qWqmhLCC/gRJ=
=46mrZMcTsRTXYWsjJvwqLaUNTcyJvdbeC+s=46PghSwQwf0ml5RWpT/hdeHE62U3EYo3yqhk=
0XWHHRmrDgD5wIcJP=468LNpbygu6zd=46rcp5AUtudE=3D=22 moz-do-not-send=3D=22t=
rue=22>http://dtrace.org/blogs/eschrock/2004/10/13/microstate-accounting-=
in-solaris-10/</a><br class=3D=22=22 /></li>
<li class=3D=22=22>A failing Linux effort: <a class=3D=22moz-txt-link-fre=
etext=22 href=3D=22https://can01.safelinks.protection.outlook.com/=3Furl=3D=
https%3A%2=46%2=46lwn.net%2=46Articles%2=46127296%2=46&amp;data=3D04%7C01=
%7C%7C7f7cd5aab2ca42e2e7e908d92d25e27f%7Cb07c069022b843668d8d7b845d088e18=
%7C1%7C0%7C637590463000487248%7CUnknown%7CTW=46pbGZsb3d8eyJWIjoiMC4wLjAwM=
DAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=3DuN0g=
q8vi0GJHMPpjKVYRjX6G5nQOc%2BugxUwUEk3%2BWJ8%3D&amp;reserved=3D0=22 origin=
alsrc=3D=22https://lwn.net/Articles/127296/=22 shash=3D=22NeRrtMZSW/w5Z16=
EVp+L4mx76CnciKROysvnuvIYMwhpmHy1kPIu8UIvlSNPVJr3mrf6T8eg/A9RZ=46dY3ToSPD=
wOK9AXprKG=46x=465bfPklET=46T2/wyZDMQg+32h2Au2fNqlAk1p20ndsJ2B3+iEmm08ARf=
HCVl7c8Z3RpgKoan60=3D=22 moz-do-not-send=3D=22true=22>https://lwn.net/Art=
icles/127296/</a>, <a class=3D=22moz-txt-link-freetext=22 href=3D=22https=
://can01.safelinks.protection.outlook.com/=3Furl=3Dhttps%3A%2=46%2=46sour=
ceforge.net%2=46projects%2=46microstate%2=46&amp;data=3D04%7C01%7C%7C7f7c=
d5aab2ca42e2e7e908d92d25e27f%7Cb07c069022b843668d8d7b845d088e18%7C1%7C0%7=
C637590463000497242%7CUnknown%7CTW=46pbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjo=
iV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=3DiMNi40Pl9hMmd1=
h7WrL=46P5jmHQ60mJl7zehhO8miJv4%3D&amp;reserved=3D0=22 originalsrc=3D=22h=
ttps://sourceforge.net/projects/microstate/=22 shash=3D=22EaATi/ge264upYi=
z/waLkJT=46hNT5ya2c8AUQqq5JoVbfuWm//GeW5inxQP8rUL=46qg7ezIt8agie84EjTPfOK=
SsmTlrVx7IVrdUIQdV3qSc0D2gNrGzTCYSEeYSd1AQhTNbTx3c8CCg2k4xiUArgx1w5vfMhPy=
myvv501lHYtKH4=3D=22 moz-do-not-send=3D=22true=22>https://sourceforge.net=
/projects/microstate/</a><br class=3D=22=22 /></li>
</ol>
<pre class=3D=22moz-signature=22 cols=3D=2272=22>-- =20
David Collier-Brown,         =7C Always do right. This will gratify
System Programmer and Author =7C some people and astonish the rest
<a class=3D=22moz-txt-link-abbreviated=22 href=3D=22mailto:davecb=40spamc=
op.net=22 moz-do-not-send=3D=22true=22>davecb=40spamcop.net</a>          =
 =7C                      -- Mark Twain
</pre></div>
</div>
</blockquote>
</div>
<br /></blockquote>
</div>
=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=
=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F<br />
Starlink mailing list<br />
Starlink=40lists.bufferbloat.net<br />
https://lists.bufferbloat.net/listinfo/starlink<br /></blockquote>
</div>
</body>
</html>

--60c3e500_519b500d_3067--