From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <davet@teklibre.net>
Received: from mail.taht.net (mail.taht.net [176.58.107.8])
 (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits))
 (No client certificate requested)
 by lists.bufferbloat.net (Postfix) with ESMTPS id 41FA13B29E
 for <starlink@lists.bufferbloat.net>; Wed,  9 Jun 2021 09:15:29 -0400 (EDT)
Received: from smtpclient.apple (unknown
 [IPv6:2600:380:455c:bb78:e8f6:553:65b3:eaef])
 (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
 (No client certificate requested)
 by mail.taht.net (Postfix) with ESMTPSA id 32B5A221D8;
 Wed,  9 Jun 2021 13:15:27 +0000 (UTC)
From: Dave Taht <davet@teklibre.net>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_1C120C17-6A42-4ED0-A960-3A459188E846"
Mime-Version: 1.0 (Mac OS X Mail 14.0 \(3654.80.0.2.43\))
Date: Wed, 9 Jun 2021 06:15:24 -0700
References: <baa8ff7a-0bde-9d6e-5984-ef5fcbae5ccd@rogers.com>
Cc: davecb.42@gmail.com
To: starlink@lists.bufferbloat.net
Message-Id: <950B8EAF-90B9-41A6-951D-91821F591D41@teklibre.net>
X-Mailer: Apple Mail (2.3654.80.0.2.43)
Subject: [Starlink] Fwd: Microstate Accounting and the Nyquist problem
X-BeenThere: starlink@lists.bufferbloat.net
X-Mailman-Version: 2.1.20
Precedence: list
List-Id: "Starlink has bufferbloat. Bad." <starlink.lists.bufferbloat.net>
List-Unsubscribe: <https://lists.bufferbloat.net/options/starlink>,
 <mailto:starlink-request@lists.bufferbloat.net?subject=unsubscribe>
List-Archive: <https://lists.bufferbloat.net/pipermail/starlink>
List-Post: <mailto:starlink@lists.bufferbloat.net>
List-Help: <mailto:starlink-request@lists.bufferbloat.net?subject=help>
List-Subscribe: <https://lists.bufferbloat.net/listinfo/starlink>,
 <mailto:starlink-request@lists.bufferbloat.net?subject=subscribe>
X-List-Received-Date: Wed, 09 Jun 2021 13:15:29 -0000


--Apple-Mail=_1C120C17-6A42-4ED0-A960-3A459188E846
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=us-ascii


> Begin forwarded message:
>=20
> From: David Collier-Brown <davecb.42@gmail.com>
> Subject: Microstate Accounting and the Nyquist problem
> Date: June 9, 2021 at 4:44:14 AM PDT
> To: Dave Taht <davet@teklibre.net>
> Cc: Dave Collier-Brown <dave.collier-brown@indexexchange.com>
> Reply-To: davecb@spamcop.net
>=20
> A million years ago (roughly around Solaris 9), Sun was suffering from =
the same problems in measuring their dispatcher as you are with =
"sloshing".
>=20
> A CPU would be 100% busy in one microsecond, 10% busy in the next =
gazillion, and the average CPU utilization for our sample period would =
be maybe 10.1, if the sampler happened to sample right when the spike =
was happening.
>=20
> This was utterly useless for things like the fair-share scheduler, so =
it got fixed in Solaris 10, by having the dispatcher record the time a =
process (well, kernel thread) had spent in a state when the state =
changed.
>=20
> Initially "microstate accounting" could be toggled on and off, but the =
branch-around cost more time than always doing the calculation (as =
discovered by my mad friend Fred) and the kernel folks left it on. It's =
on to this day.
>=20
> In Simon Sundberg's talk, the opportunity to measure occurs every =
1,000 packets, when a suitable timestamp is provided. While the eBPF =
program can look at every packet and do after-the-fact book-keeping in a =
map, that's only good if the phenomenon you're measuring is persistent =
enough that it's around for ~2,000 packets.
>=20
> I'm going to suggest that the right place to record the information =
you want is right where the event happens.  Preferably in c code, as =
performance is easy to mess up, but perhaps with an eBPF mechanism to =
export it.
>=20
> In previous Solaris work, I reliably found that exporting kstats was a =
darn sight harder than collecting them, and in Eric's blog post[1] he =
notes that converting time is expensive and best done long after =
collecting, when someone wanted to read the data.
>=20
> There was an effort to do kstats in Linux[2], but it had supposedly =
poor performance, and actual trouble when the clock frequency changed.
>=20
> Is there, in your opinion, a "natural" place to capture state changes =
to get the data you want, and if so, is it common or similar enough =
between drivers to be worthy of attention?
>=20
> --dave
>=20
>=20
>=20
> References:
>=20
> Solaris: =
http://dtrace.org/blogs/eschrock/2004/10/13/microstate-accounting-in-solar=
is-10/ =
<http://dtrace.org/blogs/eschrock/2004/10/13/microstate-accounting-in-sola=
ris-10/>=20
> A failing Linux effort: https://lwn.net/Articles/127296/ =
<https://lwn.net/Articles/127296/>,https://sourceforge.net/projects/micros=
tate/ <https://sourceforge.net/projects/microstate/>
> --=20
> David Collier-Brown,         | Always do right. This will gratify
> System Programmer and Author | some people and astonish the rest
> davecb@spamcop.net <mailto:davecb@spamcop.net>           |             =
         -- Mark Twain


--Apple-Mail=_1C120C17-6A42-4ED0-A960-3A459188E846
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=us-ascii

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html; =
charset=3Dus-ascii"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; line-break: after-white-space;" class=3D""><br =
class=3D""><div><br class=3D""><blockquote type=3D"cite" class=3D""><div =
class=3D"">Begin forwarded message:</div><br =
class=3D"Apple-interchange-newline"><div style=3D"margin-top: 0px; =
margin-right: 0px; margin-bottom: 0px; margin-left: 0px;" class=3D""><span=
 style=3D"font-family: -webkit-system-font, Helvetica Neue, Helvetica, =
sans-serif; color:rgba(0, 0, 0, 1.0);" class=3D""><b class=3D"">From: =
</b></span><span style=3D"font-family: -webkit-system-font, Helvetica =
Neue, Helvetica, sans-serif;" class=3D"">David Collier-Brown &lt;<a =
href=3D"mailto:davecb.42@gmail.com" =
class=3D"">davecb.42@gmail.com</a>&gt;<br class=3D""></span></div><div =
style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; =
margin-left: 0px;" class=3D""><span style=3D"font-family: =
-webkit-system-font, Helvetica Neue, Helvetica, sans-serif; =
color:rgba(0, 0, 0, 1.0);" class=3D""><b class=3D"">Subject: =
</b></span><span style=3D"font-family: -webkit-system-font, Helvetica =
Neue, Helvetica, sans-serif;" class=3D""><b class=3D"">Microstate =
Accounting and the Nyquist problem</b><br class=3D""></span></div><div =
style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; =
margin-left: 0px;" class=3D""><span style=3D"font-family: =
-webkit-system-font, Helvetica Neue, Helvetica, sans-serif; =
color:rgba(0, 0, 0, 1.0);" class=3D""><b class=3D"">Date: =
</b></span><span style=3D"font-family: -webkit-system-font, Helvetica =
Neue, Helvetica, sans-serif;" class=3D"">June 9, 2021 at 4:44:14 AM =
PDT<br class=3D""></span></div><div style=3D"margin-top: 0px; =
margin-right: 0px; margin-bottom: 0px; margin-left: 0px;" class=3D""><span=
 style=3D"font-family: -webkit-system-font, Helvetica Neue, Helvetica, =
sans-serif; color:rgba(0, 0, 0, 1.0);" class=3D""><b class=3D"">To: =
</b></span><span style=3D"font-family: -webkit-system-font, Helvetica =
Neue, Helvetica, sans-serif;" class=3D"">Dave Taht &lt;<a =
href=3D"mailto:davet@teklibre.net" =
class=3D"">davet@teklibre.net</a>&gt;<br class=3D""></span></div><div =
style=3D"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; =
margin-left: 0px;" class=3D""><span style=3D"font-family: =
-webkit-system-font, Helvetica Neue, Helvetica, sans-serif; =
color:rgba(0, 0, 0, 1.0);" class=3D""><b class=3D"">Cc: </b></span><span =
style=3D"font-family: -webkit-system-font, Helvetica Neue, Helvetica, =
sans-serif;" class=3D"">Dave Collier-Brown &lt;<a =
href=3D"mailto:dave.collier-brown@indexexchange.com" =
class=3D"">dave.collier-brown@indexexchange.com</a>&gt;<br =
class=3D""></span></div><div style=3D"margin-top: 0px; margin-right: =
0px; margin-bottom: 0px; margin-left: 0px;" class=3D""><span =
style=3D"font-family: -webkit-system-font, Helvetica Neue, Helvetica, =
sans-serif; color:rgba(0, 0, 0, 1.0);" class=3D""><b class=3D"">Reply-To: =
</b></span><span style=3D"font-family: -webkit-system-font, Helvetica =
Neue, Helvetica, sans-serif;" class=3D""><a =
href=3D"mailto:davecb@spamcop.net" class=3D"">davecb@spamcop.net</a><br =
class=3D""></span></div><br class=3D""><div class=3D"">
 =20
    <meta http-equiv=3D"Content-Type" content=3D"text/html; =
charset=3DUTF-8" class=3D"">
 =20
  <div class=3D""><p class=3D"">A million years ago (roughly around =
Solaris 9), Sun was suffering
      from the same problems in measuring their dispatcher as you are
      with "sloshing".</p><p class=3D"">A CPU would be 100% busy in one =
microsecond, 10% busy in the next
      gazillion, and the average CPU utilization for our sample period
      would be <i class=3D"">maybe</i> 10.1, if the sampler happened to =
sample
      right when the spike was happening.</p><p class=3D"">This was =
utterly useless for things like the fair-share
      scheduler, so it got fixed in Solaris 10, by having the dispatcher
      record the time a process (well, kernel thread) had spent in a
      state when the state changed.<br class=3D"">
    </p><p class=3D"">Initially "microstate accounting" could be toggled =
on and off,
      but the branch-around cost more time than always doing the
      calculation (as discovered by my mad friend Fred) and the kernel
      folks left it on. It's on to this day.</p><p class=3D"">In Simon =
Sundberg's talk, the opportunity to measure occurs every
      1,000 packets, when a suitable timestamp is provided. While the
      eBPF program can look at every packet and do after-the-fact
      book-keeping in a map, that's only good if the phenomenon you're
      measuring is persistent enough that it's around for ~2,000
      packets.</p><p class=3D"">I'm going to suggest that the right =
place to record the
      information you want is right where the event happens.&nbsp; =
Preferably
      in c code, as performance is easy to mess up, but perhaps with an
      eBPF mechanism to export it.</p><p class=3D"">In previous Solaris =
work, I reliably found that exporting kstats
      was a darn sight harder than collecting them, and in Eric's blog
      post[1] he notes that converting time is expensive and best done
      long after collecting, when someone wanted to read the data.</p><p =
class=3D"">There was an effort to do kstats in Linux[2], but it had
      supposedly poor performance, and actual trouble when the clock
      frequency changed.<br class=3D"">
    </p><p class=3D"">Is there, in your opinion, a "natural" place to =
capture state
      changes to get the data you want, and if so, is it common or
      similar enough between drivers to be worthy of attention?</p><p =
class=3D"">--dave<br class=3D"">
    </p><p class=3D""><br class=3D"">
    </p><p class=3D"">References:</p>
    <ol class=3D"">
      <li class=3D"">Solaris:
<a class=3D"moz-txt-link-freetext" =
href=3D"http://dtrace.org/blogs/eschrock/2004/10/13/microstate-accounting-=
in-solaris-10/">http://dtrace.org/blogs/eschrock/2004/10/13/microstate-acc=
ounting-in-solaris-10/</a>
        <br class=3D"">
      </li>
      <li class=3D"">A failing Linux effort: <a =
class=3D"moz-txt-link-freetext" =
href=3D"https://lwn.net/Articles/127296/">https://lwn.net/Articles/127296/=
</a>,
        <a class=3D"moz-txt-link-freetext" =
href=3D"https://sourceforge.net/projects/microstate/">https://sourceforge.=
net/projects/microstate/</a><br class=3D"">
      </li>
    </ol>
    <pre class=3D"moz-signature" cols=3D"72">--=20
David Collier-Brown,         | Always do right. This will gratify
System Programmer and Author | some people and astonish the rest
<a class=3D"moz-txt-link-abbreviated" =
href=3D"mailto:davecb@spamcop.net">davecb@spamcop.net</a>           |    =
                  -- Mark Twain
</pre>
  </div>

</div></blockquote></div><br class=3D""></body></html>=

--Apple-Mail=_1C120C17-6A42-4ED0-A960-3A459188E846--