From: Dave Taht <davet@teklibre.net>
To: starlink@lists.bufferbloat.net
Cc: davecb.42@gmail.com
Subject: [Starlink] Fwd: Microstate Accounting and the Nyquist problem
Date: Wed, 9 Jun 2021 06:15:24 -0700 [thread overview]
Message-ID: <950B8EAF-90B9-41A6-951D-91821F591D41@teklibre.net> (raw)
In-Reply-To: <baa8ff7a-0bde-9d6e-5984-ef5fcbae5ccd@rogers.com>
[-- Attachment #1: Type: text/plain, Size: 2963 bytes --]
> Begin forwarded message:
>
> From: David Collier-Brown <davecb.42@gmail.com>
> Subject: Microstate Accounting and the Nyquist problem
> Date: June 9, 2021 at 4:44:14 AM PDT
> To: Dave Taht <davet@teklibre.net>
> Cc: Dave Collier-Brown <dave.collier-brown@indexexchange.com>
> Reply-To: davecb@spamcop.net
>
> A million years ago (roughly around Solaris 9), Sun was suffering from the same problems in measuring their dispatcher as you are with "sloshing".
>
> A CPU would be 100% busy in one microsecond, 10% busy in the next gazillion, and the average CPU utilization for our sample period would be maybe 10.1, if the sampler happened to sample right when the spike was happening.
>
> This was utterly useless for things like the fair-share scheduler, so it got fixed in Solaris 10, by having the dispatcher record the time a process (well, kernel thread) had spent in a state when the state changed.
>
> Initially "microstate accounting" could be toggled on and off, but the branch-around cost more time than always doing the calculation (as discovered by my mad friend Fred) and the kernel folks left it on. It's on to this day.
>
> In Simon Sundberg's talk, the opportunity to measure occurs every 1,000 packets, when a suitable timestamp is provided. While the eBPF program can look at every packet and do after-the-fact book-keeping in a map, that's only good if the phenomenon you're measuring is persistent enough that it's around for ~2,000 packets.
>
> I'm going to suggest that the right place to record the information you want is right where the event happens. Preferably in c code, as performance is easy to mess up, but perhaps with an eBPF mechanism to export it.
>
> In previous Solaris work, I reliably found that exporting kstats was a darn sight harder than collecting them, and in Eric's blog post[1] he notes that converting time is expensive and best done long after collecting, when someone wanted to read the data.
>
> There was an effort to do kstats in Linux[2], but it had supposedly poor performance, and actual trouble when the clock frequency changed.
>
> Is there, in your opinion, a "natural" place to capture state changes to get the data you want, and if so, is it common or similar enough between drivers to be worthy of attention?
>
> --dave
>
>
>
> References:
>
> Solaris: http://dtrace.org/blogs/eschrock/2004/10/13/microstate-accounting-in-solaris-10/ <http://dtrace.org/blogs/eschrock/2004/10/13/microstate-accounting-in-solaris-10/>
> A failing Linux effort: https://lwn.net/Articles/127296/ <https://lwn.net/Articles/127296/>,https://sourceforge.net/projects/microstate/ <https://sourceforge.net/projects/microstate/>
> --
> David Collier-Brown, | Always do right. This will gratify
> System Programmer and Author | some people and astonish the rest
> davecb@spamcop.net <mailto:davecb@spamcop.net> | -- Mark Twain
[-- Attachment #2: Type: text/html, Size: 6586 bytes --]
next parent reply other threads:[~2021-06-09 13:15 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <baa8ff7a-0bde-9d6e-5984-ef5fcbae5ccd@rogers.com>
2021-06-09 13:15 ` Dave Taht [this message]
2021-06-11 22:14 ` David Collier-Brown
2021-06-11 22:34 ` Mike Puchol
2021-06-11 22:39 ` [Starlink] " Dave Taht
2021-06-11 22:59 ` Nathan Owens
2021-06-11 23:00 ` Dave Taht
2021-06-11 23:09 ` Mike Puchol
2021-06-18 18:17 ` [Starlink] NTP implementations [was: Microstate Accounting and the Nyquist problem] Juliusz Chroboczek
2021-06-12 14:00 ` [Starlink] Microstate Accounting and the Nyquist problem Michael Richardson
2021-06-12 16:13 ` Mike Puchol
2021-06-12 16:42 ` Nathan Owens
2021-06-12 19:05 ` Mike Puchol
2021-06-12 19:20 ` Gary E. Miller
2021-06-12 1:40 ` [Starlink] Fwd: " Karl Auerbach
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://lists.bufferbloat.net/postorius/lists/starlink.lists.bufferbloat.net/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=950B8EAF-90B9-41A6-951D-91821F591D41@teklibre.net \
--to=davet@teklibre.net \
--cc=davecb.42@gmail.com \
--cc=starlink@lists.bufferbloat.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox