From: Matt Mathis <mattmathis@google.com>
To: Nick Buraglio <buraglio@forwardingplane.net>
Cc: Dave Taht <davet@teklibre.net>, starlink@lists.bufferbloat.net
Subject: Re: [Starlink] plotting all the data
Date: Thu, 17 Jun 2021 08:49:22 -0700 [thread overview]
Message-ID: <CAH56bmD-roOX6tedNLyb_J0ATp4kPshEG6unJkFeFLK8wVPOqQ@mail.gmail.com> (raw)
In-Reply-To: <CAGB08_cxray74yuBoOArY8Hrx=ScLC8kXgpKPwq88FGpwe3Brw@mail.gmail.com>
[-- Attachment #1: Type: text/plain, Size: 3366 bytes --]
Some time recently I read a casual paper (on Medium I think) that made the
point that deep diving into outliers and understanding them has led to a
half dozen Nobel prizes, because they lead to discoveries of phenomena that
nobody else had even noticed. See for instinance the Holmdel Horn
https://en.wikipedia.org/wiki/Holmdel_Horn_Antenna
To keep sane, I tend to keep outliers and clip them as last as possible,
e.g. by choice of graph axis. This way I have the opportunity to notice
otherwise hidden patterns.
In mlab data we sometimes see outliers that suggest "out of bounds" data
rates. e.g. a repeated test that clearly has a max rate of 50 Mb/s or
something, and then every so often a one test at 200 Mb/s or higher. My
assumption is that these are from software managed shapers that
occasionally fail to properly load their configurations. (I admit to
not having looked hard enough to prove this hypnosis).
Thanks,
--MM--
The best way to predict the future is to create it. - Alan Kay
We must not tolerate intolerance;
however our response must be carefully measured:
too strong would be hypocritical and risks spiraling out of
control;
too weak risks being mistaken for tacit approval.
On Thu, Jun 17, 2021 at 7:25 AM Nick Buraglio <buraglio@forwardingplane.net>
wrote:
> This is much more common in the high performance computing and networking
> space (i.e. perfsonar, TWAMP, and OWAMP). I have also been pushing "gather
> and store all the data" for ....since I was an engineer working on the
> Teragrid (which is where I first saw Matt's MTU talk around 2002 or 03,
> BTW). =)
> High fidelity plots of everything that can be gathered is laborious to
> curate but is invaluable for so many reasons. Now we just need a way to
> make it happen everywhere for everyone in a way that's easy.
>
> nb
>
>
> On Thu, Jun 17, 2021 at 8:57 AM Dave Taht <davet@teklibre.net> wrote:
>
>> Capturing and plotting *all* the data is often revealing.
>>
>> Sometimes plotting the data you are discarding (for what seems like sane
>> reasons) is quite revealing. Saw this on slashdot this morning, it’s
>> good...
>>
>>
>> https://www.newyorker.com/magazine/2021/06/21/when-graphs-are-a-matter-of-life-and-death
>>
>> In the bufferbloat effort I’ve fought time and time again for folk to
>> stop throwing out data above the 95 percentile, and at the very least plot
>> everything they threw out to find patterns...
>>
>> dslreports’ graphing tools, for example, throws out a ton of “outliers" …
>> and the only reason why there is no data past 4 sec here, is that the test
>> doesn’t run long enough.
>>
>> http://www.dslreports.com/speedtest/results/bufferbloat?up=1
>>
>> (been trying to get ahold of someone over there to buy their raw data for
>> years now. They have the biggest - 8 years worth - collection)
>>
>> mlabs has a similar data reduction issue that they haven’t got around to
>> fixing.
>>
>> And more recently we encountered a smoothing problem in wireshark that
>> made a halt in packet processing look more like a normal tcp cwnd cut….
>>
>> _______________________________________________
>> Starlink mailing list
>> Starlink@lists.bufferbloat.net
>> https://lists.bufferbloat.net/listinfo/starlink
>>
>
[-- Attachment #2: Type: text/html, Size: 4735 bytes --]
next prev parent reply other threads:[~2021-06-17 15:49 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-06-17 13:56 Dave Taht
2021-06-17 14:24 ` Nick Buraglio
2021-06-17 15:28 ` Dave Taht
2021-06-17 15:49 ` Matt Mathis [this message]
2021-06-17 20:18 ` George Burdell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://lists.bufferbloat.net/postorius/lists/starlink.lists.bufferbloat.net/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAH56bmD-roOX6tedNLyb_J0ATp4kPshEG6unJkFeFLK8wVPOqQ@mail.gmail.com \
--to=mattmathis@google.com \
--cc=buraglio@forwardingplane.net \
--cc=davet@teklibre.net \
--cc=starlink@lists.bufferbloat.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox