From mboxrd@z Thu Jan 1 00:00:00 1970
From: "Dick Roy"
To: 'Network Neutrality is back! Let´s make the technical aspects heard this time!'
Date: Mon, 16 Oct 2023 10:01:31 -0700
In-Reply-To: <4c44a9ef4c4b14a06403e553e633717d@rjmcmahon.com>
Subject: Re: [NNagain] transit and peering costs projections
List-Id: Network Neutrality is back! Let´s make the technical aspects heard this time!

Just an observation: ANY type of congestion control that changes application behavior in response to congestion, or predicted congestion (ECN), begs the question "How does throttling of the application information exchange rate (aka behavior) affect the user experience, and will the user tolerate it?"

Given any (complex and packet-switched) network topology of interconnected nodes and links, each with possibly different capacity and characteristics, such as the internet today, IMO the two fundamental questions are:

1) How can a given network be operated/configured so as to maximize aggregate throughput (i.e., achieve its theoretical capacity), and

2) What things in the network need to change to increase the throughput (aka the parameters in the network with the largest Lagrange multipliers associated with them)?

I am not an expert in this field; however, it seems to me that answers to these questions would be useful, assuming they are not yet available!

Cheers,

RR

-----Original Message-----
From: Nnagain [mailto:nnagain-bounces@lists.bufferbloat.net] On Behalf Of rjmcmahon via Nnagain
Sent: Sunday, October 15, 2023 1:39 PM
To: Network Neutrality is back! Let´s make the technical aspects heard this time!
Cc: rjmcmahon
Subject: Re: [NNagain] transit and peering costs projections

Hi Jack,

Thanks again for sharing. It's very interesting to me.

Today, the networks are shifting from capacity constrained to latency constrained, as can be seen in the IX discussions about how the speed of light over fiber is too slow even between Houston & Dallas.

The mitigations against standing queues (which cause bloat today) are:

o) Shrink the e2e bottleneck queue so it will drop packets in a flow and TCP will respond to that "signal"

o) Use some form of ECN marking where the network forwarding plane ultimately informs the TCP source state machine so it can slow down or pace effectively. This can be an earlier feedback signal and, if done well, can inform the sources to avoid bottleneck queuing. There are a couple of approaches with ECN. Comcast is trialing L4S now, which seems interesting to me as a WiFi test & measurement engineer. The jury is still out on this and measurements are needed.

o) Mitigate source-side bloat via TCP_NOTSENT_LOWAT (a small sketch follows below)

The QoS priority approach to congestion is orthogonal, in my judgment, as it's typically not supported e2e; many networks will bleach DSCP markings. And it's really too late, in my judgment.

Also, on clock sync, yes, your generation did us both a service and a disservice by getting rid of the PSTN TDM clock ;) So IP networking devices kinda ignored clock sync, which makes e2e one-way delay (OWD) measurements impossible. Thankfully, the GPS atomic clock is now available mostly everywhere, and many devices use TCXO oscillators, so it's possible to get clock sync and use oscillators that can minimize drift. I pay $14 for an RPi4 GPS chip with pulse per second as an example.

It seems silly to me that clocks aren't synced to the GPS atomic clock, even if by a proxy, and even if only for measurement and monitoring.

Note: As Richard Roy will point out, there really is no such thing as synchronized clocks across geographies per general relativity - so those syncing clocks need to keep those effects in mind. I limited the iperf 2 timestamps to microsecond precision in hopes of avoiding those issues.
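
Going back to the TCP_NOTSENT_LOWAT mitigation above, here is a minimal sketch of how an application might set it, assuming Linux (or another platform that defines the option). The 128 KB threshold is only an illustrative value, and this is a sketch, not code from iperf 2:

#include <stdio.h>
#include <netinet/in.h>
#include <netinet/tcp.h>
#include <sys/socket.h>

#ifndef TCP_NOTSENT_LOWAT
#define TCP_NOTSENT_LOWAT 25  /* value from the Linux uapi headers; fallback for older toolchains */
#endif

/* Cap how much unsent data the kernel will queue for a TCP socket, so
 * poll()/select() report not-writable before a deep source-side queue
 * builds up. Returns 0 on success, -1 on error. */
int limit_unsent(int sock_fd, int lowat_bytes)
{
    if (setsockopt(sock_fd, IPPROTO_TCP, TCP_NOTSENT_LOWAT,
                   &lowat_bytes, sizeof(lowat_bytes)) < 0) {
        perror("setsockopt(TCP_NOTSENT_LOWAT)");
        return -1;
    }
    return 0;
}

/* Hypothetical usage: limit_unsent(fd, 128 * 1024); then write()/poll() as usual. */

The idea is that the backlog then sits in the application, which can adapt, rather than in a deep socket buffer.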
Note: With WiFi, a packet drop can occur because of an intermittent RF channel condition. TCP can't tell the difference between an RF drop and a congested-queue drop. That's another reason ECN markings from network devices may be better than dropped packets.

Note: I've added some iperf 2 test support around pacing, as that seems to be the direction the industry is heading now that networks are less and less capacity strained and user quality of experience is being driven by tail latencies. One can also test with the Prague CCA for the L4S scenarios. (This is a fun project, https://www.l4sgear.com/, and fairly low cost.) A sketch of the underlying socket-level pacing follows the flag descriptions below.

--fq-rate n[kmgKMG]
Set a rate to be used with fair-queuing based socket-level pacing, in bytes or bits per second. Only available on platforms supporting the SO_MAX_PACING_RATE socket option. (Note: Here the suffixes indicate bytes/sec or bits/sec per use of uppercase or lowercase, respectively.)

--fq-rate-step n[kmgKMG]
Set a step of rate to be used with fair-queuing based socket-level pacing, in bytes or bits per second. The step occurs every fq-rate-step-interval (defaults to one second).

--fq-rate-step-interval n
Time in seconds before stepping the fq-rate.

Bob

PS. Iperf 2 man page: https://iperf2.sourceforge.io/iperf-manpage.html
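
For reference, a minimal sketch of the socket-level pacing that --fq-rate relies on, assuming Linux with the SO_MAX_PACING_RATE socket option (the fq qdisc, or TCP's internal pacing, does the actual packet spacing). The rate value is only illustrative, and this is a sketch, not iperf 2 source:

#include <stdio.h>
#include <sys/socket.h>

#ifndef SO_MAX_PACING_RATE
#define SO_MAX_PACING_RATE 47  /* value from the Linux asm-generic headers; fallback for older toolchains */
#endif

/* Ask the kernel to pace this socket's transmissions at no more than
 * bytes_per_sec. Returns 0 on success, -1 on error. */
int set_pacing_rate(int sock_fd, unsigned int bytes_per_sec)
{
    if (setsockopt(sock_fd, SOL_SOCKET, SO_MAX_PACING_RATE,
                   &bytes_per_sec, sizeof(bytes_per_sec)) < 0) {
        perror("setsockopt(SO_MAX_PACING_RATE)");
        return -1;
    }
    return 0;
}

/* Hypothetical usage: set_pacing_rate(fd, 12500000);  roughly 100 Mbit/s. */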
> The "VGV User" (Voice, Gaming, Videoconferencing) cares a lot about
> latency. It's not just "rewarding" to have lower latencies; high
> latencies may make VGV unusable. Average (or "typical") latency as the
> FCC label proposes isn't a good metric to judge usability. A path which
> has high variance in latency can be unusable even if the average is
> quite low. Having your voice or video or gameplay "break up" every
> minute or so when latency spikes to 500 msec makes the "user
> experience" intolerable.
>
> A few years ago, I ran some simple "ping" tests to help a friend who
> was trying to use a gaming app. My data was only for one specific path,
> so it's anecdotal. What I saw was surprising - zero data loss, every
> datagram was delivered, but occasionally a datagram would take up to 30
> seconds to arrive. I didn't have the ability to poke around inside, but
> I suspected it was an experience of "bufferbloat", enabled by the
> dramatic drop in price of memory over the decades.
>
> It's been a long time since I was involved in operating any part of the
> Internet, so I don't know much about the inner workings today.
> Apologies for my ignorance....
>
> There was a scenario in the early days of the Internet for which we
> struggled to find a technical solution. Imagine some node in the bowels
> of the network, with 3 connected "circuits" to some other nodes. On two
> of those inputs, traffic is arriving to be forwarded out the third
> circuit. The incoming flows are significantly more than the outgoing
> path can accept.
>
> What happens? How is "backpressure" generated so that the incoming
> flows are reduced to the point that the outgoing circuit can handle the
> traffic?
>
> About 45 years ago, while we were defining TCPV4, we struggled with
> this issue, but didn't find any consensus solutions. So "placeholder"
> mechanisms were defined in TCPV4, to be replaced as research continued
> and found a good solution.
>
> In that "placeholder" scheme, the "Source Quench" (SQ) IP message was
> defined; it was to be sent by a switching node back toward the sender
> of any datagram that had to be discarded because there wasn't any place
> to put it.
>
> In addition, the TOS (Type Of Service) and TTL (Time To Live) fields
> were defined in IP.
>
> TOS would allow the sender to distinguish datagrams based on their
> needs. For example, we thought "Interactive" service might be needed
> for VGV traffic, where timeliness of delivery was most important.
> "Bulk" service might be useful for activities like file transfers,
> backups, et al. "Normal" service might now mean activities like using
> the Web.
>
> The TTL field was an attempt to inform each switching node about the
> "expiration date" for a datagram. If a node somehow knew that a
> particular datagram was unlikely to reach its destination in time to be
> useful (such as a video datagram for a frame that has already been
> displayed), the node could, and should, discard that datagram to free
> up resources for useful traffic. Sadly, we had no mechanisms for
> measuring delay, either in transit or in queuing, so TTL was defined in
> terms of "hops", which is not an accurate proxy for time. But it's all
> we had.
>
> Part of the complexity was that the "flow control" mechanism of the
> Internet had put much of the mechanism in the users' computers' TCP
> implementations, rather than the switches, which handle only IP.
> Without mechanisms in the users' computers, all a switch could do is
> order more circuits, and add more memory to the switches for queuing.
> Perhaps that led to "bufferbloat".
>
> So TOS, SQ, and TTL were all placeholders, for some mechanism in a
> future release that would introduce a "real" form of backpressure and
> the ability to handle different types of traffic. Meanwhile, these
> rudimentary mechanisms would provide some flow control. Hopefully the
> users' computers sending the flows would respond to the SQ
> backpressure, and switches would prioritize traffic using the TTL and
> TOS information.
>
> But, being way out of touch, I don't know what actually happens today.
> Perhaps the current operators and current government watchers can
> answer?
>
> git clone https://rjmcmahon@git.code.sf.net/p/iperf2/code iperf2-code
>
> 1/ How do current switches exert backpressure to reduce competing
> traffic flows? Do they still send SQs?
>
> 2/ How do the current and proposed government regulations treat the
> different needs of different types of traffic, e.g., "Bulk" versus
> "Interactive" versus "Normal"? Are Internet carriers permitted to treat
> traffic types differently? Are they permitted to charge different
> amounts for different types of service?
>
> Jack Haverty
>
> On 10/15/23 09:45, Dave Taht via Nnagain wrote:
>> For starters I would like to apologize for cc-ing both nanog and my
>> new nn list. (I will add sender filters)
>>
>> A bit more below.
>>
>> On Sun, Oct 15, 2023 at 9:32 AM Tom Beecher wrote:
>>>> So for now, we'll keep paying for transit to get to the others
>>>> (since it's about as much as transporting IXP from Dallas), and
>>>> hoping someone at Google finally sees Houston as more than a third
>>>> rate city hanging off of Dallas. Or… someone finally brings a
>>>> worthwhile IX to Houston that gets us more than peering to Kansas
>>>> City. Yeah, I think the former is more likely. 😊
>>>
>>> There is often a chicken/egg scenario here with the economics. As an
>>> eyeball network, your costs to build out and connect to Dallas are
>>> greater than your transit cost, so you do that. Totally fair.
>>>
>>> However, think about it from the content side. Say I want to build
>>> into Houston. I have to put routers in, and a bunch of cache servers,
>>> so I have capital outlay, plus opex for space, power,
>>> IX/backhaul/transit costs. That's not cheap, so there are a lot of
>>> calculations that go into it. Is there enough total eyeball traffic
>>> there to make it worth it? Is saving 8-10 ms enough of a performance
>>> boost to justify the spend? What are the long term trends in that
>>> market? These answers are of course different for a company running
>>> their own CDN vs the commercial CDNs.
>>>
>>> I don't work for Google and obviously don't speak for them, but I
>>> would suspect that they're happy to eat an 8-10 ms performance hit to
>>> serve from Dallas, versus the amount of capital outlay to build out
>>> there right now.
>>
>> The three forms of traffic I care most about are voip, gaming, and
>> videoconferencing, which are rewarding to have at lower latencies.
>> When I was a kid, we had switched phone networks, and while the sound
>> quality was poorer than today, the voice latency cross-town was just
>> like "being there". Nowadays we see 500+ ms latencies for this kind of
>> traffic.
>>
>> As to how to make calls across town work that well again, cost-wise, I
>> do not know, but the volume of traffic that would be better served by
>> these interconnects is quite low relative to the overall gains in
>> lower latency experiences for them.
>>
>>
>>
>>> On Sat, Oct 14, 2023 at 11:47 PM Tim Burke wrote:
>>>> I would say that a 1Gbit IP transit in a carrier neutral DC can be
>>>> had for a good bit less than $900 on the wholesale market.
>>>>
>>>> Sadly, IXPs are seemingly turning into a pay to play game, with
>>>> rates almost costing as much as transit in many cases after you
>>>> factor in loop costs.
>>>>
>>>> For example, in the Houston market (one of the largest and fastest
>>>> growing regions in the US!), we do not have a major IX, so to get up
>>>> to Dallas it's several thousand for a 100G wave, plus several
>>>> thousand for a 100G port on one of those major IXes. Or, a better
>>>> option, we can get a 100G flat internet transit for just a little
>>>> bit more.
>>>>
>>>> Fortunately, for us as an eyeball network, there are a good number
>>>> of major content networks that are allowing for private peering in
>>>> markets like Houston for just the cost of a cross connect and a QSFP
>>>> if you're in the right DC, with Google and some others being the
>>>> outliers.
>>>>
>>>> So for now, we'll keep paying for transit to get to the others
>>>> (since it's about as much as transporting IXP from Dallas), and
>>>> hoping someone at Google finally sees Houston as more than a third
>>>> rate city hanging off of Dallas. Or… someone finally brings a
>>>> worthwhile IX to Houston that gets us more than peering to Kansas
>>>> City. Yeah, I think the former is more likely. 😊
>>>>
>>>> See y'all in San Diego this week,
>>>> Tim
>>>>
>>>> On Oct 14, 2023, at 18:04, Dave Taht wrote:
>>>>> This set of trendlines was very interesting. Unfortunately the data
>>>>> stops in 2015. Does anyone have more recent data?
>>>>>
>>>>> https://drpeering.net/white-papers/Internet-Transit-Pricing-Historical-And-Projected.php
>>>>>
>>>>> I believe a gbit circuit that an ISP can resell still runs at about
>>>>> $900 - $1.4k (?) in the USA? How about elsewhere?
>>>>>
>>>>> ...
>>>>>
>>>>> I am under the impression that many IXPs remain very successful,
>>>>> states without them suffer, and I also find the concept of doing
>>>>> micro IXPs at the city level appealing, and now achievable with
>>>>> cheap gear. Finer-grained cross connects between telco and ISP and
>>>>> IXP would lower latencies across town quite hugely...
>>>>>
>>>>> PS I hear ARIN is planning on dropping the price for, and bundling,
>>>>> 3 BGP AS numbers at a time, as of the end of this year, also.
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Oct 30: https://netdevconf.info/0x17/news/the-maestro-and-the-music-bof.html
>>>>> Dave Täht CSO, LibreQos

_______________________________________________
Nnagain mailing list
Nnagain@lists.bufferbloat.net
https://lists.bufferbloat.net/listinfo/nnagain