From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <dpreed@reed.com>
Received: from smtp97.iad3a.emailsrvr.com (smtp97.iad3a.emailsrvr.com
	[173.203.187.97])
	(using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits))
	(Client did not present a certificate)
	by huchra.bufferbloat.net (Postfix) with ESMTPS id 7320821F2A5
	for <cerowrt-devel@lists.bufferbloat.net>;
	Sat, 31 Jan 2015 13:51:07 -0800 (PST)
Received: from smtp21.relay.iad3a.emailsrvr.com (localhost.localdomain
	[127.0.0.1])
	by smtp21.relay.iad3a.emailsrvr.com (SMTP Server) with ESMTP id
	B87371803A6; Sat, 31 Jan 2015 16:51:05 -0500 (EST)
Received: from app44.wa-webapps.iad3a (relay-webapps.rsapps.net
	[172.27.255.140])
	by smtp21.relay.iad3a.emailsrvr.com (SMTP Server) with ESMTP id
	48D98180399; Sat, 31 Jan 2015 16:51:05 -0500 (EST)
X-Sender-Id: dpreed@reed.com
Received: from app44.wa-webapps.iad3a (relay-webapps.rsapps.net
	[172.27.255.140]) by 0.0.0.0:25 (trex/5.4.2);
	Sat, 31 Jan 2015 21:51:05 GMT
Received: from reed.com (localhost.localdomain [127.0.0.1])
	by app44.wa-webapps.iad3a (Postfix) with ESMTP id 33D6018004A;
	Sat, 31 Jan 2015 16:51:05 -0500 (EST)
Received: by apps.rackspace.com
	(Authenticated sender: dpreed@reed.com, from: dpreed@reed.com) 
	with HTTP; Sat, 31 Jan 2015 16:51:05 -0500 (EST)
Date: Sat, 31 Jan 2015 16:51:05 -0500 (EST)
From: dpreed@reed.com
To: "Dave Taht" <dave.taht@gmail.com>
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_20150131165105000000_31330"
Importance: Normal
X-Priority: 3 (Normal)
X-Type: html
In-Reply-To: <CAA93jw7b0E9jjQYXrEPzjLLC9j8xNC0TFYXpWVtgFameJaNBdw@mail.gmail.com>
References: <CA+BoTQkVu23P3EOmY_Q3E1GJnWsyF==Pawz4iPOS_Bq5dvfO5Q@mail.gmail.com>
	<1422537297.21689.15.camel@edumazet-glaptop2.roam.corp.google.com> 
	<54CB5D08.2070906@broadcom.com> 
	<1422623975.21689.77.camel@edumazet-glaptop2.roam.corp.google.com> 
	<54CB8B69.1070807@broadcom.com> 
	<CAA93jw5fqhz0Hiw74L2GXgtZ9JsMg+NtYydKxKzGDrvQcZn4hA@mail.gmail.com> 
	<CAA93jw7b0E9jjQYXrEPzjLLC9j8xNC0TFYXpWVtgFameJaNBdw@mail.gmail.com>
X-Auth-ID: dpreed@reed.com
Message-ID: <1422741065.199624134@apps.rackspace.com>
X-Mailer: webmail/11.3.11-RC
Cc: Andrew McGregor <andrewmcgr@gmail.com>,
	Jesper Dangaard Brouer <jbrouer@redhat.com>,
	Matt Mathis <mattmathis@google.com>, "cerowrt-devel@lists.bufferbloat.net"
	<cerowrt-devel@lists.bufferbloat.net>,
	Jonathan Morton <chromatix99@gmail.com>, Tim Shepard <shep@alum.mit.edu>,
	Avery Pennarun <apenwarr@google.com>
Subject: Re: [Cerowrt-devel] Fwd: Throughput regression with `tcp: refine
	TSO autosizing`
X-BeenThere: cerowrt-devel@lists.bufferbloat.net
X-Mailman-Version: 2.1.13
Precedence: list
List-Id: Development issues regarding the cerowrt test router project
	<cerowrt-devel.lists.bufferbloat.net>
List-Unsubscribe: <https://lists.bufferbloat.net/options/cerowrt-devel>,
	<mailto:cerowrt-devel-request@lists.bufferbloat.net?subject=unsubscribe>
List-Archive: <https://lists.bufferbloat.net/pipermail/cerowrt-devel>
List-Post: <mailto:cerowrt-devel@lists.bufferbloat.net>
List-Help: <mailto:cerowrt-devel-request@lists.bufferbloat.net?subject=help>
List-Subscribe: <https://lists.bufferbloat.net/listinfo/cerowrt-devel>,
	<mailto:cerowrt-devel-request@lists.bufferbloat.net?subject=subscribe>
X-List-Received-Date: Sat, 31 Jan 2015 21:51:36 -0000

------=_20150131165105000000_31330
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

=0AI think we need to create an Internet focused 802.11 working group that =
would be to the "OS wireless designers and IEEE 802.11 standards groups" as=
 the WHATML group was to W3C.=0A =0AW3C was clueless about the real world a=
t the point WHATML was created.  And WHATML was a "revenge of the real" aga=
inst W3C - advancing a wide variety of important practical innovations rath=
er than attending endless standards meetings with people who were not focus=
ed on solving actually important problems.=0A =0AIt took a bunch of work to=
 get WHATML going, and it offended W3C, who became unhelpful.  But the appr=
oach actually worked - we now have a Web that really uses browser-side expr=
essivity and that would never have happened if W3C were left to its own dev=
ices.=0A =0AThe WiFi consortium was an attempt to wrest control of pragmati=
c direction from 802.11 and the proprietary-divergence folks at Qualcomm, B=
roadcom, Cisco, etc.  But it failed, because it became thieves on a raft, m=
ore focused on picking each others' pockets than on actually addressing the=
 big issues.=0A =0AJim has seen this play out in the Linux community around=
 X.  Though there are lots of interests who would benefit by moving the eng=
ineering ball forward, everyone resists action because it means giving up t=
he chance at dominance, and the central group is far too weak to do anythin=
g beyond adjudicating the worst battles.=0A =0AWhen I say "we" I definitely=
 include myself (though my time is limited due to other commitments and the=
 need to support my family), but I would only play with people who actually=
 are committed to making stuff happen - which includes raising hell with th=
e vendors if need be, but also effective engineering steps that can achieve=
 quick adoption.=0A =0ASadly, and I think it is manageable at the moment, t=
here are moves out there being made to get the FCC to "protect" WiFi from "=
interference".  The current one was Marriott, who requested the FCC for a r=
ule to make it legal to disrupt and block use of WiFi in people's rooms in =
their hotels, except with their access points.  This also needs some techni=
cal defense.  I believe any issues with WiFi performance in actual Marriott=
 hotels are due to bufferbloat in their hotel-wide systems, just as the iss=
ues with GoGo are the same.  But it's possible that queueing problems in th=
eir own WiFi gear are bad as well.=0A =0AI mention this because it is relat=
ed, and to the layperson, or non-radio-knowledgeable executive, indistingui=
shable.  It will take away the incentive to actually fix the 802.11 impleme=
ntations to be better performing, making the problem seem to be a "manageme=
nt" issue that can be solved by making WiFi less interoperable and less fle=
xible by rules, rather than by engineering.=0A =0AHowever, solving the prob=
lems of hotspot networks and hotel networks are definitely "real world" iss=
ues, and quite along the same lines you mention, Dave.  FQ is almost certai=
nly a big deal both in WiFi and in the distribution networks behind WiFi. C=
o-existence is also a big deal (RTS/CTS-like mechanisms can go a long way t=
o remediate hidden-terminal disruption of the basic protocols). Roaming and=
 scaling need work as well.=0A =0AIt would even be a good thing to invent p=
ragmatic ways to provide "low rate" subnets and "high rate" subnets that ca=
n coexist, so that compatibility with ancient "b" networks need not be main=
tained on all nets, at great cost - just send beacons at a high rate, so th=
at the "b" NICs can't see them.... but you need pragmatic stack implementat=
ions.=0A =0ABut the engineering is not the only challenge. The other challe=
nge is to take the initiative and get stuff deployed.  In the case of buffe=
rbloat, the grade currently is a "D" for deployments, maybe a "D-".  Beauti=
ful technical work, but the economic/business/political side of things has =
been poor.  Look at how slow IETF has been to achieve anything (the perfect=
 is truly the enemy of the good, and Dave Clark's "rough consensus and work=
ing code" has been replaced by technocratic malaise, and what appears to me=
 to be a class of people who love traveling the world to a floating cocktai=
l party without getting anything important done).=0A =0AThe problem with co=
mmunications is that you can't just ship a product with a new "feature", be=
cause the innovation only works if widely adopted.  Since there is no "Linu=
x Desktop" (and Linus hates the idea, to a large extent) Linux can't be the=
 sole carrier of the idea.  You pretty much need iOS and Android both to bu=
y in or to provide a path for easy third-party upgrades.  How do you do tha=
t?  Well, that's where the WHATML-type approach is necessary.=0A =0AI don't=
 know if this can be achieved, and there are lots of details to be worked o=
ut.  But I'll play.=0A =0A =0A=0A=0AOn Saturday, January 31, 2015 4:05pm, "=
Dave Taht" <dave.taht@gmail.com> said:=0A=0A=0A=0AI would like to have some=
how assembled all the focused resources to make a go at fixing wifi, or at =
least having a f2f with a bunch of people in the late march timeframe. This=
 message of mine to linux-wireless bounced for some reason and I am off to =
log out for 10 days, so...=0Asee relevant netdev thread also for ore detail=
s.=0A=0A=0A---------- Forwarded message ----------=0AFrom: Dave Taht <[ dav=
e.taht@gmail.com ]( mailto:dave.taht@gmail.com )>=0ADate: Sat, Jan 31, 2015=
 at 12:29 PM=0ASubject: Re: Throughput regression with `tcp: refine TSO aut=
osizing`=0ATo: Arend van Spriel <[ arend@broadcom.com ]( mailto:arend@broad=
com.com )>=0ACc: linux-wireless <[ linux-wireless@vger.kernel.org ]( mailto=
:linux-wireless@vger.kernel.org )>, Michal Kazior <[ michal.kazior@tieto.co=
m ]( mailto:michal.kazior@tieto.com )>, Eyal Perry <[ eyalpe@dev.mellanox.c=
o.il ]( mailto:eyalpe@dev.mellanox.co.il )>, Network Development <[ netdev@=
vger.kernel.org ]( mailto:netdev@vger.kernel.org )>, Eric Dumazet <[ eric.d=
umazet@gmail.com ]( mailto:eric.dumazet@gmail.com )>=0A=0A=0A=0A=0AThe wifi=
 industry as a whole has vastly bigger problems than achieving 1500Mbits in=
 a faraday cage on a single flow.=0AI encourage you to try tests in netperf=
-wrapper that explicitly test for latency under load, and in particular, th=
e RTT_FAIR tests against 4 or more stations on a single wifi AP. You will f=
ind the results very depressing. Similarly, on your previous test series, a=
 latency figure would have been nice to have. I just did a talk at nznog, w=
here I tested the local wifi with less than ambits of throughput, and 3 sec=
onds of latency, filmed here: =0A[ https://plus.google.com/u/0/107942175615=
993706558/posts/CY8ew8MPnMt ]( https://plus.google.com/u/0/1079421756159937=
06558/posts/CY8ew8MPnMt )=0ADo wish more folk were testing in the busy real=
 world environments, like coffee shops, cities... really, anywhere outside =
a faraday cage!=0AI am not attending netconf - I was unable to raise funds =
to go, and the program committee wanted something "new",=0Ainstead of the p=
reso I gave the IEEE 802.11 working group back in september. ( [ http://sna=
pon.lab.bufferbloat.net/~d/ieee802.11-sept-17-2014/11-14-1265-00-0wng-More-=
on-Bufferbloat.pdf ]( http://snapon.lab.bufferbloat.net/~d/ieee802.11-sept-=
17-2014/11-14-1265-00-0wng-More-on-Bufferbloat.pdf ) )=0AI was very pleased=
 with the results of that talk - the day after I gave it, the phrase "test =
for latency" showed up in a bunch of 802.11ax (the next generation after ac=
) documents. :) Still, we are stuck with the train wreck that is 802.11ac g=
lommed on top of 802.11n, glommed on top of 802.11g, in terms of queue mana=
gement, terrible uses of airtime, rate control and other stuff. Aruba and M=
eraki, in particular took a big interest in what I'd outlined in the preso =
above (we have a half dozen less well baked ideas - that's just the easy st=
uff that can be done to improve wifi).  I gave a followup at meraki but I d=
on't think that's online.=0AFelix (nbd) is on vacation right now, as I am I=
. In fact I am going somewhere for a week totally lacking internet access.=
=0APresently the plan, with what budget (none) we have and time (very littl=
e) we have is to produce a pair of proof of concept implementations for per=
 tid queuing (see relevant commit by nbd),  leveraging the new minstrel sta=
ts, the new minstrel-blues stuff, and an aggregation aware codel with a cal=
culated target based on the most recently active stations, and a bunch of t=
he other stuff outlined above at IEEE.=0AIt is my hope that this will start=
 to provide accurate back pressure (or sufficient lack thereof for TSQ), to=
 also improve throughput while still retaining low latency. But it is a cer=
tainty that we will run into more cross layer issues that will be a pita to=
 resolve.=0AIf we can put together a meet up around or during ELC in califo=
rnia in march? =0AI am really not terribly optimistic on anything other tha=
n the 2 chipsets we can hack on (ath9k, mt76). Negotiations to get qualcomm=
 to open up their ath10k firmware have thus far failed, nor has a ath10k-li=
te got anywhere. Perhaps broadcom would be willing to open up their firmwar=
e sufficiently to build in a better API?=0AA bit more below.=0A=0A On Jan 3=
0, 2015 5:59 AM, "Arend van Spriel" <[ arend@broadcom.com ]( mailto:arend@b=
roadcom.com )> wrote:=0A >=0A > On 01/30/15 14:19, Eric Dumazet wrote:=0A >=
>=0A >> On Fri, 2015-01-30 at 11:29 +0100, Arend van Spriel wrote:=0A >>=0A=
 >>> Hi Eric,=0A >>>=0A >>> Your suggestions are still based on the fact th=
at you consider wireless=0A >>> networking to be similar to ethernet, but a=
s Michal indicated there are=0A >>> some fundamental differences starting w=
ith CSMA/CD versus CSMA/CA. Also=0A >>> the medium conditions are far from =
comparable. =0AThe analogy i now use for it is that switched ethernet is ge=
nerally your classic "dumbbell"=0Atopology. Wifi is more like a "taxi-stand=
" topology. If you think about how people=0Aqueue up at a taxi stand (and s=
ometimes agree to share a ride), the inter arrival=0Aand departure times of=
 a taxi stand make for a better mental model. =0AAdmittedly, I seem to spen=
d a lot of time, waiting for taxies, thinking about=0Awifi.=0A>> There is n=
o shielding so=0A >>> it needs to deal with interference and dynamically dr=
ops the link rate=0A >>> so transmission of packets can take several millis=
econds. Then with 11n=0A >>> they came up with aggregation with sends up to=
 64 packets in a single=0A >>> transmit over the air at worst case 6.5 Mbps=
 (if I am not mistaken). The=0A >>> parameter value for tcp_limit_output_by=
tes of 131072 means that it=0A >>> allows queuing for about 1ms on a 1Gbps =
link, but I hope you can see=0A >>> this is not realistic for dealing with =
all variances of the wireless=0A >>> medium/standard. I suggested this as t=
opic for the wireless workshop in=0A >>> Otawa [1], but I can not attend th=
ere. Still hope that there will be=0A >>> some discussions to get more awar=
eness.=0A=0AI have sometimes hoped that TSQ could be made more a function o=
f the=0Anumber of active flows exiting an interface, but eric tells me that=
's impossible.=0AThis is possibly another case where TSQ could use to be a =
callback function...=0Abut frankly I care not a whit about maximizing singl=
e flow tcp throughput on wifi=0Ain a faraday cage.=0A=0A >>=0A >> Ever hear=
d about bufferbloat ?=0A >=0A >=0A > Sure. I am trying to get awareness abo=
ut that in our wireless driver/firmware development teams. So bear with me.=
=0A >=0A >=0A >> Have you read my suggestions and tried them ?=0A >>=0A >> =
You can adjust the limit per flow to pretty much you want. If you need=0A >=
> 64 packets, just do the math. If in 2018 you need 128 packets, do the=0A =
>> math again.=0A >>=0A >> I am very well aware that wireless wants aggrega=
tion, thank you.=0A=0AI note that a lot of people testing this are getting =
it backwards. Usually it is the AP that is sending lots and lots of big pac=
kets, where the return path is predominately acks from the station. =0AI am=
 not a huge fan of stretch acks, but certainly a little bit of thinning doe=
sn't bother me on the return path there.=0AGoing the other way, particularl=
y in a wifi world that insists on treating every packet as sacred (which I =
don't agree with at all), thinning acks can help, but single stream through=
put is of interest only on benchmarks, FQing as much as possible all the fl=
ows destined the station in each aggregate masks loss and reduces the need =
to protect everything so much.=0A >=0A > Sorry if I offended you. I was jus=
t giving these as example combined with effective rate usable on the medium=
 to say that the bandwidth is more dynamic in wireless and as such need dyn=
amic change of queue depth. Now this can be done by making the fraction siz=
e as used in your suggestion adaptive to these conditions.=0A=0AWell... see=
 above. Maybe this technique will do more of the right thing, but... go tes=
t.=0A=0A >=0A >> 131072 bytes of queue on 40Gbit is not 1ms, but 26 usec of=
 queueing, and=0A >> we get line rate nevertheless.=0A >=0A >=0A > I was sa=
ying it was about 1ms on *1Gbit* as the wireless TCP rates are moving into =
that direction in 11ac.=0A >=0A >=0A >> We need this level of shallow queue=
s (BQL, TSQ), to get very precise rtt=0A >> estimations so that TCP has goo=
d entropy for its pacing, even in the 50=0A >> usec rtt ranges.=0A >>=0A >>=
 If we allowed 1ms of queueing, then a 40Gbit flow would queue 5 MBytes.=0A=
 >>=0A >> This was terrible, because it increased cwnd and all sender queue=
s to=0A >> insane levels.=0A >=0A >=0A > Indeed and that is what we would l=
ike to address in our wireless drivers. I will setup some experiments using=
 the fraction sizing and post my findings. Again sorry if I offended you.=
=0A=0A=0AYou really, really, really need to test at rates below 50mbit and =
with other stations, also while doing this. It's not going to be a linear c=
urve.=0A =0A>=0A > Regards,=0A > Arend=0A >=0A > --=0A > To unsubscribe fro=
m this list: send the line "unsubscribe netdev" in=0A > the body of a messa=
ge to [ majordomo@vger.kernel.org ]( mailto:majordomo@vger.kernel.org )=0A =
> More majordomo info at  [ http://vger.kernel.org/majordomo-info.html ]( h=
ttp://vger.kernel.org/majordomo-info.html )=0A=0A-- =0A=0ADave T=C3=A4ht=0A=
=0Athttp://[ www.bufferbloat.net/projects/bloat/wiki/Upcoming_Talks ]( http=
://www.bufferbloat.net/projects/bloat/wiki/Upcoming_Talks )
------=_20150131165105000000_31330
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<font face=3D"tahoma" size=3D"2"><p style=3D"margin:0;padding:0;font-family=
: tahoma; font-size: 10pt; word-wrap: break-word;">I think we need to creat=
e an Internet focused 802.11 working group that would be to the "OS wireles=
s designers and IEEE 802.11 standards groups" as the WHATML group was to W3=
C.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10p=
t; word-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font=
-family: tahoma; font-size: 10pt; word-wrap: break-word;">W3C was clueless =
about the real world at the point WHATML was created. &nbsp;And WHATML was =
a "revenge of the real" against W3C - advancing a wide variety of important=
 practical innovations rather than attending endless standards meetings wit=
h people who were not focused on solving actually important problems.</p>=
=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wor=
d-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-famil=
y: tahoma; font-size: 10pt; word-wrap: break-word;">It took a bunch of work=
 to get WHATML going, and it offended W3C, who became unhelpful. &nbsp;But =
the approach actually worked - we now have a Web that really uses browser-s=
ide expressivity and that would never have happened if W3C were left to its=
 own devices.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; fon=
t-size: 10pt; word-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;pad=
ding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">The Wi=
Fi consortium was an attempt to wrest control of pragmatic direction from 8=
02.11 and the proprietary-divergence folks at Qualcomm, Broadcom, Cisco, et=
c. &nbsp;But it failed, because it became thieves on a raft, more focused o=
n picking each others' pockets than on actually addressing the big issues.<=
/p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; =
word-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-fa=
mily: tahoma; font-size: 10pt; word-wrap: break-word;">Jim has seen this pl=
ay out in the Linux community around X. &nbsp;Though there are lots of inte=
rests who would benefit by moving the engineering ball forward, everyone re=
sists action because it means giving up the chance at dominance, and the ce=
ntral group is far too weak to do anything beyond adjudicating the worst ba=
ttles.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size:=
 10pt; word-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;=
font-family: tahoma; font-size: 10pt; word-wrap: break-word;">When I say "w=
e" I definitely include myself (though my time is limited due to other comm=
itments and the need to support my family), but I would only play with peop=
le who actually are committed to making stuff happen - which includes raisi=
ng hell with the vendors if need be, but also effective engineering steps t=
hat can achieve quick adoption.</p>=0A<p style=3D"margin:0;padding:0;font-f=
amily: tahoma; font-size: 10pt; word-wrap: break-word;">&nbsp;</p>=0A<p sty=
le=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: b=
reak-word;">Sadly, and I think it is manageable at the moment, there are mo=
ves out there being made to get the FCC to "protect" WiFi from "interferenc=
e". &nbsp;The current one was Marriott, who requested the FCC for a rule to=
 make it&nbsp;legal to disrupt and block use of WiFi in people's rooms in t=
heir hotels, except with their access points. &nbsp;This also needs some te=
chnical defense. &nbsp;I believe any issues with WiFi performance in actual=
 Marriott hotels are due to bufferbloat in their hotel-wide systems, just a=
s the issues with GoGo are the same. &nbsp;But it's possible that queueing =
problems in their own WiFi gear are bad as well.</p>=0A<p style=3D"margin:0=
;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">&n=
bsp;</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 1=
0pt; word-wrap: break-word;">I mention this because it is related, and to t=
he layperson, or non-radio-knowledgeable executive, indistinguishable. &nbs=
p;It will take away the incentive to actually fix the 802.11 implementation=
s to be better performing, making the problem seem to be a "management" iss=
ue that can be solved by making WiFi less interoperable and less flexible b=
y rules, rather than by engineering.</p>=0A<p style=3D"margin:0;padding:0;f=
ont-family: tahoma; font-size: 10pt; word-wrap: break-word;">&nbsp;</p>=0A<=
p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wr=
ap: break-word;">However, solving the problems of hotspot networks and hote=
l networks are definitely "real world" issues, and quite along the same lin=
es you mention, Dave. &nbsp;FQ is almost certainly a big deal both in WiFi =
and in the distribution networks behind WiFi. Co-existence is also a big de=
al (RTS/CTS-like mechanisms can go a long way to remediate hidden-terminal =
disruption of the basic protocols). Roaming and scaling need work as well.<=
/p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; =
word-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-fa=
mily: tahoma; font-size: 10pt; word-wrap: break-word;">It would even be a g=
ood thing to invent pragmatic ways to provide "low rate" subnets and "high =
rate" subnets that can coexist, so that compatibility with ancient "b" netw=
orks need not be maintained on all nets, at great cost - just send beacons =
at a high rate, so that the "b" NICs&nbsp;can't see them.... but you need p=
ragmatic stack implementations.</p>=0A<p style=3D"margin:0;padding:0;font-f=
amily: tahoma; font-size: 10pt; word-wrap: break-word;">&nbsp;</p>=0A<p sty=
le=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: b=
reak-word;">But the engineering is not the only challenge. The other challe=
nge is to take the initiative and get stuff deployed. &nbsp;In the case of =
bufferbloat, the grade currently is a "D" for deployments, maybe a "D-". &n=
bsp;Beautiful technical work, but the economic/business/political side of t=
hings has been poor. &nbsp;Look at how slow IETF has been to achieve anythi=
ng (the perfect is truly the enemy of the good, and Dave Clark's "rough con=
sensus and working code" has been replaced by technocratic malaise, and wha=
t appears to me to be a class of people who love traveling the world to a f=
loating cocktail party without getting anything important done).</p>=0A<p s=
tyle=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap:=
 break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-family: taho=
ma; font-size: 10pt; word-wrap: break-word;">The problem with communication=
s is that you can't just ship a product with a new "feature", because the i=
nnovation only works if widely adopted. &nbsp;Since there is no "Linux Desk=
top" (and Linus hates the idea, to a large extent) Linux can't be the sole =
carrier of the idea. &nbsp;You pretty much need iOS and Android both to buy=
 in or to provide a path for easy third-party upgrades. &nbsp;How do you do=
 that? &nbsp;Well, that's where the WHATML-type approach is necessary.</p>=
=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wor=
d-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-famil=
y: tahoma; font-size: 10pt; word-wrap: break-word;">I don't know if this ca=
n be achieved, and there are lots of details to be worked out. &nbsp;But I'=
ll play.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-siz=
e: 10pt; word-wrap: break-word;">&nbsp;</p>=0A<p style=3D"margin:0;padding:=
0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">&nbsp;</p>=
=0A<!--WM_COMPOSE_SIGNATURE_START--><!--WM_COMPOSE_SIGNATURE_END-->=0A<p st=
yle=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: =
break-word;"><br /><br />On Saturday, January 31, 2015 4:05pm, "Dave Taht" =
&lt;dave.taht@gmail.com&gt; said:<br /><br /></p>=0A<div id=3D"SafeStyles14=
22739208">=0A<div dir=3D"ltr">I would like to have somehow assembled all th=
e focused resources to make a go at fixing wifi, or at least having a f2f w=
ith a bunch of people in the late march timeframe. This message of mine to =
linux-wireless bounced for some reason and I am off to log out for 10 days,=
 so...=0A<div>see relevant netdev thread also for ore details.</div>=0A<div=
><br />=0A<div class=3D"gmail_quote">---------- Forwarded message ---------=
-<br />From: <strong class=3D"gmail_sendername">Dave Taht</strong> <span di=
r=3D"ltr">&lt;<a href=3D"mailto:dave.taht@gmail.com">dave.taht@gmail.com</a=
>&gt;</span><br />Date: Sat, Jan 31, 2015 at 12:29 PM<br />Subject: Re: Thr=
oughput regression with `tcp: refine TSO autosizing`<br />To: Arend van Spr=
iel &lt;<a href=3D"mailto:arend@broadcom.com">arend@broadcom.com</a>&gt;<br=
 />Cc: linux-wireless &lt;<a href=3D"mailto:linux-wireless@vger.kernel.org"=
>linux-wireless@vger.kernel.org</a>&gt;, Michal Kazior &lt;<a href=3D"mailt=
o:michal.kazior@tieto.com">michal.kazior@tieto.com</a>&gt;, Eyal Perry &lt;=
<a href=3D"mailto:eyalpe@dev.mellanox.co.il">eyalpe@dev.mellanox.co.il</a>&=
gt;, Network Development &lt;<a href=3D"mailto:netdev@vger.kernel.org">netd=
ev@vger.kernel.org</a>&gt;, Eric Dumazet &lt;<a href=3D"mailto:eric.dumazet=
@gmail.com">eric.dumazet@gmail.com</a>&gt;<br /><br /><br />=0A<div dir=3D"=
ltr">=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt=
; word-wrap: break-word;" dir=3D"ltr">The wifi industry as a whole has vast=
ly bigger problems than achieving 1500Mbits in a faraday cage on a single f=
low.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 1=
0pt; word-wrap: break-word;">I encourage you to try tests in netperf-wrappe=
r that explicitly test for latency under load, and in particular, the RTT_F=
AIR tests against 4 or more stations on a single wifi AP. You will find the=
 results very depressing. Similarly, on your previous test series, a latenc=
y figure would have been nice to have. I just did a talk at nznog, where I =
tested the local wifi with less than ambits of throughput, and 3 seconds of=
 latency, filmed here:&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-fami=
ly: tahoma; font-size: 10pt; word-wrap: break-word;"><a href=3D"https://plu=
s.google.com/u/0/107942175615993706558/posts/CY8ew8MPnMt" target=3D"_blank"=
>https://plus.google.com/u/0/107942175615993706558/posts/CY8ew8MPnMt</a></p=
>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wo=
rd-wrap: break-word;">Do wish more folk were testing in the busy real world=
 environments, like coffee shops, cities... really, anywhere outside a fara=
day cage!</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-si=
ze: 10pt; word-wrap: break-word;">I am not attending netconf - I was unable=
 to raise funds to go, and the program committee wanted something "new",</p=
>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wo=
rd-wrap: break-word;">instead of the preso I gave the IEEE 802.11 working g=
roup back in september. (&nbsp;<a href=3D"http://snapon.lab.bufferbloat.net=
/~d/ieee802.11-sept-17-2014/11-14-1265-00-0wng-More-on-Bufferbloat.pdf" tar=
get=3D"_blank">http://snapon.lab.bufferbloat.net/~d/ieee802.11-sept-17-2014=
/11-14-1265-00-0wng-More-on-Bufferbloat.pdf</a> )</p>=0A<p style=3D"margin:=
0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">I=
 was very pleased with the results of that talk - the day after I gave it, =
the phrase "test for latency" showed up in a bunch of 802.11ax (the next ge=
neration after ac) documents. :) Still, we are stuck with the train wreck t=
hat is 802.11ac glommed on top of 802.11n, glommed on top of 802.11g, in te=
rms of queue management, terrible uses of airtime, rate control and other s=
tuff. Aruba and Meraki, in particular took a big interest in what I'd outli=
ned in the preso above (we have a half dozen less well baked ideas - that's=
 just the easy stuff that can be done to improve wifi).&nbsp; I gave a foll=
owup at meraki but I don't think that's online.</p>=0A<p style=3D"margin:0;=
padding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">Fel=
ix (nbd) is on vacation right now, as I am I. In fact I am going somewhere =
for a week totally lacking internet access.</p>=0A<p style=3D"margin:0;padd=
ing:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">Present=
ly the plan, with what budget (none) we have and time (very little) we have=
 is to produce a pair of proof of concept implementations for per tid queui=
ng (see relevant commit by nbd), &nbsp;leveraging the new minstrel stats, t=
he new minstrel-blues stuff, and an aggregation aware codel with a calculat=
ed target based on the most recently active stations, and a bunch of the ot=
her stuff outlined above at IEEE.</p>=0A<p style=3D"margin:0;padding:0;font=
-family: tahoma; font-size: 10pt; word-wrap: break-word;">It is my hope tha=
t this will start to provide accurate back pressure (or sufficient lack the=
reof for TSQ), to also improve throughput while still retaining low latency=
. But it is a certainty that we will run into more cross layer issues that =
will be a pita to resolve.</p>=0A<p style=3D"margin:0;padding:0;font-family=
: tahoma; font-size: 10pt; word-wrap: break-word;">If we can put together a=
 meet up around or during ELC in california in march?&nbsp;</p>=0A<p style=
=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: bre=
ak-word;">I am really not terribly optimistic on anything other than the 2 =
chipsets we can hack on (ath9k, mt76). Negotiations to get qualcomm to open=
 up their ath10k firmware have thus far failed, nor has a ath10k-lite got a=
nywhere. Perhaps broadcom would be willing to open up their firmware suffic=
iently to build in a better API?</p>=0A<p style=3D"margin:0;padding:0;font-=
family: tahoma; font-size: 10pt; word-wrap: break-word;" dir=3D"ltr">A bit =
more below.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-=
size: 10pt; word-wrap: break-word;" dir=3D"ltr"><br /> On Jan 30, 2015 5:59=
 AM, "Arend van Spriel" &lt;<a href=3D"mailto:arend@broadcom.com" target=3D=
"_blank">arend@broadcom.com</a>&gt; wrote:<br /> &gt;<br /> &gt; On 01/30/1=
5 14:19, Eric Dumazet wrote:<br /> &gt;&gt;<br /> &gt;&gt; On Fri, 2015-01-=
30 at 11:29 +0100, Arend van Spriel wrote:<br /> &gt;&gt;<br /> &gt;&gt;&gt=
; Hi Eric,<br /> &gt;&gt;&gt;<br /> &gt;&gt;&gt; Your suggestions are still=
 based on the fact that you consider wireless<br /> &gt;&gt;&gt; networking=
 to be similar to ethernet, but as Michal indicated there are<br /> &gt;&gt=
;&gt; some fundamental differences starting with CSMA/CD versus CSMA/CA. Al=
so<br /> &gt;&gt;&gt; the medium conditions are far from comparable.&nbsp;<=
/p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; =
word-wrap: break-word;">The analogy i now use for it is that switched ether=
net is generally your classic "dumbbell"</p>=0A<p style=3D"margin:0;padding=
:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">topology. =
Wifi is more like a "taxi-stand" topology. If you think about how people</p=
>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wo=
rd-wrap: break-word;">queue up at a taxi stand (and sometimes agree to shar=
e a ride), the inter arrival</p>=0A<p style=3D"margin:0;padding:0;font-fami=
ly: tahoma; font-size: 10pt; word-wrap: break-word;">and departure times of=
 a taxi stand make for a better mental model.&nbsp;</p>=0A<p style=3D"margi=
n:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;"=
>Admittedly, I seem to spend a lot of time, waiting for taxies, thinking ab=
out</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10=
pt; word-wrap: break-word;">wifi.</p>=0A<p style=3D"margin:0;padding:0;font=
-family: tahoma; font-size: 10pt; word-wrap: break-word;" dir=3D"ltr"><span=
 class=3D"">&gt;&gt; There is no shielding so<br /> &gt;&gt;&gt; it needs t=
o deal with interference and dynamically drops the link rate<br /> &gt;&gt;=
&gt; so transmission of packets can take several milliseconds. Then with 11=
n<br /> &gt;&gt;&gt; they came up with aggregation with sends up to 64 pack=
ets in a single<br /> &gt;&gt;&gt; transmit over the air at worst case 6.5 =
Mbps (if I am not mistaken). The<br /> &gt;&gt;&gt; parameter value for tcp=
_limit_output_bytes of 131072 means that it<br /> &gt;&gt;&gt; allows queui=
ng for about 1ms on a 1Gbps link, but I hope you can see<br /> &gt;&gt;&gt;=
 this is not realistic for dealing with all variances of the wireless<br />=
 &gt;&gt;&gt; medium/standard. I suggested this as topic for the wireless w=
orkshop in<br /> &gt;&gt;&gt; Otawa [1], but I can not attend there. Still =
hope that there will be<br /> &gt;&gt;&gt; some discussions to get more awa=
reness.<br /><br /></span>I have sometimes hoped that TSQ could be made mor=
e a function of the</p>=0A<p style=3D"margin:0;padding:0;font-family: tahom=
a; font-size: 10pt; word-wrap: break-word;" dir=3D"ltr">number of active fl=
ows exiting an interface, but eric tells me that's impossible.</p>=0A<p sty=
le=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: b=
reak-word;" dir=3D"ltr">This is possibly another case where TSQ could use t=
o be a callback function...</p>=0A<p style=3D"margin:0;padding:0;font-famil=
y: tahoma; font-size: 10pt; word-wrap: break-word;" dir=3D"ltr">but frankly=
 I care not a whit about maximizing single flow tcp throughput on wifi</p>=
=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wor=
d-wrap: break-word;" dir=3D"ltr">in a faraday cage.</p>=0A<p style=3D"margi=
n:0;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;"=
 dir=3D"ltr"><span class=3D""><br /> &gt;&gt;<br /> &gt;&gt; Ever heard abo=
ut bufferbloat ?<br /> &gt;<br /> &gt;<br /> &gt; Sure. I am trying to get =
awareness about that in our wireless driver/firmware development teams. So =
bear with me.<br /> &gt;<br /> &gt;<br /> &gt;&gt; Have you read my suggest=
ions and tried them ?<br /> &gt;&gt;<br /> &gt;&gt; You can adjust the limi=
t per flow to pretty much you want. If you need<br /> &gt;&gt; 64 packets, =
just do the math. If in 2018 you need 128 packets, do the<br /> &gt;&gt; ma=
th again.<br /> &gt;&gt;<br /> &gt;&gt; I am very well aware that wireless =
wants aggregation, thank you.<br /><br /></span>I note that a lot of people=
 testing this are getting it backwards. Usually it is the AP that is sendin=
g lots and lots of big packets, where the return path is predominately acks=
 from the station.&nbsp;</p>=0A<p style=3D"margin:0;padding:0;font-family: =
tahoma; font-size: 10pt; word-wrap: break-word;" dir=3D"ltr">I am not a hug=
e fan of stretch acks, but certainly a little bit of thinning doesn't bothe=
r me on the return path there.</p>=0A<p style=3D"margin:0;padding:0;font-fa=
mily: tahoma; font-size: 10pt; word-wrap: break-word;">Going the other way,=
 particularly in a wifi world that insists on treating every packet as sacr=
ed (which I don't agree with at all), thinning acks can help, but single st=
ream throughput is of interest only on benchmarks, FQing as much as possibl=
e all the flows destined the station in each aggregate masks loss and reduc=
es the need to protect everything so much.</p>=0A<p style=3D"margin:0;paddi=
ng:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;" dir=3D"l=
tr"><span class=3D""> &gt;<br /> &gt; Sorry if I offended you. I was just g=
iving these as example combined with effective rate usable on the medium to=
 say that the bandwidth is more dynamic in wireless and as such need dynami=
c change of queue depth. Now this can be done by making the fraction size a=
s used in your suggestion adaptive to these conditions.<br /><br /></span>W=
ell... see above. Maybe this technique will do more of the right thing, but=
... go test.</p>=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font=
-size: 10pt; word-wrap: break-word;" dir=3D"ltr"><br /> &gt;<br /> &gt;&gt;=
 131072 bytes of queue on 40Gbit is not 1ms, but 26 usec of queueing, and<b=
r /> &gt;&gt; we get line rate nevertheless.<br /> &gt;<br /> &gt;<br /> &g=
t; I was saying it was about 1ms on *1Gbit* as the wireless TCP rates are m=
oving into that direction in 11ac.<br /> &gt;<br /> &gt;<br /> &gt;&gt; We =
need this level of shallow queues (BQL, TSQ), to get very precise rtt<br />=
 &gt;&gt; estimations so that TCP has good entropy for its pacing, even in =
the 50<br /> &gt;&gt; usec rtt ranges.<br /> &gt;&gt;<br /> &gt;&gt; If we =
allowed 1ms of queueing, then a 40Gbit flow would queue 5 MBytes.<br /> &gt=
;&gt;<br /> &gt;&gt; This was terrible, because it increased cwnd and all s=
ender queues to<br /> &gt;&gt; insane levels.<br /> &gt;<br /> &gt;<br /> &=
gt; Indeed and that is what we would like to address in our wireless driver=
s. I will setup some experiments using the fraction sizing and post my find=
ings. Again sorry if I offended you.<br /><br /></p>=0A<p style=3D"margin:0=
;padding:0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;">Yo=
u really, really, really need to test at rates below 50mbit and with other =
stations, also while doing this. It's not going to be a linear curve.</p>=
=0A<p style=3D"margin:0;padding:0;font-family: tahoma; font-size: 10pt; wor=
d-wrap: break-word;" dir=3D"ltr">&nbsp;</p>=0A<p style=3D"margin:0;padding:=
0;font-family: tahoma; font-size: 10pt; word-wrap: break-word;" dir=3D"ltr"=
>&gt;<br /> &gt; Regards,<br /> &gt; Arend<br /> &gt;<br /> &gt; --<br /> &=
gt; To unsubscribe from this list: send the line "unsubscribe netdev" in<br=
 /> &gt; the body of a message to <a href=3D"mailto:majordomo@vger.kernel.o=
rg" target=3D"_blank">majordomo@vger.kernel.org</a><br /> &gt; More majordo=
mo info at&nbsp; <a href=3D"http://vger.kernel.org/majordomo-info.html" tar=
get=3D"_blank">http://vger.kernel.org/majordomo-info.html</a></p>=0A</div>=
=0A</div>=0A<br /><br />-- <br />=0A<div class=3D"gmail_signature">Dave T=
=C3=A4ht<br /><br />thttp://<a href=3D"http://www.bufferbloat.net/projects/=
bloat/wiki/Upcoming_Talks" target=3D"_blank">www.bufferbloat.net/projects/b=
loat/wiki/Upcoming_Talks</a></div>=0A</div>=0A</div>=0A</div></font>
------=_20150131165105000000_31330--