On 21. aug. 2014, at 19:04, Dave Taht <dave.taht@gmail.com> wrote:

> On Thu, Aug 21, 2014 at 5:21 AM, Michael Welzl <michawe@ifi.uio.no> wrote:
>> Dave,
>>
>> About this point that I've seen you make repeatedly:
>>
>>> My biggest problem with all the work so far is that it starts with a
>>> constant baseline 150ms or 100ms RTT, then tries various algorithms
>>> over a wide range of bandwidths, and then elides that base RTT in
>>> all future plots. Unless you read carefully, you don't see that...
>>>
>>> The original bufferbloat experiment was at a physical RTT of 10ms,
>>> with bidirectional traffic flooding the queues in both directions,
>>> on an asymmetric network. I keep waiting for more experimenters to
>>> try that as a baseline experiment.
>>
>> This sounds like a call for reality, when the point is to measure
>> things that matter to the system you're investigating.
>
> Heh. It is my wish, certainly, to see the Remy concept extended to
> solving the problems I consider important. The prospect of merely
> warming my feet on a nice 80-core box for a week, and evaluating the
> results, is quite appealing.
>
> I'll gladly trade someone else's code, and someone else's compute
> time, for a vacation (or unemployment, if it works out!). I can also
> arrange for a very hefty cluster to tackle this stuff, if the code
> were public.
>
> I have encouraged Keith to look into the netfpga.org project as a
> possible target for the rules being created by Remy's algorithms.
>
>> E.g., if I'm investigating TCP, and I don't specifically work on ACK
>> behavior (ACK congestion control or something like that), I usually
>> don't care much about the backwards traffic or the asymmetry. Yes,
>> it does influence the measured RTT a bit, but then you argue for
>> using a smaller base RTT, where the impact of this gets smaller too.
>>
>> What matters to the function of congestion controllers is the BDP,
>> and the buffer size.
>
> Which varies based on the real physical RTT to the server. One of the
> biggest problems in nets today is that larger flow sources (CDNs, HAS
> (Netflix)) keep moving closer and closer to the end node, with all
> the problems with TCP fairness that short RTTs induce on longer ones,
> like your radio flow below.

Sure
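(Back-of-the-envelope, just to convince myself how much the physical
RTT changes what a given buffer means; the rates and RTTs below are
purely illustrative and not taken from any particular paper:)

# BDP vs. physical RTT, illustrative numbers only
MSS = 1500  # bytes, a full-sized packet

def bdp(rate_mbit, rtt_ms):
    """Return the bandwidth-delay product in bytes and in MSS-sized packets."""
    bdp_bytes = (rate_mbit * 1e6 / 8) * (rtt_ms / 1e3)
    return bdp_bytes, bdp_bytes / MSS

for rate in (10, 100):        # Mbit/s
    for rtt in (10, 100):     # ms of physical RTT
        b, p = bdp(rate, rtt)
        print(f"{rate:3} Mbit/s @ {rtt:3} ms: BDP = {b/1e3:7.1f} kB = {p:6.1f} pkts")

A fixed 100-packet buffer is then roughly 12x the BDP at 10 Mbit/s and
10 ms, but only about an eighth of it at 100 Mbit/s and 100 ms, so
eliding the base RTT really does hide a lot of the story.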
>> As for real RTTs, I like pinging wwoz.org because it's a radio
>> station in New Orleans. I do sometimes listen to it, and then I get
>> traffic via a TCP connection that has this RTT. Very real. My ping
>> delay is consistently above 110ms.
>
> I too use a low-rate radio stream to measure the impact of other
> flows on my listening experience.
>
> I'd written this before the bufferbloat project got rolling, before
> I'd even met Jim:
>
> http://nex-6.taht.net/posts/Did_Bufferbloat_Kill_My_Net_Radio/
>
> ... haven't had a problem for 2 years now. :)
>
> Still do use the stream-ripping thing for taking stuff on the road.
>
>> On a side note, I can't help but mention that the "original
>> bufferbloat experiment" features ping over FQ... measuring ping all
>> by itself, pretty much :-)
>
> Huh? The original bufferbloat experiments were against a Comcast
> cable modem, and various other devices, lacking FQ entirely. There
> was a youtube video, a paper, things like "daddy, why is the internet
> slow today", and various other resources. So the earliest stuff was
> "upload + ping + complaints about the network from the kids", and the
> later simultaneous up+down+ping abstraction was to get the kids out
> of the equation.

Oops, sorry. I referred to something I'd seen presented so often that
I thought it must be the "original experiment". Of course it isn't;
FQ_CoDel wasn't there from day 1 :) Apologies.

> I agree that up+down+ping is not as useful as we'd like now that FQ
> is on the table.
>
> I have been attempting to formalize where we are now, adding rigor
> based on those early benchmarks, with netperf-wrapper's
> tcp_bidirectional test, the more advanced rrul and rtt_fairness
> tests, and a model in ns3. Given how much bloat we've killed
> elsewhere, I would like to continue to improve these tests to
> properly show interactions with truly isochronous streams (there are
> several tests in netperf-wrapper now that leverage D-ITG for that,
> thx Toke!) and with more bursty yet rate-limited flows like
> videoconferencing, whose structure we've been discussing with various
> members of the rmcat group.
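(As an aside, for anyone without the netperf-wrapper harness handy:
the simultaneous up + down + ping experiment these tests automate
boils down to roughly the following. This is only a sketch using plain
netperf and ping; the host and duration are placeholders, and it is
not the actual tcp_bidirectional test definition.)

#!/usr/bin/env python3
# Saturate the link in both directions with netperf while sampling
# latency with ping. Placeholder host; assumes netserver runs there.
import subprocess

HOST = "netperf.example.org"   # placeholder
DURATION = "60"                # seconds / ping count

procs = [
    # upload: TCP_STREAM sends data from here to the netserver
    subprocess.Popen(["netperf", "-H", HOST, "-t", "TCP_STREAM", "-l", DURATION]),
    # download: TCP_MAERTS streams data from the netserver back to us
    subprocess.Popen(["netperf", "-H", HOST, "-t", "TCP_MAERTS", "-l", DURATION]),
    # latency under load: one ping per second for the same period
    subprocess.Popen(["ping", "-c", DURATION, HOST]),
]

for p in procs:
    p.wait()

Watching the ping times climb while the two netperf flows run is the
up+down+ping abstraction in a nutshell; netperf-wrapper adds the
scheduling, data collection and plotting on top.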
> My complaint comes from seeing very low bandwidths, long RTTs, and
> short queue lengths used in materials used to explain bufferbloat,
> like this:

Short queue lengths are of course odd. Low bandwidth is sometimes
convenient to isolate an effect, as it makes it less likely for the
CPU, rwnd, lack of window scaling, or whatever else to be your
bottleneck.

> https://www.udacity.com/wiki/cn/assignment5-buffer-bloat
>
> when the reality is:
>
> 1) There is a double bump you see in TCP, for example at truly
> excessive queue lengths. You also see truly terrible ramp-up
> performance when data and ACK flows compete.
>
> 2) The example assumes byte-to-packet-length equivalence (100 packets
> = 150kB), which is completely untrue: 100 packets is anywhere in the
> range of 6.4kB-150kB (100 64-byte ACKs = 6.4kB, 100 1500-byte data
> packets = 150kB), and a reason why we have much larger packet-based
> queue lengths in commonly deployed equipment IS optimizing for ACKs,
> not data.
>
> If you want to test a byte-based queue length, by all means do so (I
> have gradually come to the opinion that byte-based queues are best),
> but test either packet- or byte-based queues under conditions that
> are ACK-dominated or data-dominated, with both, preferably all
> scenarios.
>
> 3) It is at 1.5Mbit, not at the 5, 10, or 100Mbit speeds. Below
> 10Mbit you see IW10 or IW4 effects, you see ssthresh not being hit,
> and a variety of other TCP-specific issues, rather than the generic
> bufferbloat problem.
>
> 4) The 20ms in that course is actually reasonable. :)
>
> I don't understand what the harm would have been to the student to
> use 1000 packets, and to test up, down, and up + down, at 10 or
> 100Mbit.
>
> 5) I'd have liked it if the final experiment in the above course
> material switched to huge queue lengths (say, for example, Cisco's
> default of 2800 for their 4500 line), or explained the real-world
> situation better, so the takeaway was closer to the real size and
> scope of the problem.
>
> Anyway...
>
> Nearly every paper I've read makes one or more of the above
> modifications to what I'd viewed as the original experiment, and I'd
> like to add rigor to the definitions of phrases floating around like
> "bufferbloat episode", and so on.
>
>> Michael
>
> --
> Dave Täht
>
> NSFW: https://w2.eff.org/Censorship/Internet_censorship_bills/russell_0296_indecent.article