General list for discussing Bufferbloat
 help / color / mirror / Atom feed
From: Neil Davies <Neil.Davies@pnsol.com>
To: Stephen Hemminger <shemminger@vyatta.com>
Cc: bloat@lists.bufferbloat.net
Subject: [Bloat] Burst Loss
Date: Thu, 5 May 2011 17:49:18 +0100	[thread overview]
Message-ID: <6E25D2CF-D0F0-4C41-BABC-4AB0C00862A6@pnsol.com> (raw)
In-Reply-To: <20110505091046.3c73e067@nehalam>

On the issue of loss - we did a study of the UK's ADSL access network back in 2006 over several weeks, looking at the loss and delay that was introduced into the bi-directional traffic.

We found that the delay variability (that bit left over after you've taken the effects of geography and line sync rates) was broadly
the same over the half dozen locations we studied - it was there all the time to the same level of  variance and that what did vary by time of day was the loss rate.

We also found out, at the time much to our surprise - but we understand why now, that loss was broadly independent of the offered load - we used a constant data rate (with either fixed or variable packet sizes) .

We found that loss rates were in the range 1% to 3% (which is what would be expected from a large number of TCP streams contending for a limiting resource).

As for burst loss, yes it does occur - but it could be argued that this more the fault of the sending TCP stack than the network.

This phenomenon was well covered in the academic literature in the '90s (if I remember correctly folks at INRIA lead the way) - it is all down to the nature of random processes and how you observe them.  

Back to back packets see higher loss rates than packets more spread out in time. Consider a pair of packets, back to back, arriving over a 1Gbit/sec link into a queue being serviced at 34Mbit/sec, the first packet being 'lost' is equivalent to saying that the first packet 'observed' the queue full - the system's state is no longer a random variable - it is known to be full. The second packet (lets assume it is also a full one) 'makes an observation' of the state of that queue about 12us later - but that is only 3% of the time that it takes to service such large packets at 34 Mbit/sec. The system has not had any time to 'relax' anywhere near to back its steady state, it is highly likely that it is still full. 

Fixing this makes a phenomenal difference on the goodput (with the usual delay effects that implies), we've even built and deployed systems with this sort of engineering embedded (deployed as a network 'wrap') that mean that end users can sustainably (days on end) achieve effective throughput that is better than 98% of (the transmission media imposed) maximum. What we had done is make the network behave closer to the underlying statistical assumptions made in TCP's design.

Neil




On 5 May 2011, at 17:10, Stephen Hemminger wrote:

> On Thu, 05 May 2011 12:01:22 -0400
> Jim Gettys <jg@freedesktop.org> wrote:
> 
>> On 04/30/2011 03:18 PM, Richard Scheffenegger wrote:
>>> I'm curious, has anyone done some simulations to check if the 
>>> following qualitative statement holds true, and if, what the 
>>> quantitative effect is:
>>> 
>>> With bufferbloat, the TCP congestion control reaction is unduely 
>>> delayed. When it finally happens, the tcp stream is likely facing a 
>>> "burst loss" event - multiple consecutive packets get dropped. Worse 
>>> yet, the sender with the lowest RTT across the bottleneck will likely 
>>> start to retransmit while the (tail-drop) queue is still overflowing.
>>> 
>>> And a lost retransmission means a major setback in bandwidth (except 
>>> for Linux with bulk transfers and SACK enabled), as the standard (RFC 
>>> documented) behaviour asks for a RTO (1sec nominally, 200-500 ms 
>>> typically) to recover such a lost retransmission...
>>> 
>>> The second part (more important as an incentive to the ISPs actually), 
>>> how does the fraction of goodput vs. throughput change, when AQM 
>>> schemes are deployed, and TCP CC reacts in a timely manner? Small ISPs 
>>> have to pay for their upstream volume, regardless if that is "real" 
>>> work (goodput) or unneccessary retransmissions.
>>> 
>>> When I was at a small cable ISP in switzerland last week, surely 
>>> enough bufferbloat was readily observable (17ms -> 220ms after 30 sec 
>>> of a bulk transfer), but at first they had the "not our problem" view, 
>>> until I started discussing burst loss / retransmissions / goodput vs 
>>> throughput - with the latest point being a real commercial incentive 
>>> to them. (They promised to check if AQM would be available in the CPE 
>>> / CMTS, and put latency bounds in their tenders going forward).
>>> 
>> I wish I had a good answer to your very good questions.  Simulation 
>> would be interesting though real daa is more convincing.
>> 
>> I haven't looked in detail at all that many traces to try to get a feel 
>> for how much bandwidth waste there actually is, and more formal studies 
>> like Netalyzr, SamKnows, or the Bismark project would be needed to 
>> quantify the loss on the network as a whole.
>> 
>> I did spend some time last fall with the traces I've taken.  In those, 
>> I've typically been seeing 1-3% packet loss in the main TCP transfers.  
>> On the wireless trace I took, I saw 9% loss, but whether that is 
>> bufferbloat induced loss or not, I don't know (the data is out there for 
>> those who might want to dig).  And as you note, the losses are 
>> concentrated in bursts (probably due to the details of Cubic, so I'm told).
>> 
>> I've had anecdotal reports (and some first hand experience) with much 
>> higher loss rates, for example from Nick Weaver at ICSI; but I believe 
>> in playing things conservatively with any numbers I quote and I've not 
>> gotten consistent results when I've tried, so I just report what's in 
>> the packet captures I did take.
>> 
>> A phenomena that could be occurring is that during congestion avoidance 
>> (until TCP loses its cookies entirely and probes for a higher operating 
>> point) that TCP is carefully timing it's packets to keep the buffers 
>> almost exactly full, so that competing flows (in my case, simple pings) 
>> are likely to arrive just when there is no buffer space to accept them 
>> and therefore you see higher losses on them than you would on the single 
>> flow I've been tracing and getting loss statistics from.
>> 
>> People who want to look into this further would be a great help.
>>                 - Jim
> 
> I would not put a lot of trust in measuring loss with pings. 
> I heard that some ISP's do different processing on ICMP's used
> for ping packets. They either prioritize them high to provide 
> artificially good response (better marketing numbers); or 
> prioritize them low since they aren't useful traffic.
> There are also filters that only allow N ICMP requests per second
> which means repeated probes will be dropped.
> 
> 
> 
> -- 
> _______________________________________________
> Bloat mailing list
> Bloat@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/bloat


  parent reply	other threads:[~2011-05-05 16:44 UTC|newest]

Thread overview: 65+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-04-26 17:05 [Bloat] Network computing article on bloat Dave Taht
2011-04-26 18:13 ` Dave Hart
2011-04-26 18:17   ` Dave Taht
2011-04-26 18:28     ` dave greenfield
2011-04-26 18:32     ` Wesley Eddy
2011-04-26 19:37       ` Dave Taht
2011-04-26 20:21         ` Wesley Eddy
2011-04-26 20:30           ` Constantine Dovrolis
2011-04-26 21:16             ` Dave Taht
2011-04-27 17:10           ` Bill Sommerfeld
2011-04-27 17:40             ` Wesley Eddy
2011-04-27  7:43       ` Jonathan Morton
2011-04-30 15:56       ` Henrique de Moraes Holschuh
2011-04-30 19:18       ` [Bloat] Goodput fraction w/ AQM vs bufferbloat Richard Scheffenegger
2011-05-05 16:01         ` Jim Gettys
2011-05-05 16:10           ` Stephen Hemminger
2011-05-05 16:30             ` Jim Gettys
2011-05-05 16:49             ` Neil Davies [this message]
2011-05-05 18:34               ` [Bloat] Burst Loss Jim Gettys
2011-05-06 11:40               ` Sam Stickland
2011-05-06 11:53                 ` Neil Davies
2011-05-08 12:42               ` Richard Scheffenegger
2011-05-09 18:06                 ` Rick Jones
2011-05-11  8:53                   ` Richard Scheffenegger
2011-05-11  9:53                     ` Eric Dumazet
2011-05-12 14:16                       ` [Bloat] Publications Richard Scheffenegger
2011-05-12 16:31                   ` [Bloat] Burst Loss Fred Baker
2011-05-12 16:41                     ` Rick Jones
2011-05-12 17:11                       ` Fred Baker
2011-05-13  5:00                     ` Kevin Gross
2011-05-13 14:35                       ` Rick Jones
2011-05-13 14:54                         ` Dave Taht
2011-05-13 20:03                           ` [Bloat] Jumbo frames and LAN buffers (was: RE: Burst Loss) Kevin Gross
2011-05-14 20:48                             ` Fred Baker
2011-05-15 18:28                               ` Jonathan Morton
2011-05-15 20:49                                 ` Fred Baker
2011-05-16  0:31                                   ` Jonathan Morton
2011-05-16  7:51                                     ` Richard Scheffenegger
2011-05-16  9:49                                       ` Fred Baker
2011-05-16 11:23                                         ` [Bloat] Jumbo frames and LAN buffers Jim Gettys
2011-05-16 13:15                                           ` Kevin Gross
2011-05-16 13:22                                             ` Jim Gettys
2011-05-16 13:42                                               ` Kevin Gross
2011-05-16 15:23                                                 ` Jim Gettys
     [not found]                                               ` <-854731558634984958@unknownmsgid>
2011-05-16 13:45                                                 ` Dave Taht
2011-05-16 18:36                                             ` Richard Scheffenegger
2011-05-16 18:11                                         ` [Bloat] Jumbo frames and LAN buffers (was: RE: Burst Loss) Richard Scheffenegger
2011-05-17  7:49                               ` BeckW
2011-05-17 14:16                                 ` Dave Taht
     [not found]                           ` <-4629065256951087821@unknownmsgid>
2011-05-13 20:21                             ` Dave Taht
2011-05-13 22:36                               ` Kevin Gross
2011-05-13 22:08                           ` [Bloat] Burst Loss david
2011-05-13 19:32                         ` Denton Gentry
2011-05-13 20:47                           ` Rick Jones
2011-05-06  4:18           ` [Bloat] Goodput fraction w/ AQM vs bufferbloat Fred Baker
2011-05-06 15:14             ` richard
2011-05-06 21:56               ` Fred Baker
2011-05-06 22:10                 ` Stephen Hemminger
2011-05-07 16:39                   ` Jonathan Morton
2011-05-08  0:15                     ` Stephen Hemminger
2011-05-08  3:04                       ` Constantine Dovrolis
2011-05-08 13:00                 ` Richard Scheffenegger
2011-05-08 12:53               ` Richard Scheffenegger
2011-05-08 12:34             ` Richard Scheffenegger
2011-05-09  3:07               ` Fred Baker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://lists.bufferbloat.net/postorius/lists/bloat.lists.bufferbloat.net/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6E25D2CF-D0F0-4C41-BABC-4AB0C00862A6@pnsol.com \
    --to=neil.davies@pnsol.com \
    --cc=bloat@lists.bufferbloat.net \
    --cc=shemminger@vyatta.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox