General list for discussing Bufferbloat
* [Bloat] Measuring Latency
@ 2014-09-13 19:41 Hal Murray
  2014-09-13 19:48 ` Sebastian Moeller
  0 siblings, 1 reply; 8+ messages in thread
From: Hal Murray @ 2014-09-13 19:41 UTC (permalink / raw)
  To: bloat; +Cc: Hal Murray


> When reading it, it strikes me, that you don't directly tell them what to
> do; e.g. add a latency test during upload and download.  ...

Does round trip latency have enough info, or do you need to know how much is 
contributed by each direction?

If I gave you a large collection of latency data from a test run, how do you 
reduce it to something simple that a marketer could compare with the results 
from another test run?


-- 
These are my opinions.  I hate spam.





* Re: [Bloat] Measuring Latency
  2014-09-13 19:41 [Bloat] Measuring Latency Hal Murray
@ 2014-09-13 19:48 ` Sebastian Moeller
  2014-09-13 23:32   ` Jonathan Morton
  0 siblings, 1 reply; 8+ messages in thread
From: Sebastian Moeller @ 2014-09-13 19:48 UTC (permalink / raw)
  To: Hal Murray; +Cc: bloat

Hi Hal,


On Sep 13, 2014, at 21:41, Hal Murray <hmurray@megapathdsl.net> wrote:

> 
>> When reading it, it strikes me, that you don't directly tell them what to
>> do; e.g. add a latency test during upload and download.  ...
> 
> Does round trip latency have enough info, or do you need to know how much is 
> contributed by each direction?

	RTT is fine; uni-directional transfer time would be too good to be true ;). The “trick” is to measure RTT without load and under hopefully link-saturating load, then look at the difference in average RTT and in the RTT distributions (so 3 numbers: 2% quantile, average, and 98% quantile). I think it really is that simple...
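The three-number summary described here could be computed along these lines - a sketch in plain Python, with made-up RTT values standing in for real probe data:

```python
# Sketch of the three-number RTT summary described above.  The probe
# collection itself is not shown; the RTT values here are invented.
def percentile(samples, p):
    """Nearest-rank percentile (p in 0..100) of the samples."""
    s = sorted(samples)
    return s[round(p / 100 * (len(s) - 1))]

def summarize(rtts_ms):
    return {
        "p02": percentile(rtts_ms, 2),
        "avg": sum(rtts_ms) / len(rtts_ms),
        "p98": percentile(rtts_ms, 98),
    }

idle   = [20, 21, 22, 23, 25]        # RTTs without load (ms)
loaded = [60, 120, 250, 400, 800]    # RTTs under saturating load (ms)

# The interesting number: how much latency the load adds on average.
delta_avg = summarize(loaded)["avg"] - summarize(idle)["avg"]
print(summarize(idle), summarize(loaded), delta_avg)
```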

> 
> If I gave you a large collection of latency data from a test run, how do you 
> reduce it to something simple that a marketer could compare with the results 
> from another test run?

	I believe the added latency under load would be a marketable number, but we had a discussion in the past where it was argued that marketing wants a number that increases with goodness, so larger = better - something the raw difference is not going to deliver…

Best Regards
	Sebastian

> 
> 
> -- 
> These are my opinions.  I hate spam.
> 
> 
> 
> _______________________________________________
> Bloat mailing list
> Bloat@lists.bufferbloat.net
> https://lists.bufferbloat.net/listinfo/bloat



* Re: [Bloat] Measuring Latency
  2014-09-13 19:48 ` Sebastian Moeller
@ 2014-09-13 23:32   ` Jonathan Morton
  2014-09-14 14:31     ` Neil Davies
       [not found]     ` <1410691150.20919.YahooMailNeo@web122503.mail.ne1.yahoo.com>
  0 siblings, 2 replies; 8+ messages in thread
From: Jonathan Morton @ 2014-09-13 23:32 UTC (permalink / raw)
  To: Sebastian Moeller; +Cc: Hal Murray, bloat

>>> When reading it, it strikes me, that you don't directly tell them what to
>>> do; e.g. add a latency test during upload and download.  ...
>> 
>> Does round trip latency have enough info, or do you need to know how much is 
>> contributed by each direction?
> 
> RTT is fine, uni-directional transfer time would be too good to be true ;).

To expand on this: to measure one-way delay, you would need finely synchronised clocks (to within a couple of ms) at both ends.  The average consumer doesn't have that sort of setup - not even if they happen to use NTP.  So it's not a goal worth pursuing at this level - save it for scientific investigations, where the necessary effort and equipment can be made available.

>> If I gave you a large collection of latency data from a test run, how do you 
>> reduce it to something simple that a marketer could compare with the results 
>> from another test run?
> 
> 	I believe the added latency under load would be a marketable number, but we had a discussion in the past where it was argued that marketing wants a number which increases with goodness, so larger = better, something the raw difference is not going to deliver…

The obvious solution is to report the inverse value in Hz, a figure of merit that gamers are familiar with (if not exactly in this context).

For example, I occasionally play World Of Tanks, which has a latency meter (in ms) in the top corner, and I can roughly invert that in my head - if I set my shaper too optimistically, I get something like 500ms if something is updating in the background, but this goes down immediately to 100ms once I adjust the setting to a more conservative value - it's a 3G connection, so it's not going to get much better than that.  The corresponding inverted readings would be 2Hz (where I miss most of my shots) and 10Hz (where I can carry the team).  It's probably worth reporting to one decimal place.
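The mental inversion is just 1000 divided by the latency in milliseconds; as a minimal sketch (the function name is mine):

```python
def response_hz(latency_ms):
    """Invert a latency reading (ms) into a response frequency (Hz)."""
    return round(1000.0 / latency_ms, 1)  # one decimal place, as suggested

print(response_hz(500))  # prints 2.0 - the too-optimistic shaper setting
print(response_hz(100))  # prints 10.0 - the conservative setting
```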

WoT isn't exactly the "twitchiest" of online games, either - have you any idea how long it takes to aim a tank gun?  Even so, when some tanks can move at over 30km/h, a half-second difference in position is a whole tank length, so with the slower response I no longer know *where* or *when* to fire, unless the target is a sitting duck.  Even though my framerate is at 30Hz or more and appears smooth, my performance as a player is dependent on the Internet's response frequency, because that is lower.


So here's the outline of a step-by-step methodology:

- Prepare space for a list of latency measurements.  Each measurement needs to be tagged with information about what else was going on at the same time, ie. idle/upload/download/both.  Some latency probes may be lost, and this fact should also be recorded on a per-tag basis.

- Start taking latency measurements, tagging as idle to begin with.  Keep on doing so continuously, changing the tag as required, until several seconds after the bandwidth measurements are complete.

- Latency measurements should be taken sufficiently frequently (several times a second is okay) that there will be at least a hundred samples with each tag, and the frequency of sampling should not change during the test.  Each probe must be tagged with a unique ID, so that losses or re-ordering of packets can be detected and don't confuse the measurement.

- The latency probes should use UDP, not ICMP.  They should also use the same Diffserv/TOS tag as the bandwidth measurement traffic; the default "best-effort" tag is fine.

- To satisfy the above requirements, the latency tester must *not* wait for a previous reply to return before sending the next one.  It should send at regular intervals based on wall-clock time.  But don't send so many probes that they themselves clog the link.

- Once several seconds of "idle" samples are recorded, start the download test.  Change the tag to "download" at this point.

- The download test is complete when all the data sent by the server has reached the client.  Change the tag back to "idle" at this moment.

- Repeat the previous two steps for the upload measurement, using the "upload" tag.

- Repeat again, but perform upload and download tests at the same time (predicting, if necessary, that the bandwidth in each direction should be similar to that previously measured), and use the "both" tag.  Uploads and downloads tend to interfere with each other when the loaded response frequency is poor, so don't simply assume that the results will be the same as in the individual tests - *measure* it.

- Once a few seconds of "idle" samples have been taken, stop measuring and start analysis.

- Separate the list of latency samples by tag, and sort the four resulting lists in ascending order.

- In each list, find the sample nearest 98% of the way through the list.  This is the "98th percentile", a good way of finding the "highest" value while discarding irrelevant outliers.  The highest latency is the one that users will notice.  Typical poor results: idle 50ms, download 250ms, upload 500ms, both 800ms.

- Correct the 98th-percentile latencies for packet loss by multiplying each by the number of probes *sent* with the appropriate tag, then dividing by the number of probes *received* with that tag.  It is not necessary to report packet loss in any other way, *except* for the "idle" tag.

- Convert the corrected 98th-percentile latencies into "response frequencies" by dividing one second by them.  The typical figures above would become: idle 20.0 Hz, download 4.0 Hz, upload 2.0 Hz, both 1.25 Hz - assuming there was no packet loss.  These figures are comparable in meaning and importance to "frames per second" figures in games.

- Report these response frequencies, to a precision of at least one decimal place, alongside - and with equal visual importance to - the bandwidth figures.  For example:

	IDLE:      Response  20.0 Hz     Packet loss   0.00 %
	DOWNLOAD:  Response   4.0 Hz     Bandwidth    20.00 Mbps
	UPLOAD:    Response   2.0 Hz     Bandwidth     2.56 Mbps
	BIDIRECT:  Response   1.3 Hz     Bandwidth    15.23 / 2.35 Mbps

- Improving the response figures in the loaded condition will probably also improve the *bidirectional* bandwidth figures as a side-effect, while having a minimal effect on the *unidirectional* figures.  A connection with such qualities can legitimately be described as supporting multiple activities at the same time.  A connection with the example figures shown above can *not*.
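The analysis steps above can be sketched in a few lines of Python (the tag names follow the text; the sample data and function names are invented for illustration, not taken from any existing tool):

```python
# Sketch of the analysis: per-tag 98th percentile, loss correction,
# then inversion into a response frequency in Hz.
def p98(samples):
    """98th percentile by nearest rank - the 'highest' value sans outliers."""
    s = sorted(samples)
    return s[round(0.98 * (len(s) - 1))]

def analyse(samples_by_tag, sent, received):
    report = {}
    for tag, rtts in samples_by_tag.items():
        latency_ms = p98(rtts)
        # Loss correction: scale by probes sent / probes received.
        corrected = latency_ms * sent[tag] / received[tag]
        report[tag] = round(1000.0 / corrected, 2)  # response frequency, Hz
    return report

samples = {"idle": [50] * 100, "download": [250] * 100,
           "upload": [500] * 100, "both": [800] * 100}
sent = {tag: 100 for tag in samples}
received = {tag: 100 for tag in samples}  # no loss in this example

print(analyse(samples, sent, received))
# prints {'idle': 20.0, 'download': 4.0, 'upload': 2.0, 'both': 1.25}
```

With, say, 20 of the 100 download probes lost, the corrected download figure drops from 4.0 Hz to 3.2 Hz - exactly the penalty the loss-correction step is meant to expose.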


The next trick is getting across the importance of acknowledging that more than one person in the average household wants to use the Internet at the same time these days, and they often want to do different things from each other.  In this case, the simplest valid argument probably has a lot going for it.

An illustration might help to get the concept across.  A household with four users in different rooms: father in the study downloading a video, mother in the kitchen on the (VoIP) phone, son in his bedroom playing a game, daughter in her bedroom uploading photos.  All via a single black-box modem and connection.  Captions should emphasise that mother and son both need low latency (high response frequency), while father and daughter need high bandwidth (in opposite directions!), and that they're doing all these things at the same time.

 - Jonathan Morton



* Re: [Bloat] Measuring Latency
  2014-09-13 23:32   ` Jonathan Morton
@ 2014-09-14 14:31     ` Neil Davies
  2014-09-14 16:55       ` Sebastian Moeller
  2014-09-14 16:57       ` Jonathan Morton
       [not found]     ` <1410691150.20919.YahooMailNeo@web122503.mail.ne1.yahoo.com>
  1 sibling, 2 replies; 8+ messages in thread
From: Neil Davies @ 2014-09-14 14:31 UTC (permalink / raw)
  To: Jonathan Morton; +Cc: Hal Murray, bloat

Gents,

This is not actually true - you can measure one-way delays without completely accurately synchronised clocks (they have to be reasonably precise, not accurate) - see CERN thesis at http://goo.gl/ss6EBq

It is possible, with appropriate measurements, to construct arguments that make marketeers salivate (or the appropriate metaphor) - you can compare the relative effects of technology, location and instantaneous congestion. See slideshare at http://goo.gl/6vytmD

Neil

On 14 Sep 2014, at 00:32, Jonathan Morton <chromatix99@gmail.com> wrote:

>>>> When reading it, it strikes me, that you don't directly tell them what to
>>>> do; e.g. add a latency test during upload and download.  ...
>>> 
>>> Does round trip latency have enough info, or do you need to know how much is 
>>> contributed by each direction?
>> 
>> RTT is fine, uni-directional transfer time would be too good to be true ;).
> 
> To expand on this: to measure one-way delay, you would need finely synchronised clocks (to within a couple of ms) at both ends.  The average consumer doesn't have that sort of setup - not even if they happen to use NTP.  So it's not a goal worth pursuing at this level - save it for scientific investigations, where the necessary effort and equipment can be made available.
> 
>>> If I gave you a large collection of latency data from a test run, how do you 
>>> reduce it to something simple that a marketer could compare with the results 
>>> from another test run?
>> 
>> 	I believe the added latency under load would be a marketable number, but we had a discussion in the past where it was argued that marketing wants a number which increases with goodness, so larger = better, something the raw difference is not going to deliver…
> 
> The obvious solution is to report the inverse value in Hz, a figure of merit that gamers are familiar with (if not exactly in this context).
> [...]



* Re: [Bloat] Measuring Latency
  2014-09-14 14:31     ` Neil Davies
@ 2014-09-14 16:55       ` Sebastian Moeller
  2014-09-14 17:26         ` Neil Davies
  2014-09-14 16:57       ` Jonathan Morton
  1 sibling, 1 reply; 8+ messages in thread
From: Sebastian Moeller @ 2014-09-14 16:55 UTC (permalink / raw)
  To: Neil Davies; +Cc: Hal Murray, bloat

Hi Neil,


On Sep 14, 2014, at 16:31, Neil Davies <neil.davies@pnsol.com> wrote:

> Gents,
> 
> This is not actually true - you can measure one-way delays without completely accurately synchronised clocks (they have to be reasonably precise, not accurate) - see CERN thesis at http://goo.gl/ss6EBq

	I might have misunderstood the thesis, but the requirement of 1000s of samples somehow does not work well with the use case we have been discussing in this thread: improvement of speed tests so that they include latency-under-load measurements.  Also, looking at the thesis I am a bit unsure about the one-way delay measurement method: it relies on using the minimum one-way delay times. My own observations of RTTs for ATM quantization show that even with 1000 samples per packet size the min is a much worse estimator than the median, so for their method to work over the open Internet we are talking about gigantic numbers of samples… (now admittedly I might have screwed up royally with my own analysis; I do this for a hobby only). Cool thesis, nice and impressive work, but not the best fit for the quest for a better speed test, I guess…

Best Regards
	Sebastian


> 
> It is possible, with appropriate measurements, to construct arguments that make marketeers salivate (or the appropriate metaphor) - you can compare the relative effects of technology, location and instantaneous congestion. See slideshare at http://goo.gl/6vytmD
> 
> Neil
> 
> On 14 Sep 2014, at 00:32, Jonathan Morton <chromatix99@gmail.com> wrote:
> 
>>>>> When reading it, it strikes me, that you don't directly tell them what to
>>>>> do; e.g. add a latency test during upload and download.  ...
>>>> 
>>>> Does round trip latency have enough info, or do you need to know how much is 
>>>> contributed by each direction?
>>> 
>>> RTT is fine, uni-directional transfer time would be too good to be true ;).
>> 
>> To expand on this: to measure one-way delay, you would need finely synchronised clocks (to within a couple of ms) at both ends.  The average consumer doesn't have that sort of setup - not even if they happen to use NTP.  So it's not a goal worth pursuing at this level - save it for scientific investigations, where the necessary effort and equipment can be made available.
>> [...]



* Re: [Bloat] Measuring Latency
  2014-09-14 14:31     ` Neil Davies
  2014-09-14 16:55       ` Sebastian Moeller
@ 2014-09-14 16:57       ` Jonathan Morton
  1 sibling, 0 replies; 8+ messages in thread
From: Jonathan Morton @ 2014-09-14 16:57 UTC (permalink / raw)
  To: Neil Davies; +Cc: Hal Murray, bloat


On 14 Sep, 2014, at 5:31 pm, Neil Davies wrote:

> This is not actually true - you can measure one-way delays without completely accurately synchronised clocks (they have to be reasonably precise, not accurate) - see CERN thesis at http://goo.gl/ss6EBq

I read the abstract of that, and came away with the distinct impression that I wouldn't learn anything from reading the rest of the paper.  Not that there *isn't* good information there, but that it's most likely not in a form that I can digest.

And that means it's probably impractical to implement on a consumer broadband test.  Timestamps - sure, why not - but I don't yet see what you could do with them.

> It is possible, with appropriate measurements, to construct arguments that make marketeers salivate (or the appropriate metaphor) - you can compare the relative effects of technology, location and instantaneous congestion. See slideshare at http://goo.gl/6vytmD

I'm sure those must be a different breed of marketing types than I have in mind.  There are major ISPs who claim that 3% packet loss "is not a fault" - on an idle wire line, not wireless, not congested.  They are all about sales and retention by brute force and semi-monopoly position, not by genuinely providing superior service.

Hence we have to turn to the external consumer-oriented organisations, the speed-test sites among them.  They will have to serve as *our* marketing tool.

 - Jonathan Morton



* Re: [Bloat] Measuring Latency
  2014-09-14 16:55       ` Sebastian Moeller
@ 2014-09-14 17:26         ` Neil Davies
  0 siblings, 0 replies; 8+ messages in thread
From: Neil Davies @ 2014-09-14 17:26 UTC (permalink / raw)
  To: Sebastian Moeller; +Cc: Hal Murray, bloat

Ok, 

if you take a look at §B2 in http://goo.gl/ss6EBq, it shows how to perform the correction for the clock difference given two sets of measurements of delay (or more specifically delay and loss - the quality attenuation, or ∆Q); §4.2 to §4.4 give a semi-formal explanation (http://goo.gl/ss6EBq does this in an even more informal way) of how to extract the basis set for the way in which quality attenuation (and hence delay) accrues in networks.

The point is that there are three fundamental components of how this delay "accrues" - G, S and V. It is the 'V' that congestion affects; G and S are the bits that are "fixed" for a given endpoint. The way G and S are calculated means that you can get pretty accurate estimators for them in a few (5 or so) samples, noting that the minimum times being used are *per packet size* (or, in the case of things like ATM, per quantisation size). The rate at which the estimator tracks the true G and S (and hence, once those effects are subtracted from the observed values, the magnitude and stability of V) is very high; the theoretical discussion makes very limited assumptions about the distribution of V (and hence about the underlying cause of the delay).

The "speed" of a network connection is really the point at which the 'V' starts to increase rapidly with the offered load. If you look at slides 19 and 20 of http://goo.gl/ss6EBq you can see the measurement effects of the total offered load exceeding the capacity of a multiplexing point on the path - when the "speed" (offered load) was too high. It is the measurement of the trend of 'V' that gives you the turning point, that is, "the speed".
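A rough sketch of one reading of this decomposition - assuming the model delay(size) ≈ G + S*size + V, with G and S estimated from a straight-line fit through per-packet-size minimum delays. The function names and sample numbers below are invented for illustration, not taken from the thesis:

```python
# delay(size) ≈ G + S*size + V: fit a line through per-size minimum
# delays; the intercept estimates G, the slope estimates S, and whatever
# remains above the line is the variable, congestion-sensitive part V.
def fit_g_s(min_delay_by_size):
    """Least-squares line through (packet size, minimum delay) points."""
    xs, ys = zip(*sorted(min_delay_by_size.items()))
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    s = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    return my - s * mx, s  # (G estimate, S estimate)

def v_component(size, delay_ms, g, s):
    """Delay left over once the fixed G and size-dependent S are removed."""
    return delay_ms - (g + s * size)

# Hypothetical per-size minimum delays (ms) from a handful of samples.
mins = {64: 10.64, 512: 15.12, 1500: 25.0}
g, s = fit_g_s(mins)
```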

Neil

On 14 Sep 2014, at 17:55, Sebastian Moeller <moeller0@gmx.de> wrote:

> Hi Neil,
> 
> 
> On Sep 14, 2014, at 16:31 , Neil Davies <neil.davies@pnsol.com> wrote:
> 
>> Gents,
>> 
>> This is not actually true - you can measure one-way delays without completely accurately synchronised clocks (they have to be reasonably precise, not accurate) - see CERN thesis at http://goo.gl/ss6EBq
> 
> 	I might have misunderstood the thesis, but the requirement of 1000s of samples somehow does not work well with the use case we have been discussing in this thread, improving speed tests so that they include latency-under-load measurements. Also, looking at the thesis, I am a bit unsure about the one-way delay measurement method: it relies on using the minimum one-way delay times. My own observations of RTTs for ATM quantization show that even with 1000 samples per packet size the minimum is a much worse estimator than the median, so for their method to work over the open Internet we are talking about gigantic numbers of samples… (now, admittedly, I might have screwed up royally in my own analysis; I do this for a hobby only). Cool thesis, nice and impressive work, but not the best fit for the quest for a better speedtest, I guess…
> 
> Best Regards
> 	Sebastian
> 
> 
>> 
>> It is possible, with appropriate measurements, to construct arguments that make marketeers salivate (or the appropriate metaphor) - you can compare the relative effects of technology, location and instantaneous congestion. See slideshare at http://goo.gl/ss6EBq
>> 
>> Neil
>> 
>> On 14 Sep 2014, at 00:32, Jonathan Morton <chromatix99@gmail.com> wrote:
>> 
>>>>>> When reading it, it strikes me, that you don't directly tell them what to
>>>>>> do; e.g. add a latency test during upload and download.  ...
>>>>> 
>>>>> Does round trip latency have enough info, or do you need to know how much is 
>>>>> contributed by each direction?
>>>> 
>>>> RTT is fine, uni-directional transfer time would be too good to be true ;).
>>> 
>>> To expand on this: to measure one-way delay, you would need finely synchronised clocks (to within a couple of ms) at both ends.  The average consumer doesn't have that sort of setup - not even if they happen to use NTP.  So it's not a goal worth pursuing at this level - save it for scientific investigations, where the necessary effort and equipment can be made available.
>>> 
>>>>> If I gave you a large collection of latency data from a test run, how do you 
>>>>> reduce it to something simple that a marketer could compare with the results 
>>>>> from another test run?
>>>> 
>>>> 	I believe the added latency under load would be a marketable number, but we had a discussion in the past where it was argued that marketing wants a number which increases with goodness, so larger = better, something the raw difference is not going to deliver…
>>> 
>>> The obvious solution is to report the inverse value in Hz, a figure of merit that gamers are familiar with (if not exactly in this context).
>>> 
>>> For example, I occasionally play World Of Tanks, which has a latency meter (in ms) in the top corner, and I can roughly invert that in my head - if I set my shaper too optimistically, I get something like 500ms if something is updating in the background, but this goes down immediately to 100ms once I adjust the setting to a more conservative value - it's a 3G connection, so it's not going to get much better than that.  The corresponding inverted readings would be 2Hz (where I miss most of my shots) and 10Hz (where I can carry the team).  It's probably worth reporting to one decimal place.
>>> 
>>> WoT isn't exactly the "twitchiest" of online games, either - have you any idea how long it takes to aim a tank gun?  Even so, when some tanks can move at over 30km/h, a half-second difference in position is a whole tank length, so with the slower response I no longer know *where* or *when* to fire, unless the target is a sitting duck.  Even though my framerate is at 30Hz or more and appears smooth, my performance as a player is dependent on the Internet's response frequency, because that is lower.
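The inversion Jonathan does in his head is trivial to compute; a minimal sketch using the readings from his example:

```python
def response_frequency(latency_ms):
    """Convert a latency in milliseconds to a 'response frequency' in Hz,
    the inverse figure of merit proposed above (larger = better)."""
    return 1000.0 / latency_ms

# The in-game latency readings described above:
print(round(response_frequency(500), 1))  # 2.0 Hz: missing most shots
print(round(response_frequency(100), 1))  # 10.0 Hz: carrying the team
```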
>>> 
>>> 
>>> So here's the outline of a step-by-step methodology:
>>> 
>>> - Prepare space for a list of latency measurements.  Each measurement needs to be tagged with information about what else was going on at the same time, ie. idle/upload/download/both.  Some latency probes may be lost, and this fact should also be recorded on a per-tag basis.
>>> 
>>> - Start taking latency measurements, tagging as idle to begin with.  Keep on doing so continuously, changing the tag as required, until several seconds after the bandwidth measurements are complete.
>>> 
>>> - Latency measurements should be taken sufficiently frequently (several times a second is okay) that there will be at least a hundred samples with each tag, and the frequency of sampling should not change during the test.  Each probe must be tagged with a unique ID, so that losses or re-ordering of packets can be detected and don't confuse the measurement.
>>> 
>>> - The latency probes should use UDP, not ICMP.  They should also use the same Diffserv/TOS tag as the bandwidth measurement traffic; the default "best-effort" tag is fine.
>>> 
>>> - To satisfy the above requirements, the latency tester must *not* wait for a previous reply to return before sending the next probe.  It should send at regular intervals based on wall-clock time.  But don't send so many probes that they themselves clog the link.
>>> 
>>> - Once several seconds of "idle" samples are recorded, start the download test.  Change the tag to "download" at this point.
>>> 
>>> - The download test is complete when all the data sent by the server has reached the client.  Change the tag back to "idle" at this moment.
>>> 
>>> - Repeat the previous two steps for the upload measurement, using the "upload" tag.
>>> 
>>> - Repeat again, but perform upload and download tests at the same time (predicting, if necessary, that the bandwidth in each direction should be similar to that previously measured), and use the "both" tag.  Uploads and downloads tend to interfere with each other when the loaded response frequency is poor, so don't simply assume that the results will be the same as in the individual tests - *measure* it.
>>> 
>>> - Once a few seconds of "idle" samples have been taken, stop measuring and start analysis.
>>> 
>>> - Separate the list of latency samples by tag, and sort the four resulting lists in ascending order.
>>> 
>>> - In each list, find the sample nearest 98% of the way through the list.  This is the "98th percentile", a good way of finding the "highest" value while discarding irrelevant outliers.  The highest latency is the one that users will notice.  Typical poor results: idle 50ms, download 250ms, upload 500ms, both 800ms.
>>> 
>>> - Correct the 98th-percentile latencies for packet loss: multiply each by the number of probes *sent* with the appropriate tag, then divide by the number of probes *received* with that tag.  It is not necessary to report packet loss in any other way, *except* for the "idle" tag.
>>> 
>>> - Convert the corrected 98-percentile latencies into "response frequencies" by dividing one second by them.  The typical figures above would become: idle 20.0 Hz, download 4.0 Hz, upload 2.0 Hz, both 1.25 Hz - assuming there was no packet loss.  These figures are comparable in meaning and importance to "frames per second" figures in games.
>>> 
>>> - Report these response frequencies, to a precision of at least one decimal place, alongside and with equal visual importance to, the bandwidth figures.  For example:
>>> 
>>> 	IDLE:      Response  20.0 Hz     Packet loss   0.00 %
>>> 	DOWNLOAD:  Response   4.0 Hz     Bandwidth    20.00 Mbps
>>> 	UPLOAD:    Response   2.0 Hz     Bandwidth     2.56 Mbps
>>> 	BIDIRECT:  Response   1.3 Hz     Bandwidth    15.23 / 2.35 Mbps
>>> 
>>> - Improving the response figures in the loaded condition will probably also improve the *bidirectional* bandwidth figures as a side-effect, while having a minimal effect on the *unidirectional* figures.  A connection with such qualities can legitimately be described as supporting multiple activities at the same time.  A connection with the example figures shown above can *not*.
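The analysis steps above can be sketched in code. This is a minimal sketch (function and variable names are mine, not from the thread), assuming latencies are recorded in seconds and grouped by tag, with per-tag counts of probes sent:

```python
def analyze(samples_by_tag, sent_by_tag):
    """samples_by_tag: tag -> list of latencies (seconds) for probes that
    came back; sent_by_tag: tag -> number of probes sent with that tag.
    Returns tag -> response frequency in Hz, per the steps above."""
    results = {}
    for tag, latencies in samples_by_tag.items():
        ordered = sorted(latencies)
        # "The sample nearest 98% of the way through the list".
        idx = min(len(ordered) - 1, round(0.98 * (len(ordered) - 1)))
        p98 = ordered[idx]
        # Loss correction: scale by probes sent / probes received.
        corrected = p98 * sent_by_tag[tag] / len(ordered)
        # Invert the corrected latency to get a frequency in Hz.
        results[tag] = 1.0 / corrected
    return results

# With no loss, 50 ms idle and 250 ms download latency give the
# "idle 20.0 Hz, download 4.0 Hz" figures quoted above.
report = analyze({'idle': [0.05] * 100, 'download': [0.25] * 100},
                 {'idle': 100, 'download': 100})
```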
>>> 
>>> 
>>> The next trick is getting across the importance of acknowledging that more than one person in the average household wants to use the Internet at the same time these days, and they often want to do different things from each other.  In this case, the simplest valid argument probably has a lot going for it.
>>> 
>>> An illustration might help to get the concept across.  A household with four users in different rooms: father in the study downloading a video, mother in the kitchen on the (VoIP) phone, son in his bedroom playing a game, daughter in her bedroom uploading photos.  All via a single black-box modem and connection.  Captions should emphasise that mother and son both need low latency (high response frequency), while father and daughter need high bandwidth (in opposite directions!), and that they're doing all these things at the same time.
>>> 
>>> - Jonathan Morton
>>> 
>>> _______________________________________________
>>> Bloat mailing list
>>> Bloat@lists.bufferbloat.net
>>> https://lists.bufferbloat.net/listinfo/bloat
>> 
> 


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [Bloat] Fw:  Measuring Latency
       [not found]     ` <1410691150.20919.YahooMailNeo@web122503.mail.ne1.yahoo.com>
@ 2014-09-14 17:51       ` Alex Burr
  0 siblings, 0 replies; 8+ messages in thread
From: Alex Burr @ 2014-09-14 17:51 UTC (permalink / raw)
  To: bloat




Oops, intended to CC the list...



> On Sunday, September 14, 2014 11:39 AM, Alex Burr <ajb44.geo@yahoo.com> wrote:
> > 
> 
> 
> 
> 
> On Sunday, September 14, 2014 12:32 AM, Jonathan Morton 
> <chromatix99@gmail.com> wrote:
> 
> 
>> 
>> 
>>  To expand on this: to measure one-way delay, you would need finely 
> synchronised clocks (to within a 
>>  couple of ms) at both ends.  The average consumer doesn't have that 
> sort of setup - not even if they 
>>  happen to use NTP.  So it's not a goal worth pursuing at this level - 
> save it for scientific 
> investigations, where the necessary effort and equipment can be made 
> available.
> 
> 
> This is true for absolute one way delay. But since we are interested in the 
> variation
> with load, it's actually easier. When you take the difference of two delays, 
> the synchronisation offset cancels out, so you are left with just the drift. 
> That may still be an insurmountable problem, but I would suggest at least 
> putting timestamps in the packets, even if you don't estimate one-way delay 
> up-front. That way people can try and get clever with clients later, without 
> having to change the server.
> 
> Alex 
> 
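Alex's point about the offset cancelling can be shown in a few lines. This is a hypothetical illustration (the numbers are invented): with a constant clock offset between the two ends, the absolute one-way delays are wrong, but the *increase* under load is exact.

```python
# Receiver's clock runs a constant, unknown amount ahead of the sender's.
OFFSET_MS = 37.5

def measured_one_way_delay(true_delay_ms):
    """One-way delay as computed from the probe's send timestamp and
    the receiver's (offset) clock."""
    return true_delay_ms + OFFSET_MS

idle = measured_one_way_delay(20.0)     # true idle delay: 20 ms
loaded = measured_one_way_delay(220.0)  # true loaded delay: 220 ms
# Both readings are wrong by OFFSET_MS, but the offset cancels in the
# difference, which is exactly the load-induced delay (clock drift,
# which this sketch ignores, is what remains in practice).
delta = loaded - idle  # 200.0 ms
```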


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2014-09-14 17:51 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-09-13 19:41 [Bloat] Measuring Latency Hal Murray
2014-09-13 19:48 ` Sebastian Moeller
2014-09-13 23:32   ` Jonathan Morton
2014-09-14 14:31     ` Neil Davies
2014-09-14 16:55       ` Sebastian Moeller
2014-09-14 17:26         ` Neil Davies
2014-09-14 16:57       ` Jonathan Morton
     [not found]     ` <1410691150.20919.YahooMailNeo@web122503.mail.ne1.yahoo.com>
2014-09-14 17:51       ` [Bloat] Fw: " Alex Burr

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox