Historic archive of defunct list bloat-devel@lists.bufferbloat.net
From: Dave Taht <dave.taht@gmail.com>
To: Rick Jones <rick.jones2@hp.com>
Cc: bloat-devel <bloat-devel@lists.bufferbloat.net>
Subject: Re: oprofiling is much saner looking now with rc6-smoketest
Date: Tue, 30 Aug 2011 20:28:28 -0700
Message-ID: <CAA93jw668dXfLGpKmjcWOZuwT5binZX=P0SON5EkdWFPuGP0qA@mail.gmail.com>
In-Reply-To: <CAA93jw4Ug9EucfnuO=oc93UwcDKm6c-JjY7RjK94nJt0VKT+=Q@mail.gmail.com>

I took a little more time out to play with netperf at these extreme
performance values, while puzzled about the performance knee observed
midway through the previous tests.

The three test runs from this evening (and captures!) are up at:

http://huchra.bufferbloat.net/~d/rc6-smoke-captures/

For test 3, I rebooted the router into its default tx ring (64), and
set a txqueuelen of 128, running cubic...
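
A minimal sketch of how settings like these might be applied, not the
exact commands used for the test. The interface name (eth0) is an
assumption, since the message does not name one, and it does not say
which host the cubic setting was made on. The tx ring is left alone
because it was at its boot default. By default the script only prints
the commands; set APPLY=1 to actually run them.

```shell
#!/bin/sh
# Sketch only. IFACE is hypothetical; adjust it for the router under test.
IFACE="${IFACE:-eth0}"
TXQLEN="${TXQLEN:-128}"   # txqueuelen used for test 3
CC="${CC:-cubic}"         # TCP congestion control algorithm

run() {
    # Print the command; execute it only when APPLY=1.
    echo "+ $*"
    if [ "${APPLY:-0}" = "1" ]; then
        "$@"
    fi
}

apply_test3_settings() {
    # Set the interface transmit queue length.
    run ip link set dev "$IFACE" txqueuelen "$TXQLEN"
    # Select the congestion control algorithm for new TCP connections.
    run sysctl -w "net.ipv4.tcp_congestion_control=$CC"
}

apply_test3_settings
```

Running it as `IFACE=eth0 APPLY=1 sh ./script.sh` (as root) would make
the changes; without APPLY=1 it is a dry run.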

Measured throughput was mildly better at 229Mbit (admittedly on a fresh
boot, with oprofile not even loaded), and we didn't have a drop off at
all, so I'm still chasing that...

What I found interesting was the 10 second periodicity of the
drop-offs. My assumption is that this is a timer being fired from
somewhere (netperf?) that blocks the transmission...

http://huchra.bufferbloat.net/~d/rc6-smoke-captures/txqueuelen128and10seconddropcycle.png
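
One way the spacing of drop-offs like these could be checked, assuming
throughput has been sampled into a two-column text stream (time in
seconds, then Mbit/s). That input format, the function name
find_dropoffs, and the 100 Mbit threshold are illustrative assumptions;
none of them come from the captures above.

```shell
#!/bin/sh
# find_dropoffs [THRESH]
# Reads "time_seconds mbit" pairs on stdin. Prints the time of each sample
# that falls below THRESH right after a sample at or above it (the start of
# a drop-off), plus the gap since the previous drop-off start.
find_dropoffs() {
    awk -v thresh="${1:-100}" '
        NF >= 2 {
            low = ($2 + 0 < thresh + 0)
            if (low && !was_low) {
                if (seen)
                    printf "drop at %ss, %ss after previous\n", $1, $1 - last
                else
                    printf "drop at %ss\n", $1
                last = $1
                seen = 1
            }
            was_low = low
        }'
}

# Usage (samples.txt is a hypothetical file in the format described above):
#   find_dropoffs 100 < samples.txt
```

If the gaps it prints cluster around 10 seconds, that would be
consistent with a periodic timer rather than random stalls.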

Test 4 will repeat the above sans oprofile, with the current default
cerowrt settings for dma tx (4) and txqueuelen 8, if I get to it
tonight.

On Tue, Aug 30, 2011 at 6:58 PM, Dave Taht <dave.taht@gmail.com> wrote:
> I have put the current rc6 smoketest up at:
>
> http://huchra.bufferbloat.net/~cero1/rc6-smoketest/
>
> So far it's proving very stable. Wireless performance is excellent and
> wired performance dramatically improved. No crash bugs thus far,
> though I had a scare...
>
> For the final rc6, which I hope to have done by Friday, I'm in the
> process of cleanly re-assembling the patch set (sorry, the sources are
> a bit of a mess at present). For this rc, I'm hoping that a new
> iptables lands, in particular, and I have numerous other little things
> in the queue to sort out.
>
> All that said, getting oprofile running is not hard, and I do
> appreciate smoke testers helping out!!! as I don't think I'll be able
> to get another release candidate done before Linux Plumbers.
>
> install the correct image on your router from the above via web
> interface or sysupgrade -n
> reboot
> edit /etc/opkg.conf to have that url in it
> opkg update
> opkg install oprofile
> cd /tmp
> mkdir /tmp/oprofile
> wget http://huchra.bufferbloat.net/~d/rc6-smoke-captures/vmlinux
> # keep the session dir in /tmp; saving profile data to flash is a bad idea
> opcontrol --vmlinux=/tmp/vmlinux --session-dir=/tmp/oprofile
>
> opcontrol --start
> # do your testing
> opcontrol --dump
>
> opreport -c # or whatever options you like.
>
>
> On Tue, Aug 30, 2011 at 6:45 PM, Dave Taht <dave.taht@gmail.com> wrote:
>> On Tue, Aug 30, 2011 at 6:01 PM, Rick Jones <rick.jones2@hp.com> wrote:
>>> On 08/30/2011 05:32 PM, Dave Taht wrote:
>>
>>>> It bugs me that iptables and conntrack eat so much cpu for what
>>>> is an internal-only connection, e.g. one that
>>>> doesn't need conntracking.
>>>
>>> The csum_partial is a bit surprising - I thought every NIC and its dog
>>> offered CKO these days - or is that something happening with
>>> ip_tables/conntrack?
>>
>> If this chipset supports it, so far as I know, it isn't documented or
>> implemented.
>>
>>> I also thought that Linux used an integrated
>>> copy/checksum in at least one direction, or did that go away when CKO became
>>> prevalent?
>>
>> Don't know.
>>
>>>
>>> If this is inbound, and there is just plain checksumming and not anything
>>> funny from conntrack, I would have expected checksum to be much larger than
>>> copy.  Checksum (in the inbound direction) will take the cache misses and
>>> the copy would not.  Unless... the data cache of the processor is getting
>>> completely trashed - say from the netserver running on the router not
>>> keeping up with the inbound data fully and so the copy gets "far away" from
>>> the checksum verification.
>>
>> 220Mbit isn't good enough for ya? Previous tests ran at about 140Mbit;
>> the gain is due to some major optimizations by Felix to fix a bunch of
>> mis-alignment issues. Through the router, I've seen 260Mbit - which is
>> perilously close to the speed that I can drive it at from the test boxes.
>>
>>>
>>> Does perf/perf_events (whatever the follow-on to perfmon2 is called) have
>>> support for the CPU used in the device?  (Assuming it even has a PMU to be
>>> queried in the first place)
>>
>> Yes. Don't think it's enabled. It is running flat out, according to top.
>>
>>>
>>>> That said, I understand that people like their statistics, and me,
>>>> I'm trying to make split-tcp work better, ultimately, one day....
>>>>
>>>> I'm going to rerun this without the fw rules next.
>>>
>>> It would be interesting to see if the csum time goes away.  Long ago and far
>>> away when I was beating on a 32-core system with aggregate netperf TCP_RR
>>> and enabling or not FW rules, conntrack had a non-trivial effect indeed on
>>> performance.
>>
>> Stays about the same. iptables time drops. How to disable conntrack?
>> Don't you only really need it for NAT?
>>
>>>
>>> http://markmail.org/message/exjtzel7vq2ugt66#query:netdev%20conntrack%20rick%20jones%2032%20netperf+page:1+mid:s5v5kylvmlfrpb7a+state:results
>>>
>>> I think that will get to the start of that thread.  The subject is '32 core
>>> net-next stack/netfilter "scaling"'
>>>
>>> rick jones
>>>
>>
>>
>>
>> --
>> Dave Täht
>> SKYPE: davetaht
>> US Tel: 1-239-829-5608
>> http://the-edge.blogspot.com
>>



-- 
Dave Täht
SKYPE: davetaht
US Tel: 1-239-829-5608
http://the-edge.blogspot.com

