From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wm0-x22e.google.com (mail-wm0-x22e.google.com [IPv6:2a00:1450:400c:c09::22e]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by lists.bufferbloat.net (Postfix) with ESMTPS id 84FBD3B2A4 for ; Mon, 6 Nov 2017 19:32:23 -0500 (EST) Received: by mail-wm0-x22e.google.com with SMTP id s66so432617wmf.5 for ; Mon, 06 Nov 2017 16:32:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date:cc:message-id:references :to; bh=ZmOJagTdWRSPb/wjHMFIASW+7FgODih8+TIt6tA08Oo=; b=Zm32JtKwuGSfuFNjS2K1PjCBQCa0KRU4qYNzLve8vtnkT7i1Ms9Sq7mcKeKHHAIXq3 EVIA8YqCr2LSIbpfMDS6FsA3q6vquHAjVATzfyQYEgd8ytPRHifxI76fGn7z4lGtVr+9 QdsMXrKb4tRhrXrmxiKovXWI+/fL1OLz9inygHC5Cy0xdXiGpBqqWevWX+edS7z/ssBU 8ZwHFb+OmUdW8vYyTKgRu1uFDiZ0U2Lgy2m6gXMdoCC8AcyqNCuFQvJ280GZWnME/3xw Gm+no7AsUrwLYCHoBUn6UX70L+lDoenuwVl17ZqCA62cIimaahDQYhLfc/DQATjMGNj2 Fx0g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :message-id:references:to; bh=ZmOJagTdWRSPb/wjHMFIASW+7FgODih8+TIt6tA08Oo=; b=LjiHj1HDpSpZ+4UjiDZCn8aJw4/7LjivH0PHzHA7IAjqeQs7Vn1aUKa/htXWl34Ep3 whpDGZLH8pmgQNni7u86YXGsSzhmkTDNQOO9ZkThj1QeninuFwqf5f3ex0RziM4Fbr2g UZbxNK5A2xx94GykChWgvtyGp/GKbHSPC9039KILD5/sF65ji0sQeQ7KQ0zLvkcssiI1 yNNaokVBLKnbG2I5CcRHGNznzqTCYxf2BbBPF58Txpq8wFlU/TuedCAbhmTjl74mOP+u vK/WRgtYf9BV8AE6IqA1qOaA99FLfXM0aCyQEYr3BS0RV4G047wCg3QzyOnDFIokiGBe bvnA== X-Gm-Message-State: AJaThX5fRRCy+AG1A8XRA8gT+lgiXy4iDIWbRm69mEsMNL/4LtoAAiq2 mtLzwSzeWuIXmT4MMyvLeXYnQW4F X-Google-Smtp-Source: ABhQp+S//i/cA8urII1d95iBMFbLS3zrGBjR278P8CimBBOkV29Q/RwYHU2/0XbVodLRW+JyDIBl+A== X-Received: by 10.28.63.9 with SMTP id m9mr7094060wma.137.1510014742453; Mon, 06 Nov 2017 16:32:22 -0800 (PST) Received: from [10.72.0.20] (h-1169.lbcfree.net. [185.99.119.68]) by smtp.gmail.com with ESMTPSA id y145sm164901wmd.40.2017.11.06.16.32.21 (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Mon, 06 Nov 2017 16:32:21 -0800 (PST) Content-Type: multipart/alternative; boundary="Apple-Mail=_09D44BE8-B57F-4CDB-B9A2-34F2D7AC7BE8" Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\)) From: Pete Heist In-Reply-To: Date: Tue, 7 Nov 2017 01:32:19 +0100 Cc: make-wifi-fast@lists.bufferbloat.net Message-Id: References: <535CD0D7-CB3A-4C42-AB13-548313F9002A@gmail.com> <3FA33ADA-E5C8-4DD1-841E-3A8E558BAADC@gmail.com> <3D513EEE-348B-4770-9D65-3C5F1320540C@gmail.com> <1442452F-518E-4DB2-852B-2ACA9CEBB869@gmail.com> To: Jonathan Morton X-Mailer: Apple Mail (2.3124) Subject: Re: [Make-wifi-fast] Flent test hardware X-BeenThere: make-wifi-fast@lists.bufferbloat.net X-Mailman-Version: 2.1.20 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 07 Nov 2017 00:32:23 -0000 --Apple-Mail=_09D44BE8-B57F-4CDB-B9A2-34F2D7AC7BE8 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 > On Nov 5, 2017, at 9:15 PM, Jonathan Morton = wrote: > Regarding single-core performance, the Sempron 3850 may be better in = that respect than my E-450, which itself is perfectly capable of = handling GigE at line rate (unlike an Atom). It has an improved core = design which may compensate for the lower clock speed. >=20 > There are three members of AMD's "cat" family: Bobcat, Jaguar, Puma. = My E-450 has Bobcat, which is the oldest and maxes out at 1.65 GHz = (there is no turbo clock). Various consoles got Jaguar cores, as did = the Kabini APU which forms your Sempron. The Puma core is the newest, = and appears in "Carrizo-L" APUs in last year's netbooks. The design = improvements between Bobcat and Jaguar are much larger than between = Jaguar and Puma; consult Agner Fog's optimisation manuals if you want to = know the gory details. >=20 > The AM1 Kabini family also includes the Athlon 5150 and 5350, which = are exactly the same as the 3850 except for clock speeds. If the 3850 = isn't fast enough, you can get an easy 50% boost from upgrading to the = 5350. >=20 These days it seems like most hardware is capable of GigE at line rate, = but the smoothness of individual flows or other factors may be affected = by CPU, either cores or speed. I wrote a script to run rrul_be_nflows = over the loopback adapter and vary a few things: - number of flows - number of CPU cores enabled - CPU frequency - whether or not to limit loopback to 2gbit (simulating GigE) This was an attempt to learn more about how the number of cores and CPU = frequency can affect the results, but because I don=E2=80=99t actually = have a box with two NICs and this uses loopback, it's just for learning, = and should be taken with a grain of salt. Here are a few of the results = from my MBP and Mini (six tabs of results): = https://docs.google.com/spreadsheets/d/1MVxGsreiGKNXhfkMIheNFrH_GVllFfiH9R= U5ws5l_aY/edit#gid=3D739582898 = Not too many surprises: - I saw more dramatic differences overall when going from 1-2 cores (not = good to test with only a single core) - other things being equal, more cores are capable of higher throughput, = but > 4 cores didn=E2=80=99t change much on my MBP - with higher flow counts, more cores can make individual flows a bit = smoother, and ping times sometimes lower and smoother - I=E2=80=99d rather have a dual-core 1.6 GHz Core 2 Duo than a = single-core 2.26 GHz Core 2 Duo, but because my hardware doesn=E2=80=99t = let me lower CPU frequency to half of maximum, I couldn=E2=80=99t test a = doubling of frequency vs going from 1-2 cores, which would be = interesting to me (anyone else with hardware that can do that care to = try?) If anyone wants to run the script (takes a little modification), it=E2=80=99= s here: = https://drive.google.com/file/d/1oasB3ePDMMr1WdRi_8koQmRO_qUjVB52/view?usp= =3Dsharing = As for the hardware, running everything on one box looks more = complicated than I thought, not just because of Flent, but because of = the scripts I wrote for setup and teardown and what they do. So now = I=E2=80=99m thinking of taking one for the team and getting two boxes to = avoid the hassle. I=E2=80=99ll still see. Anyway, I added some ASUS / Intel builds for more on the high end: = https://docs.google.com/spreadsheets/d/1MVxGsreiGKNXhfkMIheNFrH_GVllFfiH9R= U5ws5l_aY/edit#gid=3D199471438 = If I ever see a comparison between the effects of doubling clock speed = or going from 2-4 cores, it might help decide on the CPU.= --Apple-Mail=_09D44BE8-B57F-4CDB-B9A2-34F2D7AC7BE8 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=utf-8
On Nov 5, 2017, at 9:15 PM, Jonathan Morton <chromatix99@gmail.com> wrote:

Regarding single-core performance, the Sempron = 3850 may be better in that respect than my E-450, which itself is = perfectly capable of handling GigE at line rate (unlike an Atom).  = It has an improved core design which may compensate for the lower clock = speed.

There are three members of AMD's = "cat" family: Bobcat, Jaguar, Puma.  My E-450 has Bobcat, which is = the oldest and maxes out at 1.65 GHz (there is no turbo clock).  = Various consoles got Jaguar cores, as did the Kabini APU which forms = your Sempron.  The Puma core is the newest, and appears in = "Carrizo-L" APUs in last year's netbooks.  The design improvements = between Bobcat and Jaguar are much larger than between Jaguar and Puma; = consult Agner Fog's optimisation manuals if you want to know the gory = details.

The AM1 Kabini family also = includes the Athlon 5150 and 5350, which are exactly the same as the = 3850 except for clock speeds.  If the 3850 isn't fast enough, you = can get an easy 50% boost from upgrading to the = 5350.

These = days it seems like most hardware is capable of GigE at line rate, but = the smoothness of individual flows or other factors may be affected by = CPU, either cores or speed. I wrote a script to run rrul_be_nflows over = the loopback adapter and vary a few things:

- number of flows
- number of CPU cores = enabled
- CPU frequency
- = whether or not to limit loopback to 2gbit (simulating GigE)

This was an = attempt to learn more about how the number of cores and CPU frequency = can affect the results, but because I don=E2=80=99t actually have a box = with two NICs and this uses loopback, it's just for learning, and should = be taken with a grain of salt. Here are a few of the results from my MBP = and Mini (six tabs of results):


Not too many surprises:
- I saw more dramatic differences overall when going from 1-2 = cores (not good to test with only a single core)
- = other things being equal, more cores are capable of higher throughput, = but > 4 cores didn=E2=80=99t change much on my MBP
- with higher flow counts, more cores can make individual = flows a bit smoother, and ping times sometimes lower and = smoother
- I=E2=80=99d rather have a dual-core 1.6 = GHz Core 2 Duo than a single-core 2.26 GHz Core 2 Duo, but because my = hardware doesn=E2=80=99t let me lower CPU frequency to half of maximum, = I couldn=E2=80=99t test a doubling of frequency vs going from 1-2 cores, = which would be interesting to me (anyone else with hardware that can do = that care to try?)

If anyone wants to run the script (takes a little = modification), it=E2=80=99s here:


As for the hardware, running everything on one box looks more = complicated than I thought, not just because of Flent, but because of = the scripts I wrote for setup and teardown and what they do. So now = I=E2=80=99m thinking of taking one for the team and getting two boxes to = avoid the hassle. I=E2=80=99ll still see.

Anyway, I added some ASUS / Intel = builds for more on the high end:


If I ever see a comparison between the = effects of doubling clock speed or going from 2-4 cores, it might help = decide on the CPU.
= --Apple-Mail=_09D44BE8-B57F-4CDB-B9A2-34F2D7AC7BE8--