From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-bw0-f43.google.com (mail-bw0-f43.google.com [209.85.214.43]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority" (verified OK)) by huchra.bufferbloat.net (Postfix) with ESMTPS id F0F0B20218A for ; Tue, 30 Aug 2011 17:32:03 -0700 (PDT) Received: by bkbzv15 with SMTP id zv15so293761bkb.16 for ; Tue, 30 Aug 2011 17:32:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=nlqotBQ5aRBsNiMX+5KePnXrxofmzxw1IRGmboUEFJM=; b=SzIK1Wc43YYzE5IqaNVeAWiuz+Iu4Dp+fItkUmckfQcBMHbMGpopk6zjHwQRsJq/bs VVl5J1xChxxC2aweTwqPUeIUexqv1PQMdqD0z1YSrAC6nryIXcdfYrL+ju8isQ6iI4IV CDq1VEzc3Si2XOkUXWYQqu0TrRHIC6EdeziKU= MIME-Version: 1.0 Received: by 10.223.98.130 with SMTP id q2mr2322410fan.137.1314750720988; Tue, 30 Aug 2011 17:32:00 -0700 (PDT) Received: by 10.152.40.194 with HTTP; Tue, 30 Aug 2011 17:32:00 -0700 (PDT) Date: Tue, 30 Aug 2011 17:32:00 -0700 Message-ID: Subject: oprofiling is much saner looking now with rc6-smoketest From: Dave Taht To: bloat-devel Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-BeenThere: bloat-devel@lists.bufferbloat.net X-Mailman-Version: 2.1.13 Precedence: list List-Id: "Developers working on AQM, device drivers, and networking stacks" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 31 Aug 2011 00:32:04 -0000 I get about 190Mbit/sec from netperf now, on GigE, with oprofiling enabled, driver buffers of 4, txqueue of 8, cerowrt default iptables rules, AND web10g patched into kernel 3.0.3. This is much saner than rc3, and judging from the csum_partial and copy_user being roughly equal, there isn't much left to be gained... Nice work. (Without oprofiling, and without web10g and with tcp cubic I can get past 250Mbit) CPU: MIPS 24K, speed 0 MHz (estimated) Counted INSTRUCTIONS events (Instructions completed) with a unit mask of 0x00 (No unit mask) count 100000 samples % app name symbol name ---------------------------------------------------------------------------= ---- 17277 13.8798 vmlinux csum_partial 17277 100.000 vmlinux csum_partial [self] ---------------------------------------------------------------------------= ---- 16607 13.3415 vmlinux __copy_user 16607 100.000 vmlinux __copy_user [self] ---------------------------------------------------------------------------= ---- 11913 9.5705 ip_tables /ip_tables 11913 100.000 ip_tables /ip_tables [self] ---------------------------------------------------------------------------= ---- 8949 7.1893 nf_conntrack /nf_conntrack 8949 100.000 nf_conntrack /nf_conntrack [self] In this case I was going from laptop - gige - through another rc6-smoketest router - to_this_box's internal lan port. It bugs me that iptables and conntrack eat so much cpu for what is an internal-only connection, e.g. one that doesn't need conntracking. That said, I understand that people like their statistics, and me, I'm trying to make split-tcp work better, ultimately, one day.... I'm going to rerun this without the fw rules next. --=20 Dave T=E4ht SKYPE: davetaht US Tel: 1-239-829-5608 http://the-edge.blogspot.com