From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from cassarossa.samfundet.no (cassarossa.samfundet.no [IPv6:2001:67c:29f4::29]) by lists.bufferbloat.net (Postfix) with ESMTPS id BDC053B2D4 for ; Tue, 5 Jan 2016 18:18:03 -0500 (EST) Received: from yesthisis.dog ([2001:67c:29f4::3007] helo=pannekake.samfundet.no ident=unknown) by cassarossa.samfundet.no with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.84) (envelope-from ) id 1aGarX-000331-7u for bloat@lists.bufferbloat.net; Wed, 06 Jan 2016 00:17:59 +0100 Received: from sesse by pannekake.samfundet.no with local (Exim 4.84) (envelope-from ) id 1aGarX-0002tY-5V for bloat@lists.bufferbloat.net; Wed, 06 Jan 2016 00:17:59 +0100 Date: Wed, 6 Jan 2016 00:17:59 +0100 From: "Steinar H. Gunderson" To: bloat@lists.bufferbloat.net Message-ID: <20160105231759.GA8545@sesse.net> References: <165DB6E9-A6DD-4626-B6AA-E1B2DBA0B5FA@gmail.com> <568C1209.5050403@taht.net> <20160105192930.GA7803@sesse.net> <568C1B5E.1070008@taht.net> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <568C1B5E.1070008@taht.net> X-Operating-System: Linux 4.3.0 on a x86_64 User-Agent: Mutt/1.5.23 (2014-03-12) Subject: Re: [Bloat] Hardware upticks X-BeenThere: bloat@lists.bufferbloat.net X-Mailman-Version: 2.1.20 Precedence: list List-Id: General list for discussing Bufferbloat List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 05 Jan 2016 23:18:03 -0000 On Tue, Jan 05, 2016 at 11:37:02AM -0800, Dave Täht wrote: >> Anyway, the biggest cost of a context switch isn't necessarily the time used >> to set up registers and such. It's increased L1 pressure; your CPU is now >> running different code and looking at (largely) different data. > A L1/L2 Icache dedicated to interrupt processing code could make a great > deal of difference, if only cpu makers and benchmarkers would make > CS time something we valued. > > Dcache, not so much, except for the intel architectures which are now > doing DMA direct to cache. (any arms doing that?) Note that I'm saying L1 pressure, not “bad choice of what to keep in L1”. If you dedicate L1i space to interrupt processing code (which includes, presumably, large parts of your TCP/IP stack?), you will have less for your normal userspace, and I'd like to see some very hard data on that being a win before I'll believe it at face value. In a sense, if you tie your interrupts to a dedicated core, you get exactly this, though. /* Steinar */ -- Homepage: https://www.sesse.net/