From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wg0-f41.google.com (mail-wg0-f41.google.com [74.125.82.41]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority" (verified OK)) by huchra.bufferbloat.net (Postfix) with ESMTPS id C562321F0CC for ; Wed, 15 Aug 2012 21:08:52 -0700 (PDT) Received: by wgbds1 with SMTP id ds1so244310wgb.4 for ; Wed, 15 Aug 2012 21:08:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; bh=T5VdCVyfPcPR9xWsWcc3oDU8pEsHqbB0dNXhR6Kw5D4=; b=0gohdWWkM+cC8LdOFXujBa2iya2gYS3fnBqZhtY1989Z01uct8Nok0KmJuvYlVRiy/ O49EQY/nbsPKc7M9/euamABJWgG922IabVHrtg8O/xm5JocaNSZMI2OcdWc3+a+JRbN/ BhVr0JPQE69pK7lwWEKRVgHgA/0PaIGJ5qDnB5ds9o06Z06atG+xjiPUmP7SJqYeFJaX hBZUEoc5eDcylP/rOqGoxMnSBHNImYNQ1DfFd5BgKlcJ6AC4iKmFXyv3xcXzXLmGhUDR w2zT3xo2BjrauB3wCt+kg60UGmX3AsAc9Rs844K8ZBkEJxCK7NIsYMbTwPXPuQrlAe3p u76Q== MIME-Version: 1.0 Received: by 10.216.71.82 with SMTP id q60mr12001959wed.64.1345090130222; Wed, 15 Aug 2012 21:08:50 -0700 (PDT) Received: by 10.223.143.69 with HTTP; Wed, 15 Aug 2012 21:08:50 -0700 (PDT) In-Reply-To: <36D61FDC-9AA9-46CC-ACBB-2D28B250C660@gmx.de> References: <36D61FDC-9AA9-46CC-ACBB-2D28B250C660@gmx.de> Date: Wed, 15 Aug 2012 21:08:50 -0700 Message-ID: From: Dave Taht To: Sebastian Moeller Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Cc: cerowrt-devel@lists.bufferbloat.net Subject: Re: [Cerowrt-devel] cerowrt 3.3.8-17 is released X-BeenThere: cerowrt-devel@lists.bufferbloat.net X-Mailman-Version: 2.1.13 Precedence: list List-Id: Development issues regarding the cerowrt test router project List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 16 Aug 2012 04:08:53 -0000 re: ath: skbuff alloc of size 1926 failed as for the ath skbuff problem, I've seen that a lot. I had put hard packet limits (~600) on fq_codel in -11 and prior that were too low and it mostly went away, but I hit tail drop behavior everywhere, instead of codel behavior. What I have now (typically 1200) may well be too high, but not as overly high as the default (10k packets). There may be another means of increasing the size of that slab pool or making it less onerous. I would like it if codel "kicked in" earlier than it currently does. The code in ns2 is currently using half the period that the linux code is. This would control things better, or so I hope (planning on trying this as I get time) I am also considering means of artificially upscaling the drop scheduler when we get close to queue limits. See some discussions on the codel list for these issues. (sims are easier to deal with than cerowrt, too!) as for bind, it should be automagically restarted from xinetd, no need to fiddle with anything. However, since you are already under massive memory pressure, it may well fail to start up that way, too. At the moment, I've largely given up on bind on anything but a more core home gw, and am running dnsmasq on everything (3700v2, picostations, nanostations) but the 3800s. (and the ones I run it on, aren't being used for wifi right now). Lastly: Swap space won't help you on exhausting kernel limits. I'm glad you can reproduce the ath: slab problem - I can get it too at high rates using netperf over wifi. I will try a 3700v2 with and without bind to see if it's still there in 3.3.8-17. In the meantime if anyone knows how to get more allocations in that (2048? 4096?) slab by default, perhaps that will help? On Wed, Aug 15, 2012 at 10:23 AM, Sebastian Moeller wrote= : > Hi Dave, > > great work, as always I upgraded my production router to the latest and g= reatest (since I only have one router=85). And it works quite well for norm= al usage=85 > Netalyzr reports around 2800ms seconds of uplink buffering, yet saturatin= g the uplink does not affect ping times to a remote target noticeably, basi= cally the same as for all codellized ceo versions I tested so far... > > Some notes and a question: > I noticed that even given plenty of swap space (1GB on a usb stick), usin= g http://broadband.mpi-sws.org/residential/ to exercise UDP stress (on the = uplink I assume) I can easily produce (I run the test from a macosx via 5GH= z wireless over 1.5 yards): > Aug 15 01:16:29 nacktmulle kern.err kernel: [175395.132812] ath: skbuff a= lloc of size 1926 failed > (and plenty of those=85). > What then happens is that the OOM killer will aim for bind (reasonable si= nce it is the largest single process) and kill it. When I try to restart bi= nd by: > root@nacktmulle:~# /etc/rc.d/S47namedprep start > root@nacktmulle:~# /etc/rc.d/S48named restart > Stopping isc-bind > /etc/chroot/named//var/run/named/named.pid not found, trying brute forc= e > killall: named: no process killed > Kicking isc-bind in xinetd > rndc: connect failed: 127.0.0.1#953: connection refused > And bind does not start again and the router becomes less than useful. No= w I assume I am doing something wrong, but what, if you have any idea how t= o solve this short of a reboot of the router (my current method) I would be= happy to learn > > > > best regards > sebastian > > On Aug 12, 2012, at 11:08 PM, Dave Taht wrote: > >> I'm too tired to write up a full set of release notes, but I've been >> testing it all day, >> and it looks better than -10 and certainly better than -11, but I won't = know >> until some more folk sit down and test it, so here it is. >> >> http://huchra.bufferbloat.net/~cero1/3.3/3.3.8-17/ >> >> fresh merge with openwrt, fix to a bind CVE, fixes for 6in4 and quagga >> routing problems, >> and a few tweaks to fq_codel setup that might make voip better. >> >> Go forth and break things! >> >> In other news: >> >> Van Jacobson gave a great talk about bufferbloat, BQL, codel, and fq_cod= el >> at last week's ietf meeting. Well worth watching. At the end he outlines >> the deployment problems in particular. >> >> http://recordings.conf.meetecho.com/Recordings/watch.jsp?recording=3DIET= F84_TSVAREA&chapter=3Dpart_3 >> >> Far more interesting than this email! >> >> >> -- >> Dave T=E4ht >> http://www.bufferbloat.net/projects/cerowrt/wiki - "3.3.8-17 is out >> with fq_codel!" >> _______________________________________________ >> Cerowrt-devel mailing list >> Cerowrt-devel@lists.bufferbloat.net >> https://lists.bufferbloat.net/listinfo/cerowrt-devel > --=20 Dave T=E4ht http://www.bufferbloat.net/projects/cerowrt/wiki - "3.3.8-17 is out with fq_codel!"